Spark Featureselection

Feature selection methods as Spark MLlib Pipelines
Alternatives To Spark Featureselection
Project Name, Stars, Downloads, Repos Using This, Packages Using This, Most Recent Commit, Total Releases, Latest Release, Open Issues, License, Language
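Dagster. Stars: 9,467; Downloads / Repos Using This / Packages Using This: 2133; Most Recent Commit: 3 months ago; Total Releases: 585; Latest Release: December 07, 2023; Open Issues: 2,343; License: apache-2.0; Language: Python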
An orchestration platform for the development, production, and observation of data assets.
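Mage Ai. Stars: 6,324; Most Recent Commit: 3 months ago; Total Releases: 314; Latest Release: December 06, 2023; Open Issues: 189; License: apache-2.0; Language: Python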
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
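Transmogrifai. Stars: 2,099; Downloads / Repos Using This / Packages Using This: 3; Most Recent Commit: 2 years ago; Total Releases: 9; Latest Release: June 11, 2020; Open Issues: 44; License: bsd-3-clause; Language: Scala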
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning
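Cube Studio. Stars: 1,710; Most Recent Commit: 3 months ago; Total Releases: 1; Latest Release: October 13, 2022; Open Issues: 74; License: other; Language: Jupyter Notebook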
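Cube Studio is an open-source, cloud-native, one-stop machine learning / deep learning AI platform. It supports SSO login, multi-tenancy and multiple project groups, data asset integration, online notebook development, drag-and-drop pipeline orchestration of task flows, multi-node multi-GPU distributed training, hyperparameter search, inference services with VGPU, multi-cluster scheduling, edge computing, serverless, an annotation platform, automated annotation, dataset management, one-click fine-tuning of large models, llmops, a private knowledge base, an AI app store, one-click model development/inference/fine-tuning, private deployment, domestic (Chinese) CPU/GPU/NPU chips, RDMA, and distributed pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano.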
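Mleap. Stars: 1,479; Downloads / Repos Using This / Packages Using This: 1512; Most Recent Commit: 5 months ago; Total Releases: 26; Latest Release: May 07, 2021; Open Issues: 109; License: apache-2.0; Language: Scala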
MLeap: Deploy ML Pipelines to Production
Digandburied. Stars: 645; Most Recent Commit: 8 years ago; Open Issues: 4; Language: GCC Machine Description
Digging holes and filling them in
Goodreads_etl_pipeline. Stars: 593; Most Recent Commit: 4 years ago; License: mit; Language: Python
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
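Keystone. Stars: 472; Most Recent Commit: 7 years ago; Total Releases: 5; Latest Release: March 03, 2017; Open Issues: 39; License: apache-2.0; Language: Scala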
Simplifying robust end-to-end machine learning on Apache Spark.
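Sparkflow. Stars: 301; Most Recent Commit: 6 months ago; Total Releases: 13; Latest Release: May 18, 2019; Open Issues: 9; License: mit; Language: Python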
Easy to use library to bring Tensorflow on Apache Spark
Koober. Stars: 301; Most Recent Commit: 6 years ago; Open Issues: 3; Language: Scala
Categories: Scala, Pipeline, Spark, Probability