Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Dagster | 9,467 | 2 | 133 | 8 months ago | 585 | December 07, 2023 | 2,343 | apache-2.0 | Python | |
An orchestration platform for the development, production, and observation of data assets. | ||||||||||
Mage Ai | 6,324 | 8 months ago | 314 | December 06, 2023 | 189 | apache-2.0 | Python | |||
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data. | ||||||||||
Cube Studio | 1,710 | 8 months ago | 1 | October 13, 2022 | 74 | other | Jupyter Notebook | |||
cube studio开源云原生一站式机器学习/深度学习AI平台,支持sso登录,多租户/多项目组,数据资产对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式算法训练,超参搜索,推理服务VGPU,多集群调度,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型一键微调,llmops,私有知识库,AI应用商店,支持模型一键开发/推理/微调,私有化部署,支持国产cpu/gpu/npu芯片,支持RDMA,支持pytorch/tf/mxnet/deepspeed/paddle/colossalai/horovod/spark/ray/volcano分布式 | ||||||||||
Mleap | 1,479 | 15 | 12 | 10 months ago | 26 | May 07, 2021 | 109 | apache-2.0 | Scala | |
MLeap: Deploy ML Pipelines to Production | ||||||||||
Digandburied | 645 | 8 years ago | 4 | GCC Machine Description | ||||||
挖坑与填坑 | ||||||||||
Goodreads_etl_pipeline | 593 | 5 years ago | mit | Python | ||||||
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. | ||||||||||
Keystone | 472 | 7 years ago | 5 | March 03, 2017 | 39 | apache-2.0 | Scala | |||
Simplifying robust end-to-end machine learning on Apache Spark. | ||||||||||
Sparkflow | 301 | a year ago | 13 | May 18, 2019 | 9 | mit | Python | |||
Easy to use library to bring Tensorflow on Apache Spark | ||||||||||
Koober | 301 | 7 years ago | 3 | Scala | ||||||
Sparktorch | 297 | a year ago | 11 | December 07, 2019 | 12 | mit | Python | |||
Train and run Pytorch models on Apache Spark. |