Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Superset | 57,496 | 21 | 4 days ago | 6 | April 18, 2023 | 1,764 | apache-2.0 | TypeScript | ||
Apache Superset is a Data Visualization and Data Exploration Platform | ||||||||||
Airflow | 33,219 | 320 | 2 months ago | 169 | November 27, 2023 | 890 | apache-2.0 | Python | ||
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows | ||||||||||
Cudf | 6,936 | 3 | 2 months ago | 31 | October 12, 2023 | 1,001 | apache-2.0 | C++ | ||
cuDF - GPU DataFrame Library | ||||||||||
Koalas | 3,291 | 1 | 16 | 6 months ago | 47 | October 19, 2021 | 112 | apache-2.0 | Python | |
Koalas: pandas API on Apache Spark | ||||||||||
Dataflowjavasdk | 853 | 249 | 14 | 3 years ago | 38 | June 26, 2018 | 54 | |||
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines. | ||||||||||
Incubator Liminal | 131 | 6 months ago | 28 | January 25, 2023 | 9 | apache-2.0 | Python | |||
Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow. | ||||||||||
Beyond Jupyter | 125 | 5 years ago | ||||||||
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References) | ||||||||||
Data_sciences_campaign | 96 | 2 months ago | 2 | Jupyter Notebook | ||||||
Data Scientist Course Series | ||||||||||
Sit742 | 72 | 5 months ago | 2 | mit | Jupyter Notebook | |||||
SIT742: Modern Data Science | ||||||||||
Kafka Streaming Click Analysis | 56 | 4 years ago | 2 | apache-2.0 | Jupyter Notebook | |||||
Use Kafka and Apache Spark streaming to perform click stream analytics |