Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Alluxio | 6,612 | 31 | 53 | a month ago | 73 | November 29, 2023 | 969 | apache-2.0 | Java | |
Alluxio, data orchestration for analytics and machine learning in the cloud | ||||||||||
Spark Py Notebooks | 1,515 | a year ago | 9 | other | Jupyter Notebook | |||||
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks | ||||||||||
Optimus | 1,446 | 18 days ago | 32 | June 19, 2022 | 29 | apache-2.0 | Python | |||
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark | ||||||||||
Scriptis | 767 | 2 years ago | 22 | apache-2.0 | Vue | |||||
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis. | ||||||||||
Data Science With Ruby | 664 | 9 months ago | 1 | cc0-1.0 | Ruby | |||||
Practical Data Science with Ruby based tools. | ||||||||||
Wedatasphere | 624 | 3 months ago | 24 | |||||||
WeDataSphere is a financial grade, one-stop big data platform suite. | ||||||||||
Onedal | 584 | 4 | 3 months ago | 9 | April 18, 2023 | 52 | apache-2.0 | C++ | ||
oneAPI Data Analytics Library (oneDAL) | ||||||||||
Complete Life Cycle Of A Data Science Project | 499 | 4 months ago | 4 | mit | ||||||
Complete-Life-Cycle-of-a-Data-Science-Project | ||||||||||
Popmon | 461 | 2 | 9 months ago | 36 | July 18, 2023 | 15 | mit | Python | ||
Monitor the stability of a Pandas or Spark dataframe ⚙︎ | ||||||||||
Zat | 414 | 1 | 4 months ago | 11 | January 26, 2023 | 10 | mit | Jupyter Notebook | ||
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark |