Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Deequ | 3,044 | 6 | 3 months ago | 37 | November 09, 2023 | 141 | apache-2.0 | Scala | ||
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. | ||||||||||
Spark Cassandra Connector | 1,929 | 109 | 22 | 3 months ago | 81 | April 08, 2021 | 25 | apache-2.0 | Scala | |
DataStax Connector for Apache Spark to Apache Cassandra | ||||||||||
Petastorm | 1,693 | 8 | 5 months ago | 86 | February 03, 2023 | 174 | apache-2.0 | Python | ||
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code. | ||||||||||
Spark Py Notebooks | 1,515 | a year ago | 9 | other | Jupyter Notebook | |||||
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks | ||||||||||
Mobius | 937 | 6 | 3 months ago | 22 | January 29, 2017 | 88 | mit | C# | ||
C# and F# language binding and extensions to Apache Spark | ||||||||||
Spark Movie Lens | 757 | 3 years ago | 10 | other | Jupyter Notebook | |||||
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset | ||||||||||
Cdap | 735 | 56 | 3 months ago | 23 | September 01, 2023 | 98 | other | Java | ||
An open source framework for building data analytic applications. | ||||||||||
Machinelearning | 684 | 5 years ago | 1 | Python | ||||||
Machine learning resources,including algorithm, paper, dataset, example and so on. | ||||||||||
Complete Life Cycle Of A Data Science Project | 499 | 3 months ago | 4 | mit | ||||||
Complete-Life-Cycle-of-a-Data-Science-Project | ||||||||||
Whylogs Java | 179 | 2 | 3 years ago | 5 | November 01, 2020 | 2 | apache-2.0 | Java | ||
Profile and monitor your ML data pipeline end-to-end |