Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Optimus | 1,438 | 22 days ago | 32 | June 19, 2022 | 29 | apache-2.0 | Python | |||
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark | ||||||||||
Holoclean Legacy Deprecated | 74 | 6 years ago | 28 | apache-2.0 | Python | |||||
A Machine Learning System for Data Enrichment. | ||||||||||
Marshmallow Pyspark | 12 | 4 months ago | 5 | November 11, 2022 | apache-2.0 | Python | ||||
Marshmallow serializer integration with pyspark | ||||||||||
Pypandas | 6 | 6 years ago | n,ull | mit | Python | |||||
PyPandas, a data cleaning framework for Spark | ||||||||||
Sparklyclean | 6 | 4 years ago | mit | Scala | ||||||
Optimal distributed data deduplication and supervised learning pipeline using Apache Spark |