Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Ibis | 3,404 | 24 | 29 | 4 months ago | 68 | December 10, 2023 | 157 | apache-2.0 | Python | |
The flexibility of Python with the scale and performance of modern SQL. | ||||||||||
Eat_pyspark_in_10_days | 534 | 2 years ago | 1 | Python | ||||||
pyspark🍒🥭 is delicious,just eat it!😋😋 | ||||||||||
Pandapy | 483 | 3 years ago | 22 | January 25, 2020 | 2 | Python | ||||
PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x faster (by @firmai) | ||||||||||
Datacompy | 339 | 10 | 4 months ago | 20 | November 15, 2023 | 16 | apache-2.0 | Python | ||
Pandas and Spark DataFrame comparison for humans and more! | ||||||||||
Sparklingpandas | 338 | 1 | 7 years ago | 7 | August 08, 2015 | 51 | apache-2.0 | Python | ||
Sparkling Pandas | ||||||||||
Hunter | 170 | 3 years ago | mit | Jupyter Notebook | ||||||
A threat hunting / data analysis environment based on Python, Pandas, PySpark and Jupyter Notebook. | ||||||||||
Handyspark | 129 | 5 years ago | 7 | May 19, 2019 | 8 | mit | Jupyter Notebook | |||
HandySpark - bringing pandas-like capabilities to Spark dataframes | ||||||||||
Pypmml | 64 | 6 | a year ago | 15 | November 03, 2022 | 4 | apache-2.0 | Python | ||
Python PMML scoring library | ||||||||||
Cuallee | 56 | 1 | 4 months ago | 54 | October 28, 2023 | 2 | apache-2.0 | Python | ||
A data quality acceleration library to get data sets verified in a friendly interface | ||||||||||
Learn Data Munging | 37 | 4 months ago | mit | Jupyter Notebook | ||||||
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc. |