Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Kglab | 518 | 3 | 7 months ago | 27 | April 20, 2022 | 34 | mit | Jupyter Notebook | ||
Graph Data Science: an abstraction layer in Python for building knowledge graphs, integrated with popular graph libraries – atop Pandas, NetworkX, RAPIDS, RDFlib, pySHACL, PyVis, morph-kgc, pslpython, pyarrow, etc. | ||||||||||
Pystore | 404 | 1 | 1 | 2 years ago | 35 | February 11, 2022 | 21 | apache-2.0 | Python | |
Fast data store for Pandas time-series data | ||||||||||
D6tstack | 166 | 2 | 2 years ago | 11 | July 30, 2018 | 15 | mit | Jupyter Notebook | ||
Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet | ||||||||||
S3parq | 15 | | | | 2 years ago | 23 | April 14, 2022 | 12 | mit | Python |
Parquet file management in S3 for Athena / Spectrum / Presto partitioning | ||||||||||
Panel Geodashboard Twitter | 14 | | | | 10 months ago | | | | cc-by-4.0 | Python |
A simple Panel-based dashboard visualizing geotagged tweets with hvplot and Datashader. | ||||||||||
Pyspark Dataframe Made Easy | 10 | | | | 2 years ago | | | | | Jupyter Notebook |
PySpark DataFrame made easy | ||||||||||
Typed Dfs | 8 | | | | 7 months ago | 34 | October 24, 2023 | | apache-2.0 | Python |
Make Pandas DataFrames enforce definitions, self-organize, and correctly serialize in 18 formats. | ||||||||||
Pdf2dataset | 8 | | | | 4 years ago | 15 | September 13, 2020 | 9 | apache-2.0 | Python |
Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features | ||||||||||
Geni Performance Benchmark | 6 | | | | 4 years ago | | | | apache-2.0 | Clojure |
Pandasglue | 5 | | | | 5 years ago | | | | apache-2.0 | Python |
Productivity for your Data Lake |