Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Cylon | 286 | 4 months ago | 163 | apache-2.0 | C++ | |||||
Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame. | ||||||||||
How I Extracted Ted Talks For Parallel Corpus | 22 | 7 years ago | 1 | apache-2.0 | Jupyter Notebook | |||||
Glide | 19 | 2 years ago | 45 | April 29, 2022 | mit | Python | ||||
Easy ETL | ||||||||||
Pdf2dataset | 8 | 4 years ago | 15 | September 13, 2020 | 9 | apache-2.0 | Python | |||
Converts a whole subdirectory with a big (or small) volume of PDF documents to a dataset (pandas DataFrame) with error tracking and choice of features | ||||||||||
Pygdelt | 7 | 3 years ago | 1 | May 10, 2018 | mit | Python | ||||
An easy to use interface for GDELT datasets |