Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Airbyte | 12,918 | 11 | 2 months ago | 311 | December 08, 2023 | 5,111 | other | Python | ||
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted. | ||||||||||
Roadtools | 1,540 | 2 | 2 months ago | 22 | December 05, 2023 | 10 | mit | Python | ||
A collection of Azure AD tools for offensive and defensive security purposes | ||||||||||
Datakit | 1,044 | 2 years ago | 34 | apache-2.0 | OCaml | |||||
Connect processes into powerful data pipelines with a simple git-like filesystem interface | ||||||||||
Neumai | 693 | 2 months ago | 7 | apache-2.0 | Python | |||||
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. | ||||||||||
Paddleocr2pytorch | 553 | a year ago | 45 | apache-2.0 | Python | |||||
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR) | ||||||||||
Versatile Data Kit | 389 | 25 | 2 months ago | 181 | November 28, 2023 | 220 | apache-2.0 | Python | ||
One framework to develop, deploy and operate data workflows with Python and SQL. | ||||||||||
Marcel | 326 | 2 months ago | 131 | November 15, 2023 | 5 | gpl-3.0 | Python | |||
A modern shell | ||||||||||
Data Engineering With Python | 302 | a year ago | 1 | mit | Python | |||||
Data Engineering with Python, published by Packt | ||||||||||
Yuniql | 292 | 1 | 7 | 2 years ago | 25 | May 25, 2022 | 65 | apache-2.0 | C# | |
Free and open source schema versioning and database migration made natively with .NET/6. NEW THIS MAY 2022! v1.3.15 released! | ||||||||||
Logrange | 192 | a year ago | 12 | February 05, 2021 | 15 | apache-2.0 | Go | |||
High performance data aggregating storage |