Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Orchest | 3,876 | a year ago | 19 | December 13, 2022 | 125 | apache-2.0 | TypeScript | |||
Build data pipelines, the easy way 🛠️ | ||||||||||
Open Semantic Search | 741 | a year ago | 187 | gpl-3.0 | Shell | |||||
Open Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph) | ||||||||||
Redun | 464 | 1 | 6 months ago | 18 | November 12, 2023 | 28 | apache-2.0 | Python | ||
Yet another redundant workflow engine | ||||||||||
Abc | 455 | 7 months ago | 29 | apache-2.0 | Go | |||||
Power of appbase.io via CLI, with nifty imports from your favorite data sources | ||||||||||
Neosync | 413 | 4 months ago | 38 | mit | TypeScript | |||||
A developer-first way to create high-fidelity synthetic data or anonymize sensitive data and sync it across all environments for testing, fine-tuning or model training. | ||||||||||
Smooks | 377 | 14 | 4 months ago | 5 | June 19, 2023 | 19 | other | Java | ||
Extensible data integration Java framework for building XML and non-XML fragment-based applications | ||||||||||
Beginner_de_project | 276 | a year ago | 1 | mit | HCL | |||||
Beginner data engineering project - batch edition | ||||||||||
Usaspending Api | 273 | 4 months ago | 59 | cc0-1.0 | Python | |||||
Server application to serve U.S. federal spending data via a RESTful API | ||||||||||
Etl | 135 | 5 months ago | 188 | other | Java | |||||
LinkedPipes ETL is an RDF based, lightweight ETL tool | ||||||||||
Aws Ecs Airflow | 110 | 3 years ago | 6 | mit | HCL | |||||
Run Airflow in AWS ECS(Elastic Container Service) using Fargate tasks |