Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Doris | 11,243 | | | | 8 months ago | 8 | September 27, 2023 | 2,332 | apache-2.0 | Java |
Apache Doris is an easy-to-use, high performance and unified analytics database. | ||||||||||
Dagster | 9,467 | 2 | 133 | 10 months ago | 585 | December 07, 2023 | 2,343 | apache-2.0 | Python | |
An orchestration platform for the development, production, and observation of data assets. | ||||||||||
Mage Ai | 6,324 | | | | 10 months ago | 314 | December 06, 2023 | 189 | apache-2.0 | Python |
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data. | ||||||||||
Aws Glue Samples | 1,334 | | | | a year ago | | | 37 | mit-0 | Python |
AWS Glue code samples | ||||||||||
Pyspark Example Project | 1,034 | | | | 2 years ago | | | 11 | | Python |
Example project implementing best practices for PySpark ETL jobs and applications. | ||||||||||
Goodreads_etl_pipeline | 593 | | | | 5 years ago | | | | mit | Python |
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. | ||||||||||
Aws Glue Libs | 568 | | | | a year ago | | | 96 | other | Python |
AWS Glue Libraries are additions and enhancements to Spark for ETL operations. | ||||||||||
Metorikku | 536 | | | | 2 years ago | 126 | February 27, 2023 | 65 | mit | Scala |
A simplified, lightweight ETL Framework based on Apache Spark | ||||||||||
Zdh_web | 379 | | | | a year ago | | | 19 | apache-2.0 | Java |
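A big data collection and extraction platform. zdh_web is the visual management platform for the zdh family of services, with modules for data collection, scheduling, permissions, approval workflows, private-domain marketing, and more. | ||||||||||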
Big_data_architect_skills | 353 | 5 years ago | 1 | |||||||
The skills a big data architect should master |