Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Easyml | 1,966 | 6 months ago | 47 | apache-2.0 | Java | |||||
Easy Machine Learning is a general-purpose dataflow-based system for easing the process of applying machine learning algorithms to real world tasks. | ||||||||||
Taier | 1,220 | 5 months ago | 2 | February 25, 2022 | 61 | apache-2.0 | Java | |||
Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display | ||||||||||
Goodreads_etl_pipeline | 593 | 4 years ago | mit | Python | ||||||
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. | ||||||||||
Big Whale | 290 | 6 months ago | 11 | apache-2.0 | Java | |||||
Spark、Flink等离线任务的调度以及实时任务的监控 | ||||||||||
Airflow Spark | 64 | 2 years ago | 6 | Python | ||||||
Docker with Airflow and Spark standalone cluster | ||||||||||
Fb_scraper | 52 | 5 years ago | 10 | apache-2.0 | Jupyter Notebook | |||||
FBLYZE is a Facebook scraping system and analysis system. | ||||||||||
Spark With Python My Learning Notes | 39 | 4 years ago | CSS | |||||||
ETL pipeline using pyspark (Spark - Python) | ||||||||||
Spark Native Yarn | 32 | 7 years ago | 6 | apache-2.0 | Scala | |||||
Tez port for Spark API | ||||||||||
Scattersphere | 30 | 6 years ago | 12 | apache-2.0 | Scala | |||||
Job Coordination API for Tasks | ||||||||||
Airflow Livy Operators | 17 | a year ago | 8 | August 27, 2021 | 3 | mit | Python | |||
Lets Airflow DAGs run Spark jobs via Livy: sessions and/or batches. |