Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Goodreads_etl_pipeline | 593 | 4 years ago | mit | Python | ||||||
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform. | ||||||||||
Agile_data_code_2 | 435 | a year ago | 7 | mit | Jupyter Notebook | |||||
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition | ||||||||||
Pyjaws | 36 | 7 months ago | 3 | mit | Python | |||||
PyJaws: A Pythonic Way to Define Databricks Jobs and Workflows | ||||||||||
Stock_streaming_pipeline_project | 5 | 8 months ago | Python | |||||||
Built a real-time streaming pipeline to extract stock data, using Apache Nifi, Debezium, Kafka, and Spark Streaming. Loaded the transformed data into Glue database and created real-time dashboards using Power BI and Tableau with Athena. The pipeline is orchestrated using Airflow. |