Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Scio | 2,505 | 37 | 3 months ago | 96 | November 21, 2023 | 142 | apache-2.0 | Scala | ||
A Scala API for Apache Beam and Google Cloud Dataflow. | ||||||||||
Seldon Server | 1,420 | 4 years ago | 44 | June 28, 2017 | 26 | apache-2.0 | Java | |||
Machine Learning Platform and Recommendation Engine built on Kubernetes | ||||||||||
Devops Python Tools | 709 | 4 months ago | 37 | mit | Python | |||||
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. | ||||||||||
Elasticluster | 334 | 3 | 9 months ago | 12 | October 22, 2014 | 182 | gpl-3.0 | Python | ||
Create clusters of VMs on the cloud and configure them with Ansible. | ||||||||||
Spark Bigquery Connector | 332 | 12 | 3 months ago | 24 | October 31, 2023 | 42 | apache-2.0 | Java | ||
BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables. | ||||||||||
Learning Hadoop And Spark | 160 | 6 months ago | apache-2.0 | HTML | ||||||
Companion to Learning Hadoop and Learning Spark courses on Linked In Learning | ||||||||||
De Zoomcamp Ui | 107 | 3 months ago | Python | |||||||
🎨 UI for the Free Data Engineering Zoomcamp 2023 Course provided by DataTalksClub | ||||||||||
Streamify | 97 | 2 years ago | Python | |||||||
A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more! | ||||||||||
Spark_gce | 45 | 9 years ago | 1 | apache-2.0 | Python | |||||
Spark GCE Script Helps you deploy Spark cluster on Google Cloud. | ||||||||||
Etlflow | 43 | 11 | 9 months ago | 37 | July 19, 2023 | apache-2.0 | Scala | |||
EtlFlow is an ecosystem of functional libraries in Scala based on ZIO for running complex Auditable workflows which can interact with Google Cloud Platform, AWS, Kubernetes, Databases, SFTP servers, On-Prem Systems and more. |