Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Devops Python Tools | 709 | | | | 3 months ago | | | 37 | mit | Python |
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. | ||||||||||
Data Engineering Interview Questions | 554 | | | | 6 months ago | | | | | |
More than 2,000 data engineering interview questions. | ||||||||||
Marmaray | 444 | | | | 2 years ago | | | 14 | other | Java |
Generic Data Ingestion & Dispersal Library for Hadoop | ||||||||||
Iceberg | 409 | | | | 3 years ago | | | 27 | apache-2.0 | Java |
Iceberg is a table format for large, slow-moving tabular data | ||||||||||
Bigdata Playground | 154 | | | | 5 years ago | | | 4 | apache-2.0 | TypeScript |
A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, NodeJS, Angular, GraphQL | ||||||||||
Eel Sdk | 140 | | 1 | 17 | 3 years ago | 103 | February 11, 2019 | 25 | apache-2.0 | Scala |
Big Data Toolkit for the JVM | ||||||||||
Avro Hadoop Starter | 111 | | | | 8 years ago | | | | other | Java |
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data. | ||||||||||
Camus | 87 | | | | 10 months ago | | | 6 | apache-2.0 | Java |
Mirror of LinkedIn's Camus | ||||||||||
Hdfs2cass | 75 | | | | 2 years ago | | | 6 | apache-2.0 | Java |
Hadoop MapReduce job to bulk load data into Cassandra | ||||||||||
Avro Maven Plugin | 34 | | | | 13 years ago | | | 2 | apache-2.0 | Java |
Maven 2 Plugin for processing Apache Avro files. Avro is a subproject of Apache Hadoop. | ||||||||||