Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Drill | 1,856 | 23 | 16 | 3 months ago | 24 | April 19, 2023 | 100 | apache-2.0 | Java | |
Apache Drill is a distributed MPP query layer for self describing data | ||||||||||
Gaffer | 1,724 | 4 | 31 | 3 months ago | 101 | November 14, 2023 | 142 | apache-2.0 | Java | |
A large-scale entity and relation database supporting aggregation of properties | ||||||||||
Devops Python Tools | 709 | 3 months ago | 37 | mit | Python | |||||
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. | ||||||||||
Iceberg | 409 | 3 years ago | 27 | apache-2.0 | Java | |||||
Iceberg is a table format for large, slow-moving tabular data | ||||||||||
Parquet4s | 267 | 6 | 3 months ago | 57 | November 12, 2023 | 6 | mit | Scala | ||
Read and write Parquet in Scala. Use Scala classes as schema. No need to start a cluster. | ||||||||||
Parquet Go | 228 | 32 | 2 years ago | 44 | August 18, 2022 | 13 | apache-2.0 | Go | ||
Go package to read and write parquet files. parquet is a file format to store nested data structures in a flat columnar data format. It can be used in the Hadoop ecosystem and with tools such as Presto and AWS Athena. | ||||||||||
Bigdata Playground | 154 | 5 years ago | 4 | apache-2.0 | TypeScript | |||||
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL | ||||||||||
Eel Sdk | 140 | 1 | 17 | 3 years ago | 103 | February 11, 2019 | 25 | apache-2.0 | Scala | |
Big Data Toolkit for the JVM | ||||||||||
Parquet Rs | 129 | 5 years ago | 27 | apache-2.0 | Rust | |||||
Apache Parquet implementation in Rust | ||||||||||
Streamx | 95 | 5 years ago | 26 | apache-2.0 | Java | |||||
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3) |