Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Iceberg | 5,179 | 3 months ago | 3 | October 29, 2022 | 1,485 | apache-2.0 | Java | |||
Apache Iceberg | ||||||||||
Gaffer | 1,724 | 4 | 31 | 3 months ago | 101 | November 14, 2023 | 142 | apache-2.0 | Java | |
A large-scale entity and relation database supporting aggregation of properties | ||||||||||
Petastorm | 1,693 | 8 | 5 months ago | 86 | February 03, 2023 | 174 | apache-2.0 | Python | ||
Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code. | ||||||||||
Adam | 966 | 20 | 17 | 3 months ago | 14 | December 16, 2020 | 35 | apache-2.0 | Scala | |
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed. | ||||||||||
Devops Python Tools | 709 | 3 months ago | 37 | mit | Python | |||||
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc. | ||||||||||
Iceberg | 409 | 3 years ago | 27 | apache-2.0 | Java | |||||
Iceberg is a table format for large, slow-moving tabular data | ||||||||||
Spindle | 333 | 9 years ago | 2 | apache-2.0 | JavaScript | |||||
Next-generation web analytics processing with Scala, Spark, and Parquet. | ||||||||||
Rumble | 194 | a year ago | 4 | December 03, 2019 | 134 | other | Java | |||
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more | ||||||||||
Spark Programming Guide Zh Cn | 188 | a year ago | other | |||||||
Spark 编程指南简体中文版 | ||||||||||
Parquet Index | 113 | 3 years ago | 16 | apache-2.0 | Scala | |||||
Spark SQL index for Parquet tables |