Project Name	Stars	Repos Using This	Packages Using This	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
Iceberg	5,179			3 months ago	3	October 29, 2022	1,485	apache-2.0	Java
Apache Iceberg
Parquet Mr	2,296	259	208	3 months ago	17	May 12, 2023	133	apache-2.0	Java
Apache Parquet
Drill	1,856	23	16	3 months ago	24	April 19, 2023	100	apache-2.0	Java
Apache Drill is a distributed MPP query layer for self describing data
Influxdb_iox	1,805			7 months ago	4	March 16, 2023	494	apache-2.0	Rust
Pronounced (influxdb eye-ox), short for iron oxide. This is the new core of InfluxDB written in Rust on top of Apache Arrow.
Adam	966	20	17	3 months ago	14	December 16, 2020	35	apache-2.0	Scala
ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
Parquetviewer	574			4 months ago				gpl-3.0	C#
Simple windows desktop application for viewing & querying Apache Parquet files
Parquet Dotnet	457	15	23	3 months ago	266	November 14, 2023	15	mit	C#
Fully managed Apache Parquet implementation
Parquet Dotnet	319			2 years ago	4	January 10, 2018	42	mit	C#
🏐 Apache Parquet for modern .NET
Parquet Cpp	312			5 years ago				apache-2.0	C++
Apache Parquet
Bigdata Playground	154			5 years ago			4	apache-2.0	TypeScript
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

Alternatives To Parquet Mr

Select To Compare

Iceberg ⭐ 5,179

Apache Iceberg

total releases 3most recent commit 3 months ago

Parquet Mr ⭐ 2,296

Apache Parquet

dependent packages 208total releases 17most recent commit 3 months ago

Drill ⭐ 1,856

Apache Drill is a distributed MPP query layer for self describing data

dependent packages 16total releases 24most recent commit 3 months ago

Influxdb_iox ⭐ 1,805

Pronounced (influxdb eye-ox), short for iron oxide. This is the new core of InfluxDB written in Rust on top of Apache Arrow.

total releases 4most recent commit 7 months ago

Adam ⭐ 966

ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.

dependent packages 17total releases 14most recent commit 3 months ago

Parquetviewer ⭐ 574

Simple windows desktop application for viewing & querying Apache Parquet files

most recent commit 4 months ago

Parquet Dotnet ⭐ 457

Fully managed Apache Parquet implementation

dependent packages 23total releases 266most recent commit 3 months ago

Parquet Dotnet ⭐ 319

🏐 Apache Parquet for modern .NET

total releases 4most recent commit 2 years ago

Parquet Cpp ⭐ 312

Apache Parquet

most recent commit 5 years ago

Bigdata Playground ⭐ 154

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

most recent commit 5 years ago

Suggest An Alternative To parquet-mr

Alternative Project Comparisons

Parquet Mr vs Iceberg

Parquet Mr vs Drill

Parquet Mr vs Influxdb_iox

Parquet Mr vs Adam

Parquet Mr vs Parquetviewer

Parquet Mr vs Parquet Dotnet

Parquet Mr vs Parquet Cpp

Parquet Mr vs Bigdata Playground

Popular Parquet Projects

Dsq ⭐ 3,401

Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.

total releases 2latest release October 20, 2022most recent commit 7 months ago

Roapi ⭐ 2,969

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

total releases 17latest release March 20, 2022most recent commit 4 months ago

Qsv ⭐ 2,079

CSVs sliced, diced & analyzed.

total releases 148latest release November 20, 2023most recent commit 3 months ago

Gaffer ⭐ 1,724

A large-scale entity and relation database supporting aggregation of properties

dependent packages 31total releases 101latest release November 14, 2023most recent commit 3 months ago

Petastorm ⭐ 1,693

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

dependent packages 8total releases 86latest release February 03, 2023most recent commit 5 months ago

Popular Apache Projects

Echarts ⭐ 58,775

Apache ECharts is a powerful, interactive charting and data visualization library for browser

dependent packages 6,345total releases 119latest release July 18, 2023most recent commit 15 days ago

Superset ⭐ 58,051

Apache Superset is a Data Visualization and Data Exploration Platform

dependent packages 21total releases 6latest release April 18, 2023most recent commit 20 days ago

Awesome Cpp ⭐ 53,034

A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.

most recent commit 3 months ago

Awesome Android Ui ⭐ 47,955

A curated list of awesome Android UI/UX libraries

most recent commit 5 months ago

Spark ⭐ 37,661

Apache Spark - A unified analytics engine for large-scale data processing

dependent packages 939total releases 46latest release May 09, 2021most recent commit 3 months ago

Popular Data Processing Categories