Project Name	Stars	Most Recent Commit	Total Releases	Latest Release	Open Issues	License	Language
Devops Python Tools	709	4 months ago			37	mit	Python
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Bigdata Playground	154	5 years ago			4	apache-2.0	TypeScript
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Boxball	99	5 months ago	8	October 07, 2023	9	apache-2.0	Python
Prebuilt Docker images with Retrosheet's complete baseball history data for many analytical frameworks. Includes Postgres, cstore_fdw, MySQL, SQLite, Clickhouse, Drill, Parquet, and CSV.
Spark Mail	45	5 years ago			3	other	HTML
Tutorial on parsing Enron email to Avro and then explore the email set using Spark.
Etl Light	38	7 years ago				mit	Scala
A light Kafka to HDFS/S3 ETL library based on Apache Spark
Deephaven Parquet Viewer	21	4 months ago			4		Shell
A browser-based Parquet file viewer
Spark Lucenerdd Examples	15	7 months ago			2	apache-2.0	Scala
Examples of spark-lucenerdd
Greatex	10	2 years ago			1		Python
A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in Airflow.
Bigdata Platform	6	4 months ago				apache-2.0	Jupyter Notebook
End to end big data project, that aims to show how to implement different big data layers, from the infrastructure layer to the end user one. [HADOOP][Spark][Kafka][Cassandra][Ansible][Jupyter][Docker]

Alternatives To Deephaven Parquet Viewer

Select To Compare

Devops Python Tools ⭐ 709

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

most recent commit 4 months ago

Bigdata Playground ⭐ 154

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

most recent commit 5 years ago

Boxball ⭐ 99

Prebuilt Docker images with Retrosheet's complete baseball history data for many analytical frameworks. Includes Postgres, cstore_fdw, MySQL, SQLite, Clickhouse, Drill, Parquet, and CSV.

total releases 8most recent commit 5 months ago

Spark Mail ⭐ 45

Tutorial on parsing Enron email to Avro and then explore the email set using Spark.

most recent commit 5 years ago

Etl Light ⭐ 38

A light Kafka to HDFS/S3 ETL library based on Apache Spark

most recent commit 7 years ago

Deephaven Parquet Viewer ⭐ 21

A browser-based Parquet file viewer

most recent commit 4 months ago

Spark Lucenerdd Examples ⭐ 15

Examples of spark-lucenerdd

most recent commit 7 months ago

Greatex ⭐ 10

A project for exploring how Great Expectations can be used to ensure data quality and validate batches within a data pipeline defined in Airflow.

most recent commit 2 years ago

Bigdata Platform ⭐ 6

End to end big data project, that aims to show how to implement different big data layers, from the infrastructure layer to the end user one. [HADOOP][Spark][Kafka][Cassandra][Ansible][Jupyter

most recent commit 4 months ago

Suggest An Alternative To deephaven-parquet-viewer

Alternative Project Comparisons

Deephaven Parquet Viewer vs Devops Python Tools

Deephaven Parquet Viewer vs Bigdata Playground

Deephaven Parquet Viewer vs Boxball

Deephaven Parquet Viewer vs Spark Mail

Deephaven Parquet Viewer vs Etl Light

Deephaven Parquet Viewer vs Spark Lucenerdd Examples

Deephaven Parquet Viewer vs Greatex

Deephaven Parquet Viewer vs Bigdata Platform

Popular Parquet Projects

Iceberg ⭐ 5,179

Apache Iceberg

total releases 3latest release October 29, 2022most recent commit 3 months ago

Dsq ⭐ 3,401

Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.

total releases 2latest release October 20, 2022most recent commit 7 months ago

Roapi ⭐ 2,969

Create full-fledged APIs for slowly moving datasets without writing a single line of code.

total releases 17latest release March 20, 2022most recent commit 5 months ago

Parquet Mr ⭐ 2,296

Apache Parquet

dependent packages 208total releases 17latest release May 12, 2023most recent commit 4 months ago

Qsv ⭐ 2,079

CSVs sliced, diced & analyzed.

total releases 148latest release November 20, 2023most recent commit 3 months ago

Popular Docker Projects

Mall ⭐ 73,367

mall项目是一套电商系统，包括前台商城系统及后台管理系统，基于SpringBoot+MyBatis 前台商城系统包含首页门户、商品推荐、商品搜索、商品展示、购物车、订单流程、会员中心、客户服务、帮助中后台管理系统包含商品管理、订单管理、会员管理、促销管理、运营管理、内容管理、统计报表、财务管理、权限

most recent commit 4 months ago

Netdata ⭐ 67,808

The open-source observability platform everyone needs!

most recent commit a month ago

Moby ⭐ 67,713

The Moby Project - a collaborative project for the container ecosystem to assemble container-based systems

dependent packages 7,349total releases 849latest release November 13, 2023most recent commit 15 days ago

Excalidraw ⭐ 66,345

Virtual whiteboard for sketching hand-drawn like diagrams

dependent packages 49total releases 295latest release December 06, 2023most recent commit 3 months ago

Devops Exercises ⭐ 60,067

Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions

most recent commit 5 months ago

Popular Data Processing Categories