Awesome Open Source

Search results for spark presto

27 search results found

Alluxio ⭐ 6,612

Alluxio, data orchestration for analytics and machine learning in the cloud

Sqlglot ⭐ 4,652

Python SQL Parser and Transpiler

Linkis ⭐ 3,224

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

Dockerfiles ⭐ 1,171

50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak

Yanagishima ⭐ 584

Web UI for Trino, Hive and SparkSQL

Iceberg ⭐ 409

Iceberg is a table format for large, slow-moving tabular data

Connectors ⭐ 383

This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.

Transport ⭐ 288

A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.

Bigdata_docker ⭐ 226

Big Data Ecosystem Docker

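A collection of big data material, covering distributed storage engines, distributed compute engines, data warehouse construction, and more. Keywords: Hadoop, HBase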

Huaweicloud Mrs Example ⭐ 150

Examples for HUAWEI CLOUD MRS.

Terraform Aws Emr Cluster ⭐ 67

Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS

A library that brings useful functions from various modern database management systems to Apache Spark

Learnbasicbigdatatech ⭐ 44

🚀Some projects on Big Data Analysis like Spark, Hive, Presto and Data Visualization like Superset

Hive Metastore ⭐ 36

Apache Hive Metastore as a Standalone server in Docker

Building Data Lakehouse ⭐ 32

Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data

Squerall ⭐ 27

An implementation of the so-called Semantic Data Lake, using Apache Spark and Presto.

Treasure Data Driver for Python

Minikube for big data with Scala and Spark

Swimlane Graphs ⭐ 12

Swimlane graphs for Hive, SparkSQL, and Presto based on Ganglia resource graphs

Chicago Taxi Trips Analysis ⭐ 10

Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset

Spark Hyperloglog ⭐ 8

Algebird's HyperLogLog support for Apache Spark.

Distributable_docker_sql_on_hadoop ⭐ 6

Toy Hadoop cluster combining various SQL-on-Hadoop variants

Tpch Hdinsight ⭐ 6

TPCH benchmark for various engines

Schema_evolution_exploration ⭐ 5

Explore schema evolution using parquet and Spark or Presto

Aws Oss Alternatives ⭐ 5

Open Source Alternatives to AWS Services

Tpcds Hdinsight ⭐ 5

TPCDS benchmark for various engines

Copyright 2018-2024 Awesome Open Source. All rights reserved.