Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for spark presto
presto
x
spark
x
27 search results found
Alluxio
⭐
6,612
Alluxio, data orchestration for analytics and machine learning in the cloud
Sqlglot
⭐
4,652
Python SQL Parser and Transpiler
Linkis
⭐
3,224
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Dockerfiles
⭐
1,171
50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak
Yanagishima
⭐
584
Web UI for Trino, Hive and SparkSQL
Iceberg
⭐
409
Iceberg is a table format for large, slow-moving tabular data
Connectors
⭐
383
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.
Transport
⭐
288
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
Bigdata_docker
⭐
226
Big Data Ecosystem Docker
Dpkb
⭐
182
大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase
Huaweicloud Mrs Example
⭐
150
Examples for HUAWEI CLOUD MRS.
Terraform Aws Emr Cluster
⭐
67
Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS
Itachi
⭐
46
A library that brings useful functions from various modern database management systems to Apache Spark
Learnbasicbigdatatech
⭐
44
🚀Some projects on Big Data Analysis like Spark, Hive, Presto and Data Visualization like Superset
Hive Metastore
⭐
36
Apache Hive Metastore as a Standalone server in Docker
Building Data Lakehouse
⭐
32
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
Squerall
⭐
27
An implementation of the so-called Semantic Data Lake, using Apache Spark and Presto.
Pytd
⭐
17
Treasure Data Driver for Python
Bigkube
⭐
14
Minikube for big data with Scala and Spark
Swimlane Graphs
⭐
12
Swimlane graphs for Hive, SparkSQL, and Presto based on Ganglia resource graphs
Chicago Taxi Trips Analysis
⭐
10
Analysis of City Of Chicago Taxi Trip Dataset Using AWS EMR, Spark, PySpark, Zeppelin and Airbnb's Superset
Spark Hyperloglog
⭐
8
Algebird's HyperLogLog support for Apache Spark.
Distributable_docker_sql_on_hadoop
⭐
6
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Tpch Hdinsight
⭐
6
TPCH benchmark for various engines
Schema_evolution_exploration
⭐
5
Explore schema evolution using parquet and Spark or Presto
Aws Oss Alternatives
⭐
5
Open Source Alternatives to AWS Services
Tpcds Hdinsight
⭐
5
TPCDS benchmark for various engines
Related Searches
Scala Spark (3,279)
Python Spark (2,053)
Java Spark (1,587)
Apache Spark (1,207)
Spark Hadoop (1,188)
Jupyter Notebook Spark (1,151)
Spark Kafka (985)
Spark Streaming (817)
Spark Pyspark (812)
Shell Spark (703)
1-27 of 27 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.