Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for sql spark
spark
x
sql
x
284 search results found
Spark
⭐
35,897
Apache Spark - A unified analytics engine for large-scale data processing
Redash
⭐
23,239
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Doris
⭐
8,387
Apache Doris is an easy-to-use, high performance and unified analytics database.
Mage Ai
⭐
4,762
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
Sqlglot
⭐
3,314
Python SQL Parser and Transpiler
Linkis
⭐
3,076
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Ibis
⭐
2,766
The flexibility of Python with the scale and performance of modern SQL.
Quicksql
⭐
1,939
A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
Sql Generator
⭐
1,923
🔨 用 JSON 来生成结构化的 SQL 语句,基于 Vue3 + TypeScript + Vite + Ant Design + MonacoEditor 实现,项目简单(重逻辑轻页面)、适合练手~
Kyuubi
⭐
1,624
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Fugue
⭐
1,593
A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Ytsaurus
⭐
1,473
YTsaurus is a scalable and fault-tolerant open-source big data platform.
Lakesoul
⭐
1,304
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Utils4s
⭐
1,033
scala、spark使用过程中,各种测试用例以及相关资料整理
Awesome Opensource Data Engineering
⭐
950
An Awesome List of Open-Source Data Engineering Projects
Scriptis
⭐
767
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Datafusion
⭐
626
DataFusion has now been donated to the Apache Arrow project
Pythondatascience Collections
⭐
615
最全数据分析资料汇总(含python、爬虫、数据库、大数据、tableau、统计学等)
Coral
⭐
552
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
Metorikku
⭐
536
A simplified, lightweight ETL Framework based on Apache Spark
Spark Avro
⭐
535
Avro Data Source for Apache Spark
Spark Redshift
⭐
514
Redshift data source for Apache Spark
Spark Sql Perf
⭐
452
Data Engineering Interview Questions
⭐
449
More than 2000+ Data engineer interview questions.
Spark Scala Examples
⭐
443
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language
Sylph
⭐
396
Stream computing platform for bigdata
Blaze
⭐
354
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
Big_data_architect_skills
⭐
353
一个大数据架构师应该掌握的技能
Miscellaneous
⭐
351
Includes notes on Apache Spark, Spark for Physics, Jupyter notebook examples for Spark, Oracle and other DB systems.
Synapse
⭐
342
Samples for Azure Synapse Analytics
Bahir
⭐
325
Mirror of Apache Bahir
Spark Sql On Hbase
⭐
319
Native, optimized access to HBase Data through Spark SQL/Dataframe Interfaces
Iql
⭐
276
An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)
Gimel
⭐
230
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Spark Recommendation Engine
⭐
222
Sql Spark Connector
⭐
214
Apache Spark Connector for SQL Server and Azure SQL
Xsql
⭐
207
Unified SQL Analytics Engine Based on SparkSQL
Data Science
⭐
206
Projects and awesome list for all Data Science fields
Kamu Cli
⭐
204
New generation decentralized data warehouse and streaming data pipeline
Spark Elastic
⭐
197
This project combines Apache Spark and Elasticsearch to enable mining & prediction for Elasticsearch.
Example Spark
⭐
197
Spark, Spark Streaming and Spark SQL unit testing strategies
Spark Programming Guide Zh Cn
⭐
188
Spark 编程指南简体中文版
Spark 2.3.1
⭐
174
Spark-2.3.1源码解读
Zio Protoquill
⭐
164
Quill for Scala 3
Spark Authorizer
⭐
158
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Spark Pac4j
⭐
155
Security library for Sparkjava: OAuth, CAS, SAML, OpenID Connect, LDAP, JWT...
Bigdata In Practice
⭐
154
大数据实践项目 Hadoop、Spark、Kafka、Hbase、Flink.....
Spark Bigquery
⭐
149
Google BigQuery support for Spark, SQL, and DataFrames
Spark Ext
⭐
147
Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark
Spark Style Guide
⭐
145
Spark style guide
Sqlflow
⭐
132
SQLflow based on python development, support to Spark, as the underlying distributed computing engine, through a set of unified configuration file to complete the batch, flow calculation, the Rest service development.
Studyspark
⭐
129
学习 Spark 的一个小项目,以及其中各种调优的笔记
Spark Code Analysis
⭐
127
Easy_sql
⭐
115
A library developed to ease the data ETL development process.
Parquet Index
⭐
113
Spark SQL index for Parquet tables
Spring Shiro Spark
⭐
112
Spring-Shiro-Spark是Spring-Boot Hibernate Spark Spark-SQL Shiro iView VueJs... ...的集成尝试
Simba
⭐
100
Spatial In-Memory Big data Analytics
Spark Website
⭐
99
Apache Spark Website
Spark With Python
⭐
98
Fundamentals of Spark with Python (using PySpark), code examples
Jaws Spark Sql Rest
⭐
92
Spear
⭐
92
A playground for experimenting ideas that may apply to Spark SQL/Catalyst
Flink Spark Submiter
⭐
92
从本地IDEA提交Flink/Spark任务到Yarn/k8s集群
Spark Acid
⭐
79
ACID Data Source for Apache Spark based on Hive ACID
Tpch Spark
⭐
74
TPC-H queries in Apache Spark SQL using native DataFrames API
Movalytics Data Warehouse
⭐
74
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
Cleanframes
⭐
70
type-class based data cleansing library for Apache Spark SQL
Stratio Connector Sparksql
⭐
67
(DEPRECATED) A crossdata connector to Spark SQL
Sparksql For Hbase
⭐
66
Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers
The Apache Ignite Book
⭐
66
All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Azure Sqldb Spark
⭐
66
This project provides a client library that allows Azure SQL DB or SQL Server to act as an input source or output sink for Spark jobs.
Avro Parquet Spark Example
⭐
61
An example of using Avro and Parquet in Spark SQL
Cc Index Table
⭐
60
Index Common Crawl archives in tabular format
Learn Spark
⭐
60
Examples To Help You Learn Apache Spark
Apachespark
⭐
59
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We will be using pyspark & sparksql for the development. At the end of the course we also cover few case studies.
Geosparktemplateproject
⭐
55
Template projects for GeoSpark, GeoSpark-SQL, GeoSpark-Viz
Sparkcore
⭐
54
Spark源码分析,主要包含SparkContext源码、Executor进程启动、Stage划分、
Spark Cep
⭐
51
Spark CEP is an extension of Spark Streaming to support SQL-based query processing
Matrel
⭐
50
A library to support distributed matrix computation for machine learning and data analysis
Uberscriptquery
⭐
50
UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy
Sparkonkudu
⭐
48
Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.
Examples
⭐
47
These are some code examples
Sparkudfexamples
⭐
46
Spark SQL UDF examples
Hanhan Spark Python
⭐
40
Used Spark core python, Spark sql, Spark MLlib, Spark Streaming
Flowman
⭐
39
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
Learning Spark Sql
⭐
39
Learning Spark SQL, published by Packt
Spark Ranger
⭐
38
ACL Management for Apache Spark SQL with Apache Ranger. This library has been contributed to https://github.com/apache/submarine as a sub-module, and that module can still be used individually. The project here will no longer be updated. If you have any questions please go to https://github.com/apache/submarine/tree/master/do to learn how to use and give feedback to the apache submarine community by following https://submarine.apache.org/communit
Azure Databricks
⭐
37
Azure Databricks - Advent of 2020 Blogposts
Bigdata Getting Started
⭐
37
大数据相关框架实战项目(Hadoop, Spark, Storm, Flink)
Spark Skewed Join Hint
⭐
36
SparkSQL自定义Hint优化器解决热点数据导致JOIN数据倾斜问题
Wind_python
⭐
36
量化开发 多因子选股模型
Rdf2x
⭐
35
RDF2X converts big RDF datasets to the relational database model, CSV, JSON and ElasticSearch.
Airbnb Spark Thrift
⭐
34
A library for loadling Thrift data into Spark SQL
Sharpetl
⭐
34
Write ETL using your favorite SQL dialects
Aerospark
⭐
34
Aerospike Spark Connector
Cipher
⭐
33
基于hdfs spark的视频非结构化数据计算
Starry
⭐
33
fast spark local mode
Engineeringteam
⭐
32
와이빅타 엔지니어링팀의 자료를 정리해두는 곳입니다.
Ides
⭐
31
智能数据探索服务(Intelligent Data Exploration Service),一站式Data + AI数据解决方案!
Structuredstreaminginsql
⭐
31
sql实现Structured Streaming
Sope
⭐
31
Apache Spark ETL Utilities
Related Searches
Database Sql (3,816)
Python Sql (3,204)
Javascript Sql (2,981)
Java Sql (2,781)
Mysql Sql (2,405)
C Sharp Sql (2,344)
Php Sql (2,098)
Postgresql Sql (2,097)
Python Spark (2,035)
Java Spark (1,594)
1-100 of 284 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2023 Awesome Open Source. All rights reserved.