Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for sql big data
big-data
x
sql
x
64 search results found
Spark
⭐
37,661
Apache Spark - A unified analytics engine for large-scale data processing
Clickhouse
⭐
34,124
ClickHouse® is a free analytics DBMS for big data
Flink
⭐
22,747
Apache Flink
Tdengine
⭐
22,519
TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.
Shardingsphere
⭐
19,381
Distributed SQL transaction & query engine for data sharding, scaling, encryption, and more - on any database.
Questdb
⭐
13,178
An open source time-series database for fast ingest and SQL queries
Trino
⭐
9,118
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Beam
⭐
7,355
Apache Beam is a unified programming model for Batch and Streaming data processing.
Starrocks
⭐
7,191
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries. InfoWorld’s 2023 BOSSIE Award for best open source software.
Risingwave
⭐
5,799
The distributed streaming database. Engineered to offer the simplest and most cost-efficient way for stream processing and management.
Data Engineer Handbook
⭐
5,650
This is a repo with links to everything you'd ever want to learn about data engineering
Hive
⭐
5,222
Apache Hive
Ignite
⭐
4,626
Apache Ignite
Arrow Datafusion
⭐
4,514
Apache Arrow DataFusion SQL Query Engine
Calcite
⭐
4,216
Apache Calcite
Crate
⭐
3,864
CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. It is PostgreSQL-compatible, and based on Lucene.
Sql Generator
⭐
3,346
🔨 用 JSON 来生成结构化的 SQL 语句,基于 Vue3 + TypeScript + Vite + Ant Design + MonacoEditor 实现,项目简单(重逻辑轻页面)、适合练手~
Featurebase
⭐
2,504
A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Docker instance: https://hub.docker.com/r/featurebasedb/featurebase
Data Science Roadmap
⭐
2,445
Data Science Roadmap from A to Z
Griddb
⭐
2,310
GridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.
Lakesoul
⭐
2,248
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Flinkstreamsql
⭐
1,972
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
Poli
⭐
1,920
An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
Drill
⭐
1,856
Apache Drill is a distributed MPP query layer for self describing data
Ytsaurus
⭐
1,694
YTsaurus is a scalable and fault-tolerant open-source big data platform.
Arrow Ballista
⭐
1,111
Apache Arrow Ballista Distributed Query Engine
Utils4s
⭐
1,033
scala、spark使用过程中,各种测试用例以及相关资料整理
Phoenix
⭐
1,006
Mirror of Apache Phoenix
Blaze
⭐
784
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
Tugraph Analytics
⭐
557
TuGraph Analytics is the fastest OLAP graph database.
Metorikku
⭐
536
A simplified, lightweight ETL Framework based on Apache Spark
Clickbench
⭐
510
ClickBench: a Benchmark For Analytical Databases
Sylph
⭐
396
Stream computing platform for bigdata
Big_data_architect_skills
⭐
353
一个大数据架构师应该掌握的技能
Compass
⭐
284
Compass is a task diagnosis platform for bigdata
Usql
⭐
233
U-SQL Examples and Issue Tracking
Gimel
⭐
230
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Dt Sql Parser
⭐
228
SQL Parsers for BigData, built with antlr4.
Calcite Avatica
⭐
225
Apache Calcite Avatica
Flink Notes
⭐
223
flink学习笔记
Presto Go Client
⭐
220
A Presto client for the Go programming language.
Erd Online
⭐
186
ERD Online is an online collaborative data warehouse design software. It does not need to install applications locally and operate databases online. It is an excellent alternative to desktop data modeling tools.
Idp
⭐
165
IDP is an open source AI IDE for data scientists and big data engineers.
Bigdata In Practice
⭐
154
大数据实践项目 Hadoop、Spark、Kafka、Hbase、Flink.....
Ignite 3
⭐
152
Apache Ignite 3
Maha
⭐
126
A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Report
⭐
115
自动化配置报表平台。演示地址http://58.87.112.247/report 账号 visitor密码123456
Calcite Avatica Go
⭐
110
Mirror of Apache Calcite - Avatica Go SQL Driver
Spark Website
⭐
109
Apache Spark Website
Spark With Python
⭐
98
Fundamentals of Spark with Python (using PySpark), code examples
Flowman
⭐
85
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
Spark Acid
⭐
79
ACID Data Source for Apache Spark based on Hive ACID
Shardingsphere On Cloud
⭐
75
A collection of tools and best practices to take ShardingSphere into the cloud
The Apache Ignite Book
⭐
72
All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Flink Learn
⭐
71
Learning Flink : Flink CEP,Flink Core,Flink SQL
Cleanframes
⭐
70
type-class based data cleansing library for Apache Spark SQL
Ineuron Full Stack Data Science Assignments
⭐
68
This Repository consists of Assignments and projects of the iNeuron Full Stack Data Science Course
Guery
⭐
61
Distributed SQL query engine written in Go for big data
Spark Docker
⭐
53
Official Dockerfile for Apache Spark
Phoenix Connectors
⭐
48
Apache Phoenix Connectors
Phoenix Queryserver
⭐
41
Apache Phoenix Query Server
Flink Book
⭐
38
大数据,流计算,实时计算,Flink框架学习资料。畅销书籍《深入理解Flink核心设计与实践原理》 随书代码,书中讲解的Flink特性均有完整可运行的代码供读者运行和测试。整个工程共有【182个Jav
Opteryx
⭐
37
🦖 A SQL-on-everything Query Engine you can execute over multiple databases and file formats. Query your data, where it lives.
Sharpetl
⭐
36
Write ETL using your favorite SQL dialects
Ides
⭐
32
智能数据探索服务(Intelligent Data Exploration Service),一站式Data + AI数据解决方案!
Framework Of Bigdata
⭐
30
大数据面试题,从0到1走向架构师之路。Flink、Spark、Hive、HBase、Hadoop、K
Vulkn
⭐
26
Love your Data. Love the Environment. Love VULKИ.
Doris Thirdparty
⭐
26
Self-managed thirdparty dependencies for Apache Doris
Movies Analytics In Spark And Scala
⭐
24
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Analytical_dp_with_sql
⭐
24
Code for my "Efficient Data Processing in SQL" book.
Bigdata_platform
⭐
20
大数据建模分析平台
Analyzer
⭐
20
The Analyzer, which runs in Microsoft Access, documents databases, is menu-driven, easy to use, and generates valuable information for developers and users with databases such as Access, and anything Access can connect to such as SQL Server and Oracle. If you have trouble or ideas, go to http://msaccessgurus.com/tool/Analyzer.htm and send me a message!
Awesome Prestosql
⭐
19
A list of Presto/Trino resources
Firstyear
⭐
18
This repository contains the work I've done in my first year along with some study materials which I had collected.
Etlutils
⭐
16
Utilities for easily loading big data from relational databases directly into ffdf objects in R.
Arrow Ballista Python
⭐
16
Apache Arrow Ballista Python bindings
Computing With Data
⭐
15
Code samples for my book "Computing with Data: An Introduction to the Data Industry"
Big Data Engineering
⭐
15
Hadoop Data Ingestion Tool
⭐
15
OLAP and ETL of Big Data
Cheatsheets For Ai
⭐
14
Cheatsheets on numerous topics ranging from DataScience | ML | DL | AI | Big Data.
Tnydb
⭐
12
A little tool for analysing big data
Kio
⭐
12
Kotlin extensions for Apache Beam
Data Paths
⭐
11
Easterbunny
⭐
11
EasterBunny数据分析
Scray
⭐
11
Lambda Architecture Framework for Big Data, Spark, Versioned Data, NoSQL and SQL-Stores.
Alinous Elastic Db
⭐
11
Alinous Elastic DB is database for bigdata. It can scale both the SQL engine and storage engine. This database engine is for scaling and sharding.
Flink Sql Computing Platform
⭐
10
It is a kind of big data computing platform which is driven by the Flink SQL. In particular, it provides the SQL programming.
Bigdata Etl Pipeline
⭐
10
The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a complete data pipeline with all components seamlessly set up and ready to use
Notewarehouse
⭐
9
❄️本仓库包含Java学习笔记和大数据学习笔记,主要包含Java基础、JavaWEB、Java框架、
J2v
⭐
9
Creates Looker Views and Explore based on provided JSON(s).
Anki_revlog_analysis
⭐
8
Anki 复习数据处理与分析
Ignite Teamcity Bot
⭐
8
Apache ignite Teamcity Bot
Mlsql
⭐
8
New Repo: https://github.com/byzer-org/kolo-lang
Awesome Olap
⭐
8
A curated list of awesome Online Analytical Processing databases, frameworks, ressources and other awesomeness.
Hurricanedb
⭐
6
A real-time distributed OLAP engine
Drillworkshop
⭐
6
Learn how to quickly explore your data with Apache Drill
Data Engineer Portfolio
⭐
6
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Hivesql
⭐
5
HiveSQL知识点、HQL实战、HiveSQL练习题
Related Searches
Database Sql (5,501)
Python Sql (3,922)
Mysql Sql (2,867)
Java Sql (2,781)
Javascript Sql (2,662)
C Sharp Sql (2,429)
Postgresql Sql (2,411)
Php Sql (2,276)
Golang Sql (1,383)
Sql Table (1,358)
1-64 of 64 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.