Awesome Open Source
Search results for sql hadoop
73 search results found
Spark
⭐
37,661
Apache Spark - A unified analytics engine for large-scale data processing
Doris
⭐
11,243
Apache Doris is an easy-to-use, high-performance, unified analytics database.
Trino
⭐
9,118
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Hive
⭐
5,222
Apache Hive
Ignite
⭐
4,626
Apache Ignite
Calcite
⭐
4,216
Apache Calcite
Ibis
⭐
3,404
The flexibility of Python with the scale and performance of modern SQL.
Drill
⭐
1,856
Apache Drill is a distributed MPP query layer for self-describing data
Kyuubi
⭐
1,849
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Flink Streaming Platform Web
⭐
1,698
A web platform for real-time stream computing based on Flink
Awesome Opensource Data Engineering
⭐
1,331
An Awesome List of Open-Source Data Engineering Projects
Awesome Hadoop
⭐
987
A curated list of amazingly awesome Hadoop and Hadoop ecosystem resources
Hawq
⭐
677
Apache HAWQ
Data Engineering Interview Questions
⭐
554
More than 2,000 data engineer interview questions.
Spark Redshift
⭐
514
Redshift data source for Apache Spark
Sylph
⭐
396
Stream computing platform for bigdata
Big_data_architect_skills
⭐
353
Skills a big data architect should master
Android Nosql
⭐
287
Lightweight, simple structured NoSQL database for Android
Compass
⭐
284
Compass is a task diagnosis platform for bigdata
Calcite Avatica
⭐
225
Apache Calcite Avatica
Awesome Hbase
⭐
156
A curated list of awesome HBase projects and resources.
Calcite Avatica Go
⭐
110
Mirror of Apache Calcite - Avatica Go SQL Driver
Spark With Python
⭐
98
Fundamentals of Spark with Python (using PySpark), code examples
Antsdb
⭐
97
AntsDB is a low latency, high concurrency, MySQL compliant SQL layer for HBase
Presto
⭐
93
Teradata Distribution of Presto -- A Distributed SQL Query Engine for Big Data
Flink Spark Submiter
⭐
92
Submit Flink/Spark jobs from a local IDEA to a Yarn/k8s cluster
Devops Perl Tools
⭐
88
25+ DevOps CLI Tools - Anonymizer, SQL ReCaser (MySQL, PostgreSQL, AWS Redshift, Snowflake, Apache Drill, Hive, Impala, Cassandra CQL, Microsoft SQL Server, Oracle, Couchbase N1QL, Dockerfiles), Hadoop HDFS & Hive tools, Solr/SolrCloud CLI, Nginx stats & HTTP(S) URL watchers for load-balanced web farms, Linux tools etc.
Flowman
⭐
85
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
Trident Lambda Splout
⭐
78
A toy example of a "Lambda architecture" using Storm's Trident as real-time layer and Splout SQL as batch layer.
Implyr
⭐
73
SQL backend to dplyr for Impala
The Apache Ignite Book
⭐
72
All code samples, scripts, and more in-depth examples for The Apache Ignite Book. Includes Apache Ignite 2.6 or above.
Myfox Load Module
⭐
70
Data loading module for Myfox (a distributed MySQL proxy layer for OLAP systems)
Sqlwindowing
⭐
65
SQL Windowing Functions for Hadoop
Apachespark
⭐
59
This repository helps you learn Databricks concepts through examples. It covers the important topics a data engineer needs in real-world work, using PySpark and Spark SQL for development. The course ends with a few case studies.
Redshift Benchmark
⭐
52
Til
⭐
51
Today I Learned
Base
⭐
50
https://www.researchgate.net/profile/Rajah_Iyer
Splout Db
⭐
50
A web-latency SQL spout for Hadoop.
Lingual
⭐
48
Stand-alone ANSI SQL for Cascading on Apache Hadoop
Bigdata Getting Started
⭐
37
Hands-on projects with big data frameworks (Hadoop, Spark, Storm, Flink)
Course
⭐
36
Enterprise SQL-on-Hadoop Solution [Season One]
Pentest Wiki
⭐
35
Standardized vulnerability names and remediation advice for penetration test reports
Cipher
⭐
33
Computation over unstructured video data based on HDFS and Spark
Engineeringteam
⭐
32
A place for organizing the YBIGTA engineering team's materials.
Hive Mr3
⭐
29
Hive for MR3
Doris Thirdparty
⭐
26
Self-managed thirdparty dependencies for Apache Doris
Movies Analytics In Spark And Scala
⭐
24
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Adherer
⭐
23
Computation of adherence to medications from Electronic Healthcare Data in R
Springboard Data Science Immersive
⭐
23
Bigdata_platform
⭐
20
Big data modeling and analysis platform
Data Science Ebooks
⭐
19
Data Science E-books, Interview Resources and Cheat-sheets
Hadoop Data Ingestion Tool
⭐
15
OLAP and ETL of Big Data
Clickhouse_hadoop
⭐
14
Import data from ClickHouse to Hadoop with pure SQL
Cheatsheets For Ai
⭐
14
Cheatsheets on numerous topics ranging from DataScience | ML | DL | AI | Big Data.
Dataanalysis_cases
⭐
13
Practice projects and reference materials for data analysts
Tnydb
⭐
12
A little tool for analysing big data
Ysmart
⭐
12
Mirror of YSmart
Easterbunny
⭐
11
EasterBunny data analysis
Bigdata Etl Pipeline
⭐
10
The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a complete data pipeline with all components set up and ready to use.
Notewarehouse
⭐
9
❄️ This repository contains Java study notes and big data study notes, mainly covering Java fundamentals, JavaWEB, Java frameworks,
Easy2oracle
⭐
9
Easy-To-Oracle is a data integration tool. It can pull data from databases like Microsoft SQL Server, MySQL, Sybase, SQLite, Presto (Hadoop) and Excel directly into your Oracle 10g/11g/12c database
Kyuubi Docker
⭐
9
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Data Interview
⭐
8
Data analysis interview preparation
Ignite Teamcity Bot
⭐
8
Apache Ignite TeamCity Bot
Data Engineer Portfolio
⭐
6
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Drillbook
⭐
6
The Official Source Repository for Learning Apache Drill (O'Reilly, 2018)
Drillworkshop
⭐
6
Learn how to quickly explore your data with Apache Drill
Hadoop Python Hive Tutorial
⭐
6
A tutorial for using Hadoop with Python and Hive
Cse Lab Solutions
⭐
6
Comprehensive CSE Lab Solutions repo, encompassing all my lab manuals, code, documents, and end-of-semester questions from my B.Tech program (2020-2024).
Distributable_docker_sql_on_hadoop
⭐
6
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Greenplum Pxf Examples
⭐
5
Using Greenplum with PXF to access external data
Doris Sdk
⭐
5
SDK for Apache Doris
Algorithms_jobs
⭐
5
Yeap! For a better job!