Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for spark sql
spark-sql
x
171 search results found
Redash
⭐
24,479
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Zio Quill
⭐
2,135
Compile-time Language Integrated Queries for Scala
Spark
⭐
1,963
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Kyuubi
⭐
1,849
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Almond
⭐
1,560
A Scala kernel for Jupyter
Gluten
⭐
870
Gluten: Plugin to Double SparkSQL's Performance
Hadoop_study
⭐
817
定期更新Hadoop生态圈中常用大数据组件文档 重心依次为: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (项目包含hadoop思维导图 印象笔记 Scala版本简单demo 常用工具类 去敏后的train code 持续更新!!!)
Useractionanalyzeplatform
⭐
810
电商用户行为分析大数据平台
Yanagishima
⭐
584
Web UI for Trino, Hive and SparkSQL
Learningsparkv2
⭐
570
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Sparta
⭐
526
Real Time Analytics and Data Pipelines based on Spark Streaming
Sparklens
⭐
520
Qubole Sparklens tool for performance tuning Apache Spark
Magellan
⭐
509
Geo Spatial Data Analytics on Spark
Learningspark
⭐
406
Scala examples for learning to use Spark
Sequoiadb
⭐
311
SequoiaDB 巨杉数据库是一款分布式文档型数据库,自研的原生分布式存储引擎支持完整ACID,具备弹性扩展、高并发和 JSON 的半结构化数据格式为基础,兼容SQL协议、S3对象数据引擎接口,进一步形成Multi-Model多模
Data Accelerator
⭐
293
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Spark Hbase Connector
⭐
287
Connect Spark to HBase for reading and writing data with ease
Spark Druid Olap
⭐
283
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.
Cc Pyspark
⭐
280
Process Common Crawl data with Python and Spark
Iql
⭐
276
An ad hoc query service based on the spark sql engine.(基于spark sql引擎的即席查询服务)
Cuelake
⭐
266
Use SQL to build ELT pipelines on a data lakehouse.
Jupysql
⭐
261
Better SQL in Jupyter. 📊
Spark Workshop
⭐
231
Apache Spark™ and Scala Workshops
Pyspark Cheatsheet
⭐
230
🐍 Quick reference guide to common patterns & functions in PySpark.
Dt Sql Parser
⭐
228
SQL Parsers for BigData, built with antlr4.
Rasterframes
⭐
226
Geospatial Raster support for Spark DataFrames
Ngods Stocks
⭐
217
New Generation Opensource Data Stack Demo
Xsql
⭐
207
Unified SQL Analytics Engine Based on SparkSQL
Zeppelin Notebooks
⭐
206
Gallery of Apache Zeppelin notebooks
Zio Protoquill
⭐
192
Quill for Scala 3
Emotional_analysis
⭐
190
[毕业设计]基于Spark网易云音乐数据分析【1.图计算 2.机器学习预测歌曲分类 3.评论词云 4.评论时间段 5.评论top榜 6.热歌top榜 7.用户性别比例 8.用户星座比例 9.用户年龄比例 10.用户全国地理分布 11.热评搜索等等..】
Mcw Big Data Analytics And Visualization
⭐
184
MCW Big data analytics and visualization
Opaque Sql
⭐
171
An encrypted data analytics platform
Qbeast Spark
⭐
171
Qbeast-spark: DataSource enabling multi-dimensional indexing and efficient data sampling. Big Data, free from the unnecessary!
Spark Alchemy
⭐
169
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Bigdata Playground
⭐
154
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Spark Structured Streaming Examples
⭐
153
Spark Structured Streaming / Kafka / Cassandra / Elastic
Spring Shiro Spark
⭐
112
Spring-Shiro-Spark是Spring-Boot Hibernate Spark Spark-SQL Shiro iView VueJs... ...的集成尝试
Spark R Notebooks
⭐
109
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Pulsar Spark
⭐
103
Spark Connector to read and write with Pulsar
Focusbigdata
⭐
89
【大数据成神之路学习路径+面经+简历】
Sparksql Protobuf
⭐
73
Read SparkSQL parquet file as RDD[Protobuf]
Jupyterlab Sql Editor
⭐
72
A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino
Cleanframes
⭐
70
type-class based data cleansing library for Apache Spark SQL
Stratio Connector Sparksql
⭐
67
(DEPRECATED) A crossdata connector to Spark SQL
Sqlflow
⭐
61
SQLFlow is a bridge that connects a SQL engine, e.g. MySQL, Hive, SparkSQL or SQL Server, with TensorFlow and other machine learning toolkits. SQLFlow extends the SQL language to enable model training, prediction and inference.
Apachespark
⭐
59
This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We will be using pyspark & sparksql for the development. At the end of the course we also cover few case studies.
Bigtable Sql
⭐
59
分布式大数据SQL查询可视化界面!
Spark Records
⭐
58
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
Recommendmoteur
⭐
57
电影推荐系统、电影推荐引擎、使用Spark完成的电影推荐引擎
Spark
⭐
55
Apache Spark is a fast, in-memory data processing engine with elegant and expressive development API's to allow data workers to efficiently execute streaming, machine learning or SQL workloads that require fast iterative access to datasets.This project will have sample programs for Spark in Scala language .
Big_data
⭐
55
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Spark Select
⭐
53
A library for Spark DataFrame using MinIO Select API
Awesome Pulsar
⭐
53
A curated list of Pulsar tools, integrations and resources.
Sparkonkudu
⭐
48
Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.
Spark Google Spreadsheets
⭐
46
Google Spreadsheets datasource for SparkSQL and DataFrames
Datapipelines Essentials Python
⭐
45
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Geospark
⭐
45
bring sf to spark in production
Struct Type Encoder
⭐
44
Deriving Spark DataFrame schemas from case classes
Datasource Receiver
⭐
41
Spark Receiver for SQL or NoSQL Databases like Cassandra, MongoDB, Elasticsearch or JDBC
Sparkoptics
⭐
40
Optics for Spark DataFrames
Realtime Data Analytics Using Spark
⭐
39
Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc
Spark Ranger
⭐
38
ACL Management for Apache Spark SQL with Apache Ranger. This library has been contributed to https://github.com/apache/submarine as a sub-module, and that module can still be used individually. The project here will no longer be updated. If you have any questions please go to https://github.com/apache/submarine/tree/master/do to learn how to use and give feedback to the apache submarine community by following https://submarine.apache.org/communit
Sope
⭐
37
Apache Spark ETL Utilities
Sparkgis
⭐
37
GIS extension for SparkSQL
Sharpetl
⭐
36
Write ETL using your favorite SQL dialects
Paraflow
⭐
36
A real-time analytical system for ID-associated data
Litemall Dw
⭐
36
基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;
Airbnb Spark Thrift
⭐
34
A library for loadling Thrift data into Spark SQL
Sparkdemo
⭐
34
spark全示例代码(java、scala) Spark most full instance code DEMO (java、scala)
Spark Postgres
⭐
33
PostgreSQL and GreenPlum Data Source for Apache Spark
Spark Twitter Sentiment Analysis
⭐
33
Sentiment Analysis of a Twitter Topic with Spark Structured Streaming
Real Time Data Warehouse
⭐
29
Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi
Vora
⭐
27
SAP HANA Vora
Addb Jedis
⭐
27
Hadooplearning
⭐
25
全套大数据基础学习教程,包含最基础的centos、maven。大数据主要包含hdfs、mr、yarn
Spark Power Bi
⭐
25
Power BI API adapter for Apache Spark (deprecated)
Spark Structured Streaming Examples
⭐
25
Spark structured streaming examples with using of version 3.4.0
Cloud Based Sql Engine Using Spark
⭐
25
Cloud-based SQL engine using SPARK where data is accessible as JDBC/ODBC data source via Spark ThriftServer.
Movies Analytics In Spark And Scala
⭐
24
Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.
Recommendsystem
⭐
24
电影推荐系统
Awesome Sparklyr
⭐
22
An awesome sparklyr related package collection
Prettier Sql
⭐
22
[ARCHIVED] Please use https://github.com/sql-formatter-org/sql-formatter
Spark_pot
⭐
22
Sparglim
⭐
22
Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!
Spark Data Sources
⭐
22
Developing Spark External Data Sources using the V2 API
Spark Workshop
⭐
22
Code examples and docker environment for Spark
Sparkplug
⭐
20
Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌
Data Engineering Zoomcamp
⭐
20
Data Engineering examples covering Airflow and Mage for workflows; dbt for BigQuery, Redshift, ClickHouse; Spark and Kafka for Batch/Streaming Processing
Pre Lt Raster Frames
⭐
19
Spark DataFrames for earth observation data
Spark Cassandra Bulkreader
⭐
19
Spark-Cassandra Bulk Reader CASSANDRA-16222
Spark Addb Connector
⭐
19
Implementation to connect SparkSQL and Redis in form of Relational Model
Spark And Mllib Projects
⭐
18
This repository contains Spark, MLlib, PySpark and Dataframes projects
Sparkprogramminginscala
⭐
18
Apache Spark Course Material
Bigdatatutorial
⭐
18
bigdatatutorial
Spark Aws Messaging
⭐
17
A custom sink provider for Apache Spark that sends the content of a dataframe to an AWS SQS
Albis
⭐
17
Albis: High-Performance File Format for Big Data Systems
Resume Bjkonglu
⭐
17
记录Spark、Flink研究经验
Recsys_spark
⭐
16
Spark SQL 实现 ItemCF,UserCF,Swing,推荐系统,协同过滤
Spark2 Etl Examples
⭐
16
A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
1-100 of 171 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.