Awesome Open Source
Search results for hive spark sql
14 search results found
Kyuubi
⭐
1,849
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Hadoop_study
⭐
817
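Regularly updated documentation for commonly used big data components in the Hadoop ecosystem. Focus, in order: Flink, Solr, Sparksql, ES, Scala, Kafka, Hbase/phoenix, Redis, Kerberos. (The project includes Hadoop mind maps, Evernote notes, simple Scala demos, common utility classes, and desensitized train code; continuously updated!)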
Yanagishima
⭐
584
Web UI for Trino, Hive and SparkSQL
Jupysql
⭐
261
Better SQL in Jupyter. 📊
Xsql
⭐
207
Unified SQL Analytics Engine Based on SparkSQL
Sqlflow
⭐
61
SQLFlow is a bridge that connects a SQL engine, e.g. MySQL, Hive, SparkSQL or SQL Server, with TensorFlow and other machine learning toolkits. SQLFlow extends the SQL language to enable model training, prediction and inference.
Apachespark
⭐
59
This repository will help you learn Databricks concepts through examples. It covers the important topics we need in real-life work as data engineers. We will be using pyspark & sparksql for development. At the end of the course we also cover a few case studies.
Sharpetl
⭐
36
Write ETL using your favorite SQL dialects
Litemall Dw
⭐
36
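A big data project based on the open-source Litemall e-commerce project, including front-end event tracking (openresty+lua) and back-end event tracking;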
Prettier Sql
⭐
22
[ARCHIVED] Please use https://github.com/sql-formatter-org/sql-formatter
Spark2 Etl Examples
⭐
16
A project with examples of using a few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
Hivetophoenix
⭐
13
An Apache Spark app for moving data between Apache Hive and Apache Phoenix/HBase
Swimlane Graphs
⭐
12
Swimlane graphs for Hive, SparkSQL, and Presto based on Ganglia resource graphs
Hive Jdbc Proxy
⭐
11
Hive-JDBC-Proxy is a high-performance proxy service for HiveServer2 and Spark ThriftServer, capable of load balancing and of rule-based forwarding of Hive JDBC Client requests to HiveServer2 and Spark ThriftServer.
Huemul Bigdatagovernance
⭐
10
Huemul BigDataGovernance is a framework that works on top of Spark, Hive and HDFS. It enables the implementation of a corporate single-source data strategy based on Data Governance best practices. It lets you implement tables with Primary Key and Foreign Key control when inserting and updating data through the library, as well as validation of nulls, text lengths, minimum/maximum values for numbers and dates, unique values and default values. It also lets you classify fields by applicability of der
Kyuubi Docker
⭐
9
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Retailanalytics
⭐
9
Hortonworks Data Platform Retail Analytics Demo
Mytpcds
⭐
8
Run TPC-DS against different databases including Hive, Spark SQL and IBM BigSQL
Distributable_docker_sql_on_hadoop
⭐
6
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Loganalysis
⭐
6
Log analysis project
Msc In Machine Learning And Artificial Intelligence
⭐
6
Master of Science in Machine Learning & Artificial Intelligence - Indian Institute of Technology Madras & Liverpool John Moores University
Worknote
⭐
6
Helthcare System
⭐
6
Data cleaning, pre-processing, and analytics on health care data using Spark and Python.
Spark Streaming Kafka Demo
⭐
5
Spark Streaming reads messages from Kafka, writes the offsets to Redis, Spark computes word frequencies, and finally
Spark Ais Multi
⭐
5
Import, Partition and Query AIS Data using SparkSQL
Related Searches
Hadoop Hive (709)
Java Hive (697)
Spark Hive (529)
Python Hive (376)
Hive Hdfs (338)
Shell Hive (286)
Scala Hive (228)