Awesome Open Source

Programming Languages

Search results for hive spark sql

14 search results found

Kyuubi ⭐ 1,849

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Hadoop_study ⭐ 817

定期更新Hadoop生态圈中常用大数据组件文档重心依次为: Flink Solr Sparksql ES Scala Kafka Hbase/phoenix Redis Kerberos (项目包含hadoop思维导图印象笔记 Scala版本简单demo 常用工具类去敏后的train code 持续更新!!!)

Yanagishima ⭐ 584

Web UI for Trino, Hive and SparkSQL

Jupysql ⭐ 261

Better SQL in Jupyter. 📊

Unified SQL Analytics Engine Based on SparkSQL

SQLFlow is a bridge that connects a SQL engine, e.g. MySQL, Hive, SparkSQL or SQL Server, with TensorFlow and other machine learning toolkits. SQLFlow extends the SQL language to enable model training, prediction and inference.

Apachespark ⭐ 59

This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which we need in our real life experience as a data engineer. We will be using pyspark & sparksql for the development. At the end of the course we also cover few case studies.

Sharpetl ⭐ 36

Write ETL using your favorite SQL dialects

Litemall Dw ⭐ 36

基于开源Litemall电商项目的大数据项目，包含前端埋点(openresty+lua)、后端埋点；

Prettier Sql ⭐ 22

[ARCHIVED] Please use https://github.com/sql-formatter-org/sql-formatter

Spark2 Etl Examples ⭐ 16

A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0

Hivetophoenix ⭐ 13

An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase

Swimlane Graphs ⭐ 12

Swimlane graphs for Hive, SparkSQL, and Presto based on Ganglia resource graphs

Hive Jdbc Proxy ⭐ 11

Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务，具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。

Huemul Bigdatagovernance ⭐ 10

Huemul BigDataGovernance, es una framework que trabaja sobre Spark, Hive y HDFS. Permite la implementación de una estrategia corporativa de dato único, basada en buenas prácticas de Gobierno de Datos. Permite implementar tablas con control de Primary Key y Foreing Key al insertar y actualizar datos utilizando la librería, Validación de nulos, largos de textos, máximos/mínimos de números y fechas, valores únicos y valores por default. También permite clasificar los campos en aplicabilidad de der

Kyuubi Docker ⭐ 9

Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.

Retailanalytics ⭐ 9

Hortonworks Data Platform Retail Analytics Demo

Run TPC-DS against different databases including Hive, Spark SQL and IBM BigSQL

Distributable_docker_sql_on_hadoop ⭐ 6

Toy Hadoop cluster combining various SQL-on-Hadoop variants

Loganalysis ⭐ 6

日志分析项目

Msc In Machine Learning And Artificial Intelligence ⭐ 6

Master of Science in Machine Learning & Artificial Intelligence - Indian Institute Technology Madras & Liverpool John Moores University

Helthcare System ⭐ 6

Data cleaning, pre-processing, and Analytics on a Health care data using Spark and Python.

Spark Streaming Kafka Demo ⭐ 5

spark streaming从kafka读取消息，offset写入Redis，spark计算单词出现频率，最后

Spark Ais Multi ⭐ 5

Import, Partition and Query AIS Data using SparkSQL

Related Searches

Hadoop Hive (709)

Java Hive (697)

Spark Hive (529)

Python Hive (376)

Hive Hdfs (338)

Shell Hive (286)

Scala Hive (228)

1-14 of 14 search results

Privacy | About | Terms | Follow Us On Twitter

Copyright 2018-2024 Awesome Open Source. All rights reserved.