Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for scala hive
hive
x
scala
x
78 search results found
Bigdata Notes
⭐
14,872
大数据入门指南 ⭐
Bigdataguide
⭐
2,355
大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料
Szt Bigdata
⭐
2,055
深圳地铁大数据客流分析系统🚇🚄🌟
Kyuubi
⭐
1,849
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
Movie_recommend
⭐
1,441
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
Carbondata
⭐
1,401
High performance data store solution
Stream Reactor
⭐
960
A collection of open source Apache 2.0 Kafka Connector maintained by Lenses.io.
Scriptis
⭐
767
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Blinkdb
⭐
625
BlinkDB: Sub-Second Approximate Queries on Very Large Data.
Connectors
⭐
383
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.
Flink Notes
⭐
223
flink学习笔记
Xsql
⭐
207
Unified SQL Analytics Engine Based on SparkSQL
Spark 2.3.1
⭐
174
Spark-2.3.1源码解读
Spark Authorizer
⭐
158
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Eel Sdk
⭐
140
Big Data Toolkit for the JVM
Spark Atlas Connector
⭐
112
A Spark Atlas connector to track data lineage in Apache Atlas
Schedoscope
⭐
95
Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or whatever you choose to call your Hadoop data warehouse these days.
Smart Data Lake
⭐
87
Smart Automation Tool for building modern Data Lakes and Data Pipelines
Flowman
⭐
85
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
Spark Acid
⭐
79
ACID Data Source for Apache Spark based on Hive ACID
Bigdata Learning Notes
⭐
79
Binlogupdatetohive
⭐
68
mysql数据实时增量导入hive
Spark Gpu
⭐
61
Spark GPU and SIMD Support
Spark Training
⭐
52
Repository used for Spark Trainings
Til
⭐
51
Today I Learned
Spark Hive Udf
⭐
47
Example project showing how to use Hive UDFs in Apache Spark
Itachi
⭐
46
A library that brings useful functions from various modern database management systems to Apache Spark
Hadoop Spark Hive Cluster Docker
⭐
45
hadoop-spark-hive-cluster-docker
Xgbspark Text Classification
⭐
43
XGBoost on Spark for Chinese Text Classification
Xxhadoop
⭐
37
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Litemall Dw
⭐
36
基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;
Sharpetl
⭐
36
Write ETL using your favorite SQL dialects
Hive Serde Schema Gen
⭐
32
📋 Generate Hive SerDe schema from a .json file.
Squerall
⭐
27
An implementation of the so-called Semantic Data Lake, using Apache Spark and Presto.
Spark Hive Streaming Sink
⭐
26
A sink to save Spark Structured Streaming DataFrame into Hive table
Daflow
⭐
24
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Spark_log_data
⭐
21
Flume-to-Spark-Streaming Log Parser
Data Model Generator
⭐
20
Data model generator based on Scala case classes
Flamy
⭐
17
the database manager for Apache Hive
Spark2 Etl Examples
⭐
16
A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
Spark Waimai
⭐
16
基于spark的外卖大数据平台分析系统
Salesforce2hadoop
⭐
16
Import Salesforce data into Hadoop HDFS in Avro format
Spark Hive Streaming Sink
⭐
15
A sink to save Spark Structured Streaming DataFrame into Hive table
Featurestore
⭐
15
Building blocks and patterns for building data prep transformations and feature engineering in Spark.
Bigdata
⭐
15
小白大数据学习笔记 ⭐
Bigdata Learning
⭐
14
大数据学习,主要涉及Kafka、ZooKeeper、Hive、HBase、Spark
Hivetophoenix
⭐
13
An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase
Bigdata News
⭐
12
基于Spark2.2新闻网大数据实时系统项目
Strata Tutorial 2016 Nyc
⭐
11
Hive Json Schema Gen
⭐
11
Generates Hive schema from JSON
Octopufs
⭐
11
OctopuFS library helps managing cloud storage, ADLSgen2 specifically. It allows you to operate on files (moving, copying, setting ACLs) in very efficient manner. Designed to work on databricks, but should work on any other platform as well.
Hive Jdbc Proxy
⭐
11
Hive-JDBC-Proxy是一个高性能的HiveServer2和Spark ThriftServer的代理服务,具备负载均衡、基于规则转发Hive JDBC Client的请求给到HiveServer2和Spark ThriftServer的能力。
All Kinds Book
⭐
10
java 大数据 spark flink redis hive hbase kafka 面试题 数据结构 算法 设计模式
Spark
⭐
10
Netflix branches of Apache Spark
Berilia
⭐
10
Create hadoop cluster in aws ec2 for development
Spark Workshop
⭐
10
Big Data Pipeline
⭐
9
Big Data
Janusgraph Data Importer
⭐
9
janusgraph data import
Hive
⭐
9
Hive as of Toronto International Electroacoustic Symposium 2014
Bigdatademo
⭐
9
The demo of using Kafka, Spark, Hive, Cassandra, etc by using Docker. It produces the production ready environment for any kinds of big data project relates to Hadoop ecosystem
Hbaseetl
⭐
8
Spark HbaseETL Tools. Support bulk
Cloudera Cca175
⭐
8
CCA Spark and Hadoop Developer Certification
Sgx Spark
⭐
7
SGX-Spark
Etl Processes Using Sqoop Hadoop Hive Spark And Scala
⭐
7
I implemented various ETL processes like loading the data using sqoop from mysql to hdfs, transform the data using Spark and Scala, perform analytics using Spark and Scala and loading the data back to HDFS.
Diane
⭐
7
Hive helper functions for apache spark users
1kuang_datas
⭐
7
亿矿云大数据处理框架:借助Hadoop、Spark、Storm等分布式处理架构,满足海量数据的批处理 亿矿云大数据预处理:运用数据冗余剔除、异常检测、归一化等方法对原始数据进行清洗,为后续存储、管理与分 亿矿云大数据存储与管理:通过分布式文件系统、NoSQL数据库、关系数据库、时序数据库等不同的数据管理
Bigdata
⭐
6
小白大数据学习笔记,学习路线,技术路线
Neptune
⭐
6
Neptune Execution Framework for Stream/Batch Spark Applications
Data Misc Tools
⭐
6
This project hosts several tools to help with development using Hive, Spark
Loganalysis
⭐
6
日志分析项目
Spark Es Csv
⭐
6
spark export hdfs file to json or csv
Yl Spark Sql
⭐
6
一个Spark SQL方言,增强了批处理、机器学习、模型服务等语义;基于统一的SQL语法,提供了一个ETL、机器学习
Analysisofuserbehaviors
⭐
5
基于spark的电商用户行为分析系统
Example
⭐
5
HbaseETL
Bear
⭐
5
A Hive metadata dump tool
Example Spark Scala Read And Write From Hive
⭐
5
Cuesheet Starter Kit
⭐
5
A minimal skeleton code for developing Apache Spark applications with CueSheet
Related Searches
Scala Sbt (4,178)
Scala Spark (3,279)
Scala Akka (2,120)
Java Scala (1,794)
Scala Play Framework (1,309)
Plugin Scala (1,079)
Scala Kafka (969)
Scala Functional Programming (942)
Scala Scalajs (887)
Docker Scala (728)
1-78 of 78 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.