Awesome Open Source
Search results for java hive
254 search results found
Bigdata Notes
⭐
14,872
A beginner's guide to big data ⭐
Trino
⭐
9,118
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
God Of Bigdata
⭐
8,483
Focused on big data study and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/HBase/Hive.
Hive
⭐
5,222
Apache Hive
Dataspherestudio
⭐
2,860
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Airpal
⭐
2,758
Web UI for PrestoDB.
Bigdataguide
⭐
2,355
Big data learning from scratch, including study videos for each stage of learning and interview materials
Parquet Mr
⭐
2,296
Apache Parquet
Quicksql
⭐
1,939
A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources
Drill
⭐
1,856
Apache Drill is a distributed MPP query layer for self-describing data
Secor
⭐
1,828
Secor is a service implementing Kafka log persistence
Mongo Hadoop
⭐
1,511
MongoDB Connector for Hadoop
Movie_recommend
⭐
1,441
A Spark-based movie recommendation system, including a crawler project, a website, an admin back end, and the Spark recommendation system
Carbondata
⭐
1,401
High performance data store solution
Taier
⭐
1,220
Taier is a big data development platform for submission, scheduling, operation and maintenance, and indicator information display
Elephant Bird
⭐
1,100
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
Addax
⭐
1,034
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
Hadoop_study
⭐
817
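Regularly updated documentation for commonly used big data components in the Hadoop ecosystem. Focus, in order: Flink, Solr, Spark SQL, ES, Scala, Kafka, HBase/Phoenix, Redis, Kerberos. (The project includes Hadoop mind maps, Evernote notes, simple Scala demos, common utility classes, and desensitized train code. Continuously updated!)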
Nessie
⭐
762
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
Hive Json Serde
⭐
706
Read - Write JSON SerDe for Apache Hive.
Coral
⭐
680
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
Yanagishima
⭐
584
Web UI for Trino, Hive and SparkSQL
Tadpolefordbtools
⭐
512
Connectors
⭐
383
This library allows Scala and Java-based projects (including Apache Flink, Apache Hive, Apache Beam, and PrestoDB) to read from and write to Delta Lake.
Zdh_web
⭐
379
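A big data collection and extraction platform. zdh_web is the visual management platform for the zdh family of services, covering data collection, scheduling, permissions, and approval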
Bigdata
⭐
358
💎🔥 Big data study notes
Tutorials
⭐
321
StreamSets Tutorials
Hive Testbench
⭐
315
Incubator Hivemall
⭐
308
Mirror of Apache Hivemall (incubating)
Transport
⭐
288
A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Apache Hive, and Presto.
Facebook Hive Udfs
⭐
259
Facebook's Hive UDFs
Helicalinsight
⭐
256
Helical Insight software is the world's first Open Source Business Intelligence framework, which helps you make sense of your data and make well-informed decisions.
Reair
⭐
254
ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.
Hive Jdbc Uber Jar
⭐
252
Hive JDBC "uber" or "standalone" jar based on the latest Apache Hive version
Hive Third Functions
⭐
211
Some useful custom Hive UDFs, especially array, JSON, math, and string functions.
Emr Dynamodb Connector
⭐
210
Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB
Kettle Web
⭐
197
Calling Kettle from Java code, based on Spring Boot
Datacompare
⭐
195
Big data comparison and data profiling platform: low-code data comparison and data profiling
Hive Json Schema
⭐
188
Tool to generate a Hive schema from a JSON example doc
Bigdata Hub
⭐
187
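A knowledge system for data construction and big data technology, covering the mainstream Hadoop, Hive, Spark, and Flink frameworks and related frameworks,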
Aws Glue Data Catalog Client For Apache Hive Metastore
⭐
184
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog
Marble
⭐
160
A high performance in-memory hive sql engine based on Apache Calcite
Logparser
⭐
153
Easy parsing of Apache HTTPD and NGINX access logs with Java, Hadoop, Hive, Flink, Beam, Storm, Drill, ...
Bigdata Learning
⭐
136
A record of big data learning
Sqlsubmit
⭐
133
A Flink-based sqlSubmit program
Hadoopdemo
⭐
128
Simple Hadoop application examples, including MapReduce, word count, basic HDFS operations, web log analysis, Zoo
Xichuan_note
⭐
114
xichuan's study summary notes, covering Java, Spring, other common Java frameworks, and big data components
Avro Hadoop Starter
⭐
111
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Hive Udf
⭐
102
NexR Hive UDFs
Streamx
⭐
95
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Wifi
⭐
95
A big data query and analysis system based on information captured over Wi-Fi
My Tutorial
⭐
93
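I want to build my own body of knowledge. I work in big data, so the focus is mainly on big data, starting from the mainstream frameworks Hadoop, S Big data development is tedious, and a correct runtime environment is the first step to success, so I try to cover the whole process of setup, deployment, and development,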
Flink Sql Benchmark
⭐
92
Honu
⭐
84
Honu is a large scale data collection and processing pipeline
Spark Llap
⭐
82
Springboot Templates
⭐
79
Spring Boot integration with Dubbo and Netty; NoSQL templates for Redis and MongoDB; MQ templates for Kafka, RocketMQ, and Rabbit; Solr, SolrCloud, and Elasticsearch query engines
Howl
⭐
77
Common metadata layer for Hadoop's Map Reduce, Pig, and Hive
Pxf
⭐
76
Platform Extension Framework: Federated Query Engine
The Apache Ignite Book
⭐
72
All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Searchanalytics Bigdata
⭐
71
Customer Product search clicks analytics using big data Hadoop, Hive, Oozie, ElasticSearch, Akka, Spring Data
Beetest
⭐
70
A super simple utility for testing Apache Hive scripts locally for non-Java developers.
Hive_test
⭐
68
Unit test framework for hive and hive-service
Datamingproject
⭐
67
Code related to big data platforms (ES/Hive/Hadoop/HDFS/HBase)
Dataiku Hive Udf
⭐
63
A collection of Hive UDFs
Hive Io Experimental
⭐
62
Hive I/O Library
Tempto
⭐
60
A testing framework for Presto
Iceberg
⭐
59
A temporary home for LinkedIn's changes to Apache Iceberg (incubating)
Hiveka
⭐
59
Kafka as Hive Storage
Validatar
⭐
55
Functional testing framework for Big Data pipelines.
Exhibit
⭐
54
A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.
Flume Canal Source
⭐
54
Flume NG Canal source
Bestconf
⭐
53
A tool automatically improving the performance of large-scale systems by finding better configuration settings
Til
⭐
51
Today I Learned
Clickstream Tutorial
⭐
51
Code for Tutorial on designing clickstream analytics application using Hadoop
Phoenix Connectors
⭐
48
Apache Phoenix Connectors
Hadoop Unit
⭐
45
Hadoop-Unit is a project that allows testing projects that need Hadoop ecosystem components like Kafka, Solr, HDFS, Hive, HBase, ...
Learnbasicbigdatatech
⭐
44
🚀Some projects on Big Data Analysis like Spark, Hive, Presto and Data Visualization like Superset
P3
⭐
42
An open source pcap packet and NetFlow file analysis tool using Hadoop MapReduce and Hive.
Hive Xml Serde
⭐
40
XML Serializer/Deserializer for Apache Hive
Big Data Parent
⭐
39
Big data stack: storage, compute, related components, analysis engines, and more
Csds Material
⭐
38
Course material for the Computer Systems for Data Science class at Columbia
Hive Jdbc Driver
⭐
38
An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Xxhadoop
⭐
37
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Ts Chief
⭐
37
TS-CHIEF
Swordfish
⭐
37
Open-source distributed workflow scheduling tool that also supports streaming tasks.
Hiveqlunit
⭐
36
Test your Hive scripts inside your favorite IDE with HiveQLUnit! Increase your developers' productivity by testing on all operating systems including Windows, Linux and Mac OSX. Build continuous integration and delivery tests to control the releases of your big data products.
Litemall Dw
⭐
36
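A big data project based on the open-source Litemall e-commerce project, including front-end event tracking (openresty+lua) and back-end event tracking;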
Nfldata
⭐
35
Combining datasets with MapReduce on NFL play by play data.
Flume Logs
⭐
34
Apache Flume to process log files on Hadoop cluster
Main
⭐
32
The main - so far, only - repository for the SmileWide project.
Msgpack Hadoop
⭐
32
MessagePack-Hadoop integration provides an efficient schema-free data representation for Hadoop and Hive.
Zeus
⭐
31
Hadoop job platform
Hive Mongo
⭐
31
hive storage handler for connecting with MongoDB
Hive Solr
⭐
30
Read and write Solr using Hive
Hive Mr3
⭐
29
Hive for MR3
Ireport
⭐
29
Data analysis and statistical reporting platform
Avro Json
⭐
29
Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.
Aws Glue Catalog Sync Agent For Hive
⭐
28
Enables synchronizing metadata changes (Create/Drop table/partition) from Hive Metastore to AWS Glue Data Catalog
Hive Tools
⭐
27
Nyyellowtaxiproject
⭐
27
Big Data project using Hadoop (MapReduce, spark, Hive)
Related Searches
Java Jar (7,910)
Java Database (6,015)
Java Mysql (5,954)
Java Apache (4,283)
Java Json (3,692)
Java Sql (3,212)
Java Intellij (3,170)
Java Http (2,786)
Java Step (2,563)
Java Jdbc (2,549)
1-100 of 254 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.