Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for hive hdfs
hdfs
x
hive
x
131 search results found
Bigdata Notes
⭐
14,872
大数据入门指南 ⭐
God Of Bigdata
⭐
8,483
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive.
Bigdata Growth
⭐
1,256
大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。
Addax
⭐
1,034
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
Zdh_web
⭐
379
大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批
Bigdata
⭐
358
💎🔥大数据学习笔记
Omniduct
⭐
247
A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (including HDFS, Hive, Presto, MySQL, etc).
Bigdata_docker
⭐
226
Big Data Ecosystem Docker
Juicy Bigdata
⭐
162
🎉🎉🐳 Datawhale大数据处理导论教程 | 大数据技术方向的开篇课程🎉🎉
Hdfs_fdw
⭐
131
PostgreSQL foreign data wrapper for HDFS
Hadoopdemo
⭐
128
Hadoop简单应用案例,包括MapReduce、单词统计、HDFS基本操作、web日志分析、Zoo
Kylin Docker
⭐
116
This repository trackes the code and files for building docker image with Apache Kylin.
Wifi
⭐
95
基于wifi抓取信息的大数据查询分析系统
My Tutorial
⭐
93
我想构建形成自己的知识的体系,工作职位是大数据,所以主要还是以大数据为主,从主流框架Hadoop,S 大数据开发是很繁琐的,正确的运行环境是成功的第一步,所以我尽量从搭建,部署,开发整个流程都做出来,单
Jaws Spark Sql Rest
⭐
92
Devops Perl Tools
⭐
88
25+ DevOps CLI Tools - Anonymizer, SQL ReCaser (MySQL, PostgreSQL, AWS Redshift, Snowflake, Apache Drill, Hive, Impala, Cassandra CQL, Microsoft SQL Server, Oracle, Couchbase N1QL, Dockerfiles), Hadoop HDFS & Hive tools, Solr/SolrCloud CLI, Nginx stats & HTTP(S) URL watchers for load-balanced web farms, Linux tools etc.
Hadoop_cookbook
⭐
80
Cookbook to install Hadoop 2.0+ using Chef
Pxf
⭐
76
Platform Extension Framework: Federated Query Engine
Datamingproject
⭐
67
大数据平台相关代码(ES/Hive/Hadoop/hdfs/hbase)
Tempto
⭐
60
A testing framework for Presto
Mylearningnotes
⭐
58
Because its never late to start taking notes and 'public' it...
Flume Canal Source
⭐
54
Flume NG Canal source
Bigdataparty
⭐
54
大数据组件 All-in-One 的 Dockerfile
Getl
⭐
51
A tool for developing and testing ETL and ELT processes for automating the capture, delivery and processing of information in data warehouses on the MicroFocus Vertica platform.
Ooziesamples
⭐
48
Oozie Samples
Flume Kafka Storm
⭐
45
大数据实时计算的基础框架
Hadoop Unit
⭐
45
Hadoop-Unit is a project which allow testing projects which need hadoop ecosysteme like kafka, solr, hdfs, hive, hbase, ...
163 Bigdate Note
⭐
38
bigdata note
Csds Material
⭐
38
Course material for the Computer Systems for Data Science class at Columbia
Xxhadoop
⭐
37
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Hadoop Guide
⭐
36
🐘 关于 HDFS,Yarn,MapReduce,HBase,Hive,Pig,Sqoop,Flume,Zoo 等大数据框架的学习笔记
Nfldata
⭐
35
Combining datasets with MapReduce on NFL play by play data.
Bigdatatools
⭐
35
tools for bigData
Opendataplatform
⭐
34
An open source, enterprise-scale, vendor-neutral data platform accelerating solution delivery.
Minipipe
⭐
30
Minipipe: a minimal end-to-end data pipeline
Hive Solr
⭐
30
使用Hive读写solr
Aaocp
⭐
27
一个对用户行为日志进行分析的大数据项目
Hive Tools
⭐
27
Kiji Bento
⭐
26
Kiji BentoBox: Developer SDK for Kiji including a standalone zero-configuration HBase micro-cluster
Bigdata Doc
⭐
25
大数据学习笔记,学习路线,技术案例整理。
Csv To Orc
⭐
24
Convert a CSV fle to ORCFile
Mongo Hive
⭐
24
Load your MongoDB collection into Hive. Supports complex JSON structure.
Cobol To Hive
⭐
24
Serde for Cobol Layout to Hive table
Backup Hadoop And Hive
⭐
22
Tempto
⭐
22
A testing framework for Trino
Bigdata Tutorial
⭐
22
Spark_log_data
⭐
21
Flume-to-Spark-Streaming Log Parser
Apache Spark Docker
⭐
21
Dockerizing an Apache Spark Standalone Cluster
Bigdata_platform
⭐
20
大数据建模分析平台
Tpc H Impala
⭐
18
TPC-H Benchmark on Cloudera Impala
Jun_bigdata
⭐
18
jun_bigdata大数据平台服务框架。实现了Kafka实时数据过滤、清洗、转换、消费,实现了Sp SQL对Redis、MongoDB等非关系型数据库的数据的读写;集成了规则引擎,可基于规则引擎实现客
Hive_to_es
⭐
18
同步Hive数据仓库数据到Elasticsearch的小工具
Hive Presto Docker
⭐
18
Hadoop, Hive and PrestoDB for deployment using Docker
Hadoop Etl Udfs
⭐
17
The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Jmx_exporter Cloudera Hadoop
⭐
17
Prometheus jmx_exporter configurations for Cloudera Hadoop
Datax Src
⭐
16
DataX 是异构数据广泛使用的离线数据同步工具/平台,实现包括 MySQL、Oracle、SqlServer、Postgre、HDFS、Hive、ADS、HBase 等各种异构数据源之间高效的数据同步功能。
Spark2 Etl Examples
⭐
16
A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
Salesforce2hadoop
⭐
16
Import Salesforce data into Hadoop HDFS in Avro format
Hive_compared_bq
⭐
16
hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.
Hadoop
⭐
16
Hadoop Docker Nn
⭐
16
hadoop-docker集群
Featurestore
⭐
15
Building blocks and patterns for building data prep transformations and feature engineering in Spark.
Hdfs Spark Hive Dev Setup
⭐
15
This repository contains makescript and instruction on how to setup local hdfs+spark+hive setup.
Yandex Big Data Engineering
⭐
15
Bigdata
⭐
15
小白大数据学习笔记 ⭐
Bi_project
⭐
14
一个简单的Hive项目,使用了Sqoop、Hadoop、Hive、MySQL,对电商数据进行分析
Big Data Course
⭐
14
Practice course on Big Data
Copybookinputformat
⭐
14
Using JRecord to build a mapred and mapreduce inputformat for HDFS, MAPREDUCE, PIG, HIVE, Spark, ...
Cloudera Framework
⭐
12
Bigdataguide
⭐
11
秋招自学上岸,自学太难了,想总结一份详细的大数据开发资料,包括基础 | 架构 | 源码,让更多自学的伙伴少走弯路。 有相关问题可以添加公众号:大数据老刘,联系老刘!
Avro Flume Hive Example
⭐
11
Hive_merge
⭐
11
Merge Small files for Hive Table on HDFS
Oscon Bigtop
⭐
11
Presentation on Apache Bigtop at OSCON 2013
Fm
⭐
11
using FM latent vectors as embedding features
Bigdata_stack
⭐
10
Dockerized Hadoop/Minio/Hive/Presto stack
Thrive
⭐
10
Thrive is an ETL framework that runs single-row transformations on HDFS data and makes the data available in relational databases (Hive and Vertica).
Bigdata Etl Pipeline
⭐
10
The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a complete data pipeline with all components seamlessly set up and ready to use
Nifi Templates
⭐
10
This is a GitHub for all of my NiFi Templates
Uhp
⭐
9
uhp for ucweb
Youtubereadme
⭐
9
Pxf_demo
⭐
9
Demo related artifacts/datasets for PXF
Docker Impala
⭐
9
Run Impala in a Docker container.
Bigdata Docker
⭐
9
Run Hadoop Cluster within Docker Containers.
Docker Impala Kudu
⭐
9
Bigdatademo
⭐
9
The demo of using Kafka, Spark, Hive, Cassandra, etc by using Docker. It produces the production ready environment for any kinds of big data project relates to Hadoop ecosystem
Vagrant Jilla Hadoop
⭐
8
Vagrant setup to spin up vm hadoop cluster
Hdp3_upgrade_utils
⭐
8
Assist with HDP 3 Upgrade Planning
Hands On Hadoop
⭐
8
Hadoop, MapReduce, HDFS, Spark, Pig, Hive, HBase, MongoDB, Cassandra, Flume - the list goes on! Over 25 technologies.
Spark Yarn Hadoop Cluster Vagrant
⭐
8
Vagrant project to spin up a cluster of 4 nodes with Spark, YARN and Hadoop
Cloudera Cca175
⭐
8
CCA Spark and Hadoop Developer Certification
Nyc 311 Data Analytics
⭐
7
Hive, Python, Tableau and more..
Theft Market
⭐
7
Infrastructure for analyzing historical real estate data
Binlog2hive
⭐
7
MySQL增量数据实时同步到HDFS/Hive
Hdfs_to_cos_tools
⭐
7
用于将HDFS上的数据拷贝到COS上
Hdata
⭐
7
数据抽取工具
Hadoop Loganalysis
⭐
7
基于论坛的apache common日志分析项目 🍁
Avrotoolbox
⭐
7
ArcGIS toolbox to process feature classes in Apache Avro and Parquet format
Hdp21 Twitter Demo
⭐
7
'Hello world' Storm topology to analyze financial tweets
Tidyr.big
⭐
7
Scalable backend for tidyr
2018 Hadoop
⭐
7
存放代码资源,交流大数据开发技术。共同成长,一同进步。
Related Searches
Hadoop Hdfs (1,082)
Java Hdfs (752)
Hadoop Hive (703)
Java Hive (697)
Spark Hdfs (573)
Spark Hive (529)
1-100 of 131 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.