Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for java hdfs
hdfs
x
java
x
279 search results found
Cat
⭐
18,237
CAT 作为服务端项目基础组件,提供了 Java, C/C++, Node.js, Python, Go 等多语言客户端,已经在美团点评的基础架构中间件框架(MVC框架,RPC框架,数据库框架,缓存框架等,
Bigdata Notes
⭐
14,872
大数据入门指南 ⭐
Mycat Server
⭐
9,431
God Of Bigdata
⭐
8,483
专注大数据学习面试,大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive.
Xlearning
⭐
1,729
AI on Hadoop
Hopsworks
⭐
1,041
Hopsworks - Data-Intensive AI platform with a Feature Store
Addax
⭐
1,034
Addax is a versatile open-source ETL tool that can seamlessly transfer data between various RDBMS and NoSQL databases, making it an ideal solution for data migration.
Sqoop
⭐
820
Mirror of Apache Sqoop
Kafka Connect Hdfs
⭐
473
Kafka Connect HDFS connector
Storm Yarn
⭐
419
Storm-yarn enables Storm clusters to be deployed into machines managed by Hadoop YARN.
Zdh_web
⭐
379
大数据采集,抽取平台,zdh_web是zdh系列服务的可视化管理平台,包含数据采集,调度,权限,审批
Kite
⭐
366
Kite SDK
Bigdata
⭐
358
💎🔥大数据学习笔记
Cloudeon
⭐
345
CloudEon uses Kubernetes to install and deploy open-source big data components, enabling the containerized operation of an open-source big data platform. This allows you to reduce your focus on underlying resource management and maintenance.
Hops
⭐
285
Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.
Divolte Collector
⭐
275
Divolte Collector
Bigdata File Viewer
⭐
269
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Faunus
⭐
259
Graph Analytics Engine
Hadoop Mini Clusters
⭐
251
hadoop-mini-clusters provides an easy way to test Hadoop projects directly in your IDE
Rumble
⭐
194
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Wifiprobeanalysis
⭐
189
基于WIFI探针的商业大数据分析技术
Terrapin
⭐
168
Serving system for batch generated data sets
Dcos Commons
⭐
162
DC/OS SDK is a collection of tools, libraries, and documentation for easy integration of technologies such as Kafka, Cassandra, HDFS, Spark, and TensorFlow with DC/OS.
Bigdata In Practice
⭐
154
大数据实践项目 Hadoop、Spark、Kafka、Hbase、Flink.....
Lambda Arch
⭐
151
A full big data pipeline (Lambda Architecture) with Spark, Kafka, HDFS and Cassandra.
Distributed Graph Analytics
⭐
135
Distributed Graph Analytics (DGA) is a compendium of graph analytics written for Bulk-Synchronous-Parallel (BSP) processing frameworks such as Giraph and GraphX. The analytics included are High Betweenness Set Extraction, Weakly Connected Components, Page Rank, Leaf Compression, and Louvain Modularity.
Hadoop Hdfs Fsimage Exporter
⭐
131
Exports Hadoop HDFS content statistics to Prometheus
Hdfs Shell
⭐
129
HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Hadoopdemo
⭐
128
Hadoop简单应用案例,包括MapReduce、单词统计、HDFS基本操作、web日志分析、Zoo
Ssm
⭐
123
Smart Storage Management for Big Data, a comprehensive hot/cold data optimized solution
Rubix
⭐
121
Cache File System optimized for columnar formats and object stores
Mpich2 Yarn
⭐
112
Running MPICH2 on Yarn
Dynamometer
⭐
110
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Nnanalytics
⭐
106
NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance.
Kafka Connect Fs
⭐
106
Kafka Connect FileSystem Connector
Play Videos In Hdfs
⭐
102
This project realizes playing videos storing in HDFS(Hadoop) in the web page online.在线播放HDFS中视频文件
Wifi
⭐
95
基于wifi抓取信息的大数据查询分析系统
My Tutorial
⭐
93
我想构建形成自己的知识的体系,工作职位是大数据,所以主要还是以大数据为主,从主流框架Hadoop,S 大数据开发是很繁琐的,正确的运行环境是成功的第一步,所以我尽量从搭建,部署,开发整个流程都做出来,单
Camus
⭐
87
Mirror of Linkedin's Camus
Kafka Hadoop Loader
⭐
84
Hadoop Job for schemaless incremental loading of messages from Kafka topics onto hdfs with configurable output partitioning. 🚫
Trino Storage
⭐
79
Storage connector for Trino
Chukwa
⭐
78
Mirror of Apache Chukwa
Pxf
⭐
76
Platform Extension Framework: Federated Query Engine
Euphoria
⭐
74
Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model which can express both batch and stream transformations.
Datamingproject
⭐
67
大数据平台相关代码(ES/Hive/Hadoop/hdfs/hbase)
Hdfs Over Ftp
⭐
65
FTP server which works on a top of HDFS
Hdfs File Slurper
⭐
64
Utility to easily copy files into HDFS
Hdfs Netdisc
⭐
61
基于Hadoop的分布式云存储系统 🌴
Rainbow
⭐
61
A data layout optimization framework for wide tables stored on HDFS. See rainbow's webpage
Storm Hdfs
⭐
61
Storm components for interacting with HDFS file systems
Tempto
⭐
60
A testing framework for Presto
Cloud Note
⭐
59
基于分布式的云笔记(参考某道云笔记),数据存储在redis与hbase中
Nnproxy
⭐
58
Scalable NameNode RPC Proxy for HDFS Federation
Shapefile
⭐
56
Java library to read point and polygon shape files
Elasticsearch Hdfs
⭐
56
Hadoop Plugin for ElasticSearch
Flume Canal Source
⭐
54
Flume NG Canal source
Clickhouse Hdfs Loader
⭐
54
loading hdfs data to clickhouse
Mapreduce Demo
⭐
53
Hadoop,MapReduce编程学习练手实例
Spark Compaction
⭐
52
File compaction tool that runs on top of the Spark framework.
Arabesque
⭐
50
Scalable Graph Mining
Hdfs Metadata
⭐
48
Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks and nodes.
Jsr203 Hadoop
⭐
47
A Java NIO file system provider for HDFS
Hadoop Unit
⭐
45
Hadoop-Unit is a project which allow testing projects which need hadoop ecosysteme like kafka, solr, hdfs, hive, hbase, ...
Entrada
⭐
44
Entrada - A tool for DNS big data analytics
Hfsa
⭐
44
Hadoop FSImage Analyzer (HFSA)
Flink On Azure
⭐
44
Examples of Flink on Azure
Dk Sqoop Plus
⭐
42
本项目在原生sqoop 的基础上进行更多实用功能的扩展。
Samples
⭐
40
This repository contains open-source sample applications for IBM Streams.
Garmadon
⭐
39
Java event logs collector for hadoop and frameworks
Hbase Meta Repair
⭐
39
Repair hbase metadata table from hdfs.
163 Bigdate Note
⭐
38
bigdata note
Data Polygamy
⭐
38
Data Polygamy is a topology-based framework that allows users to query for statistically significant relationships between spatio-temporal data sets.
Csds Material
⭐
38
Course material for the Computer Systems for Data Science class at Columbia
Xxhadoop
⭐
37
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Bigdatatools
⭐
35
tools for bigData
Nfldata
⭐
35
Combining datasets with MapReduce on NFL play by play data.
Hadoop
⭐
34
Public hadoop release repository
Cipher
⭐
33
基于hdfs spark的视频非结构化数据计算
Splunk Shuttl
⭐
33
Splunk app for archive management, including HDFS support.
Hadoop Cli
⭐
31
HADOOP-CLI is an interactive command line shell that makes interacting with the Hadoop Distribted Filesystem (HDFS) simpler and more intuitive than the standard command-line tools that come with Hadoop. If you're familiar with OS X, Linux, or even Windows terminal/console-based applications, then you are likely familiar with features such as tab completion, command history, and ANSI formatting.
Bigfatlm
⭐
30
Hadoop MapReduce training of modified Kneser-Ney smoothed language models
Hive Solr
⭐
30
使用Hive读写solr
Arvo2parquet
⭐
30
Example program that writes Parquet formatted data to plain files (i.e., not Hadoop hdfs); Parquet is a columnar storage format.
Mapreduce
⭐
29
清华大数据作业MapReduce处理几百个G的JSON数据
Mongoreduce
⭐
29
Hadoop Input and Ouput formats for MongoDB
Learning Spark
⭐
29
Tidy up Spark and Hadoop tutorials.
Fourinone
⭐
29
Hcfsfuse
⭐
28
Hive Tools
⭐
27
Sbk
⭐
27
Storage Benchmark Kit
Bigdatas
⭐
27
this is a db-hdfs tools used to transfer big database datas to hadoop hdfs like sqoop,but bboss bigdata tool is very nice monitor and event drivered model,and high perfermance,support Distributed executor tasks Ability.
Hdfs Over Webdav
⭐
26
a customized version of origin hdfs-webdav from iponweb.net to support Hadoop 0.20.1
Sparkproject
⭐
26
Using Apache Spark in an ArcMap Toolbox
Hdfs Cli
⭐
26
Interactive shell for interacting with Hadoop HDFS. Supports multiple HDFS hosts, command line history and tab completion.
Kafka Parquet Writer
⭐
26
This project provides a compenent that reads logs from Kafka and writes it as parquet file on HDFS.
Kiji Bento
⭐
26
Kiji BentoBox: Developer SDK for Kiji including a standalone zero-configuration HBase micro-cluster
Fsbrowser
⭐
25
Fast desktop client for Hadoop Distributed File System
Example Java Read And Write From Hdfs
⭐
25
Example for Saagie Wiki - Read and write to HDFS with Java
Demo Spark Sensor Data
⭐
25
Demo Spark application to transform data gathered on sensors for a heatmap application
Spash
⭐
25
Spash
Related Searches
Java Docker (6,180)
Java Database (6,015)
Java Mysql (5,954)
Java Sdk (5,864)
Java Apache (4,283)
Java Json (3,692)
Java Command Line (3,457)
Python Java (3,215)
Java Jdbc (2,549)
Java Hadoop (2,117)
1-100 of 279 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.