Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for streaming hadoop
hadoop
x
streaming
x
62 search results found
Flink Streaming Platform Web
⭐
1,698
基于flink的实时流计算web平台
Kafka Connect Hdfs
⭐
473
Kafka Connect HDFS connector
Sylph
⭐
396
Stream computing platform for bigdata
Gridgain Old
⭐
278
Sparkstreaming
⭐
253
Spark Streaming+Flume+Kafka+HBase+Hadoop+Zookeeper实现实时日志
Hadoophp
⭐
137
A framework for writing Hadoop Streaming jobs in PHP
Opensoc Streaming
⭐
123
Extensible set of Storm topologies and topology attributes for streaming, enriching, indexing, and storing telemetry in Hadoop.
Hackathon
⭐
114
Library and resources for hack/reduce Hackathon events
Teddy
⭐
113
Spark Streaming监控平台,支持任务部署与告警、自启动
Jetstream
⭐
110
Jetstream is a streaming processing framework
Dmrgo
⭐
101
Go library for writing standalone Map/Reduce jobs or for use with Hadoop's streaming protocol
Streamx
⭐
95
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
The Apache Ignite Book
⭐
72
All code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Coursework
⭐
59
Icwsm2010_tutorial
⭐
44
example code for "Large-scale social media analysis with Hadoop" tutorial presented at ICWSM 2010
Bigdata Getting Started
⭐
37
大数据相关框架实战项目(Hadoop, Spark, Storm, Flink)
Swordfish
⭐
37
Open-source distribute workflow schedule tools, also support streaming task.
Haskell_hadoop
⭐
34
Haskell module for streaming hadoop MapReduce jobs
Awesome Tools
⭐
32
curated list of awesome tools and libraries for specific domains
Efflux
⭐
31
Easy Hadoop Streaming and MapReduce interfaces in Rust
Avro Json
⭐
29
Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.
Df_data_service
⭐
29
DataFibers Data Service
Zerowing
⭐
28
A set of tools for copying and streaming data from MongoDB into HBase
Stream To Hdfs
⭐
27
A simple utility for streaming stdin to a file in HDFS
Avro Utils
⭐
26
Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming
Iow Hadoop Streaming
⭐
26
Set of hadoop input/output formats for use in combination with hadoop streaming
Wasp
⭐
25
WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Rosetta Scone
⭐
25
A collection of MapReduce tasks translated (from Pig, Hive, MapReduce streaming, Cascalog, etc.) into Scalding.
Dse230_data_analysis_using_hadoop_and_spark_ucsd
⭐
24
Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive, Pig and Spark. The class will include assignment of analyzing large existing databases.
Data Pipeline Project
⭐
18
Data pipeline project
Twidoop
⭐
16
Write tweets from the Twitter streaming API to Hadoop
Bigdata Tech Index
⭐
16
Big Data Technology Index
Yandex Big Data Engineering
⭐
15
Blog
⭐
13
Kite Apps
⭐
12
Prescriptive Applications over Kite and Hadoop
Dijkstra Hadoop Spark
⭐
10
Dijkstra Algorithm - Python Hadoop Streaming and Pyspark
Pstl
⭐
10
Parallel Streaming Transformation Loader
Hadoop Examples
⭐
10
Hadoop Examples
Local Mapreduce
⭐
9
Simple CLI tool to simulate Hadoop MapReduce Streaming in a multicore computer
Cassandra Summit Demo
⭐
9
Hadoop integration demo for the Cassandra Summit
Node Hadoop Streaming Utils
⭐
8
Hadoop streaming utils for NodeJS
Edoop
⭐
8
A module for writing Hadoop Streaming jobs using Erlang
Perldoop
⭐
8
Efficient Execution of Perl Scripts on Hadoop Clusters
Streamingbetweenness
⭐
8
This repository hosts the code for a framework that offers streaming exact node and edge betweenness centrality, while edges can be added or removed.
Recommendation Algorithm
⭐
7
Network Based Recommendation Engine using Map-Reduce (in Python) to run on top of Hadoop!
Nicknack
⭐
7
Formatters for your hadoop streaming output
Sip
⭐
6
hadoop ruby/streaming statistically improbable phrases
Realtimeloganalyze
⭐
6
一个大数据实时流处理日志分析系统 Demo
Hadoop Python Hive Tutorial
⭐
6
A tutorial for using Hadoop with Python and Hive
Spork Streaming
⭐
6
Pig on Spark Streaming
Hadoop Streaming
⭐
6
Hadoop2.6 MapReduce2 Python3.5的一些经典入门程序:词频统计、好友推荐、PageRank
Spark Dev Training
⭐
6
Mezzanine
⭐
6
Mezzanine is a library built on Spark Streaming used to consume data from Kafka and store it into Hadoop.
Aiowebhdfs
⭐
5
A modern and async implementation of the WebHDFS API in python
Hadoop Multiple Streaming
⭐
5
hadoop-multiple-streaming is a addition to the Hadoop-Streaming which is a utility that comes with the Hadoop distribution. This utility allows you to not only do Hadoop-Streaming, but also create and run 'multiple' Map/Reduce jobs with any executable or script as the mapper and/or the reducer for 'one' input. hadoop-multiple-streaming includes Hadoop-Streaming.
Hadoop Tutorial
⭐
5
A tutorial for hadoop
Streaming Data Pipeline
⭐
5
Streaming pipeline repo for data engineering training program
Rubydoop
⭐
5
Simple Ruby Sugar for Hadoop Streaming
Hadoop_record
⭐
5
A record reader for hadoop CSV files
Hadoop Indexer
⭐
5
Distributed web crawling and Indexing with Python and the Hadoop Streaming API
Logistic Regression Sgd Mapreduce
⭐
5
A Python implementation of binary regularized logistic regression with stochastic gradient descent, packaged as scripts for use with Hadoop streaming
Rust_hadoop_streaming
⭐
5
Hadoop Streaming using Rust
Related Searches
Java Hadoop (2,117)
Javascript Streaming (2,085)
Stream Streaming (1,696)
Python Streaming (1,407)
Spark Hadoop (1,188)
Java Streaming (1,103)
Hadoop Hdfs (1,082)
Hadoop Mapreduce (851)
Spark Streaming (813)
Shell Hadoop (772)
1-62 of 62 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.