Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for hadoop mapreduce
hadoop
x
mapreduce
x
214 search results found
Data Science Ipython Notebooks
⭐
25,668
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Bigdata Notes
⭐
14,872
大数据入门指南 ⭐
Cookbook
⭐
12,557
The Data Engineering Cookbook
Hive
⭐
5,222
Apache Hive
Mrjob
⭐
2,584
Run MapReduce jobs on Hadoop or Amazon Web Services
Bigdata Growth
⭐
1,625
大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。
Mongo Hadoop
⭐
1,511
MongoDB Connector for Hadoop
Bigdata Interview
⭐
1,397
🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop
Data Algorithms Book
⭐
973
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Cdap
⭐
735
An open source framework for building data analytic applications.
Bigdata Ecosystem
⭐
536
BigData Ecosystem Dataset
Scoobi
⭐
485
A Scala productivity framework for Hadoop.
Bigdata
⭐
358
💎🔥大数据学习笔记
Cascading
⭐
337
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.
Behemoth
⭐
284
Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
Compass
⭐
284
Compass is a task diagnosis platform for bigdata
Parkour
⭐
261
Hadoop MapReduce in idiomatic Clojure.
Hadoopy
⭐
244
Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.
Hadoop Docker
⭐
210
基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark
Wonderdog
⭐
193
Bulk loading for elastic search
Terrapin
⭐
168
Serving system for batch generated data sets
Juicy Bigdata
⭐
162
🎉🎉🐳 Datawhale大数据处理导论教程 | 大数据技术方向的开篇课程🎉🎉
Learning Hadoop And Spark
⭐
160
Companion to Learning Hadoop and Learning Spark courses on Linked In Learning
Cc Mrjob
⭐
157
Demonstration of using Python to process the Common Crawl dataset with the mrjob framework
Hadoop R
⭐
135
Example code for running R on Hadoop
Hadoopdemo
⭐
128
Hadoop简单应用案例,包括MapReduce、单词统计、HDFS基本操作、web日志分析、Zoo
Ctenopharyngodon Idella
⭐
125
Hadoop, MapReduce Distributed Crawling of Data Information from All Chinese Universities. (Hadoop,mapreduce分布式爬取掌上高考的所有中国大学数据)
Sequenceiq Samples
⭐
119
SequenceIQ Hadoop examples
Asakusafw
⭐
113
Asakusa Framework
Gora
⭐
111
The Apache Gora open source framework provides an in-memory data model and persistence for big data.
Avro Hadoop Starter
⭐
111
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Dynamometer
⭐
110
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Hadron
⭐
110
Construct and run Hadoop MapReduce programs in Haskell
Introtohadoopandmr__udacity_course
⭐
103
🐘 Source code for assignments of Udacity course "Introduction to Hadoop and MapReduce"
Distributed Statistical Computing
⭐
99
Teaching Materials for Distributed Statistical Computing (大数据分布式计算教学材料)
Spark With Python
⭐
98
Fundamentals of Spark with Python (using PySpark), code examples
Focusbigdata
⭐
89
【大数据成神之路学习路径+面经+简历】
Elastic Mapreduce Ruby
⭐
86
Amazon's elastic mapreduce ruby client. Ruby 1.9.X compatible
Lemur
⭐
85
Lemur is a tool to launch hadoop jobs locally or on EMR, based on a configuration file, referred to as a jobdef. The jobdef file describes your EMR cluster, local environment, pre- and post-actions and zero or more "steps".
Chukwa
⭐
78
Mirror of Apache Chukwa
Guagua
⭐
72
An iterative computing framework for both Hadoop MapReduce and Hadoop YARN.
Hadoop Map Reduce Patterns
⭐
71
Hadoop Map-Reduce Design Patterns
Scala Hadoop
⭐
70
Using Hadoop with Scala
Hadoop Bam
⭐
66
Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework
Hadoop Java Example
⭐
66
A very simple example of using Hadoop's MapReduce functionality in Java.
Hive Io Experimental
⭐
62
Hive I/O Library
Src
⭐
62
A light-weight distributed stream computing framework for Golang
Pybigdata
⭐
56
使用 python 操作大数据的各种组件
Big_data
⭐
55
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Snabler
⭐
54
Parallel Algorithms in Python for Hadoop/Mapreduce
Mlhadoop
⭐
53
This repository contains Machine-Learning MapReduce codes for Hadoop which are written from scratch (without using any package or library). E.g. Prediction (Linear and Logistic Regression), Clustering (K-Means), Classification (KNN) etc.
Mapreduce Demo
⭐
53
Hadoop,MapReduce编程学习练手实例
Hadoop Papyrus
⭐
48
Hadoop MapReduce DSL framework by Ruby. Changed from hadoop-rubydsl.
Hbasedoc_cn
⭐
46
HBase 0.95版中文文档翻译
Simr
⭐
45
Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure
Machine_learning_in_action_py3
⭐
44
Important book about the machine learning algorithms, and introduces the application of those who use these algorithms and tools, and how to use them in a real environment. This book and other books, behind the other books are long on machine learning theory knowledge, the book happened to be more discussion on how to use coded machine learning algorithms.
P3
⭐
42
An open source pcap packet and NetFlow file analysis tool using Hadoop MapReduce and Hive.
Code Of Spark Big Data Business Trilogy
⭐
42
This is code of book "Spark Big Data Business Trilogy"
Pallet Hadoop
⭐
41
Hadoop Cluster Management with Intelligent Defaults
Gomrjob
⭐
41
gomrjob - a Go Framework for Hadoop Map Reduce Jobs
Barclamp Pig
⭐
41
[UNMAINTAINED] Hadoop Pig: Mapreduce Programming component
Devops
⭐
40
DevOps
Sizzle
⭐
39
A compiler and runtime for Google's Sawzall language, optimized for Hadoop
Csds Material
⭐
38
Course material for the Computer Systems for Data Science class at Columbia
Hadoop Guide
⭐
36
🐘 关于 HDFS,Yarn,MapReduce,HBase,Hive,Pig,Sqoop,Flume,Zoo 等大数据框架的学习笔记
Cc Warc Examples
⭐
35
CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop
Kassandramrhelper
⭐
35
Library for processing Cassandra SSTables with Hadoop MapReduce.
Haskell_hadoop
⭐
34
Haskell module for streaming hadoop MapReduce jobs
Data Infra Projects
⭐
34
List of some interesting projects
Avro2parquet
⭐
33
Hadoop MapReduce tool to convert Avro data files to Parquet format.
Cc Helloworld
⭐
33
CommonCrawl Hello World example
Marklogic Contentpump
⭐
32
MarkLogic Contentpump (mlcp)
Efflux
⭐
31
Easy Hadoop Streaming and MapReduce interfaces in Rust
Bigfatlm
⭐
30
Hadoop MapReduce training of modified Kneser-Ney smoothed language models
Hive Mr3
⭐
29
Hive for MR3
Emr S3 Io
⭐
29
Hadoop IO for Amazon S3
Nativetask
⭐
29
Hadoop task level native runtime
Mongoreduce
⭐
29
Hadoop Input and Ouput formats for MongoDB
Ansible Hadoop
⭐
28
THIS REPOSITORY IS VERY OUTDATED. See Ansible Galaxy instead.
Sculptor
⭐
26
Ankus
⭐
26
Rosetta Scone
⭐
25
A collection of MapReduce tasks translated (from Pig, Hive, MapReduce streaming, Cascalog, etc.) into Scalding.
Bigdata Doc
⭐
25
大数据学习笔记,学习路线,技术案例整理。
Hadooplearning
⭐
25
全套大数据基础学习教程,包含最基础的centos、maven。大数据主要包含hdfs、mr、yarn
K Medoid
⭐
25
Maven Archetype Hadoop
⭐
24
Provides a simple archetype to create MapReduce jobs with Maven.
Learn Hadoop And Spark
⭐
22
This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.
Azure Documentdb Hadoop
⭐
22
Hadoop Binary Analysis
⭐
20
Framework that makes processing arbitrary binary data in Hadoop easier
Vessel
⭐
20
Elixir MapReduce interfaces with Hadoop Streaming integration
Docker Hadoop Base
⭐
20
Hadoop(Common/HDFS/YARN/MapReduce) docker image based on alpine
Ceteri Mapred
⭐
19
MapReduce examples
Mining Frequent Pattern From Search History
⭐
19
《大数据挖掘技术》@复旦 课程项目,试图从搜狗实验室用户查询日志数据(2008)中找出搜索记录中有较高支持度关键词的频繁二项集 Hadoop 集群,并且用 Python 实现了 Parallel FP-Growth 算法中的三个 MapReduce 过程。
Phadoop
⭐
19
Map/reduce jobs for Hadoop in PHP
Hadoop Python Tutorial
⭐
18
Exercises and examples developed for the Hadoop with Python tutorial
Data Pipeline Project
⭐
18
Data pipeline project
Hadoop And Swift Integration
⭐
18
API to run Hadoop MapReduce programs over Swift
Bigdata Practice
⭐
18
🤘 常用大数据工具学习实战,包含Hadoop、HBase、Kafka、ClickHouse、Hive、R
Mipr
⭐
17
MapReduce Image Processing framework for Hadoop
Qs Hadoop
⭐
17
大数据生态圈学习
Related Searches
Java Hadoop (2,117)
Spark Hadoop (1,188)
Hadoop Hdfs (1,082)
Shell Hadoop (766)
Python Hadoop (761)
Java Mapreduce (759)
Hadoop Hive (703)
Apache Hadoop (514)
Scala Hadoop (479)
Hadoop Hbase (470)
1-100 of 214 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2025 Awesome Open Source. All rights reserved.