Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for mapreduce
mapreduce
x
671 search results found
Guitar
⭐
85
A Simple and Efficient Distributed Multidimensional BI Analysis Engine.
Mapreduce
⭐
83
MapReduce by examples
Programmingwithscalding
⭐
81
Programming MapReduce with Scalding
Solutions Google Compute Engine Cluster For Hadoop
⭐
81
This sample app will get up and running quickly with a Hadoop cluster on Google Compute Engine. For more information on running Hadoop on GCE, read the papers at https://cloud.google.com/resources/.
Riak_function_contrib
⭐
80
Riak Function Contrib
Chukwa
⭐
78
Mirror of Apache Chukwa
Studyalgorithm
⭐
76
用Go 语言实现基础算法
Bespin
⭐
74
Reference implementations of data-intensive algorithms in MapReduce and Spark
Pithy
⭐
73
Fast compression / decompression library.
Guagua
⭐
72
An iterative computing framework for both Hadoop MapReduce and Hadoop YARN.
Hbase Orm
⭐
72
A production-grade HBase ORM library that makes accessing HBase clean, fast and fun (Can also be used as Bigtable ORM)
Hadoop Map Reduce Patterns
⭐
71
Hadoop Map-Reduce Design Patterns
Rail
⭐
70
Scalable RNA-seq analysis
Scala Hadoop
⭐
70
Using Hadoop with Scala
Tangseng
⭐
69
Tangseng search engine including full text search and vector search base on golang. 基于go语言的搜索引擎,信息检索系统
Channels
⭐
68
go channel patterns
Mongoid Mapreduce
⭐
67
Mongoid MapReduce provides simple aggregation functions to your models using MongoDB map/reduce
Hadoop Bam
⭐
66
Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework
Hadoop Java Example
⭐
66
A very simple example of using Hadoop's MapReduce functionality in Java.
K Means
⭐
65
K-Means Clustering using MapReduce
Elixir Iteraptor
⭐
64
Handy enumerable operations implementation.
Hive Io Experimental
⭐
62
Hive I/O Library
Src
⭐
62
A light-weight distributed stream computing framework for Golang
Phoenix
⭐
61
an API and runtime environment for data processing with MapReduce for shared-memory multi-core & multiprocessor systems.
Chess
⭐
60
A MapReduce job to explore blunders in chess games.
Pypar
⭐
58
Efficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.
Notes
⭐
58
:octocat:This is a learning note | Java基础,JVM,源码,大数据,面经
Mit6.824 2017 Chinese
⭐
57
A Chinese version of MIT 6.824 (Distributed System)
Pybigdata
⭐
56
使用 python 操作大数据的各种组件
Social Graph Analysis
⭐
56
Social Graph Analysis using Elastic MapReduce and PyPy
Big_data
⭐
55
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Clickhouse Hdfs Loader
⭐
54
loading hdfs data to clickhouse
Snabler
⭐
54
Parallel Algorithms in Python for Hadoop/Mapreduce
Odps_book
⭐
54
source code for my book on odps
Mapreduce Demo
⭐
53
Hadoop,MapReduce编程学习练手实例
Mlhadoop
⭐
53
This repository contains Machine-Learning MapReduce codes for Hadoop which are written from scratch (without using any package or library). E.g. Prediction (Linear and Logistic Regression), Clustering (K-Means), Classification (KNN) etc.
Prosto
⭐
53
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
Connected Component
⭐
52
Map Reduce Implementation of Connected Component on Apache Spark
Elasticrawl
⭐
50
Launch AWS Elastic MapReduce jobs that process Common Crawl data.
Hadoop_vision
⭐
49
Example code for "Web-Scale Computer Vision using MapReduce for Multimedia Data Mining"
Iflux2
⭐
48
Reactive state container (based on immutable) for React or ReactNative, inspired by mapreduce.
Json Mapreduce
⭐
48
InputFormat that can split multi-line JSON
Hadoop Papyrus
⭐
48
Hadoop MapReduce DSL framework by Ruby. Changed from hadoop-rubydsl.
Mapred
⭐
47
NodeJS native MapReduce implementation
Hadoop Sstable
⭐
47
Splittable Input Format for Reading Cassandra SSTables Directly
Sharpdups
⭐
46
find duplicate file with C# parallel MapReduce compute using quick hashing, quick search
Hbasedoc_cn
⭐
46
HBase 0.95版中文文档翻译
Hadoop
⭐
46
A Hanborq optimized Hadoop Distribution, especially with high performance of MapReduce. It's the core part of HDH (Hanborq Distribution with Hadoop for Big Data Engineering).
Simr
⭐
45
Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure
Ipdc
⭐
44
IPDC(InterPlanetary Distributed Computing) is the Distributed Computation service, A peer-to-peer hypermedia protocol to make the computation faster, open, and more scalable.
Mit 6.824 Labs
⭐
44
MIT 6.824 (Distributed Systems) labs in Go
Locis
⭐
44
Implementation of "A Parallel Spatial Co-location Mining Algorithm Based on MapReduce" paper
Examples
⭐
44
A SimRank algorithm implementation using Spark
Machine_learning_in_action_py3
⭐
44
Important book about the machine learning algorithms, and introduces the application of those who use these algorithms and tools, and how to use them in a real environment. This book and other books, behind the other books are long on machine learning theory knowledge, the book happened to be more discussion on how to use coded machine learning algorithms.
Mit 6.824 Distributed Systems
⭐
43
Template repository to work on the labs from MIT 6.824 Distributed Systems course.
Impatient Cascalog
⭐
43
Cascalog for the Impatient
Meteor Mongo Extensions
⭐
42
Very simple implementation of some of mongodb aggregation framework functions for Meteor
Code Of Spark Big Data Business Trilogy
⭐
42
This is code of book "Spark Big Data Business Trilogy"
P3
⭐
42
An open source pcap packet and NetFlow file analysis tool using Hadoop MapReduce and Hive.
Gomrjob
⭐
41
gomrjob - a Go Framework for Hadoop Map Reduce Jobs
Barclamp Pig
⭐
41
[UNMAINTAINED] Hadoop Pig: Mapreduce Programming component
Telemetry Server
⭐
41
Server for the Mozilla Telemetry project
Starfish
⭐
41
Starfish is a utility to make distributed programming ridiculously easy
Pallet Hadoop
⭐
41
Hadoop Cluster Management with Intelligent Defaults
Pig
⭐
40
Package for Apache Pig support in Sublime Text 2 and 3
Metis
⭐
40
MapReduce for multi-core
Hanhan Spark Python
⭐
40
Used Spark core python, Spark sql, Spark MLlib, Spark Streaming
Devops
⭐
40
DevOps
Pattern Matching
⭐
39
Hadoop MapReduce over Hive based implementation of attributed network pattern matching.
Sizzle
⭐
39
A compiler and runtime for Google's Sawzall language, optimized for Hadoop
Mpms
⭐
39
Simple python Multiprocesses-Multithreads queue 简易Python多进程-多线程任务队列, 也能做简单的MapReduce, 自用性质,请勿用于生产环境
Gpumapreduce
⭐
38
panda, a heterogeneous mapreduce framework on gpu and cpu cluster
Csds Material
⭐
38
Course material for the Computer Systems for Data Science class at Columbia
Unoexample
⭐
38
MapReduce/Hadoop example that uses regular playing cards to show mapping and reducing.
163 Bigdate Note
⭐
38
bigdata note
Tutorial
⭐
37
Original tutorials, share innocently, download free. 原创教程,分享无罪,下载免费。
Hack_parallel
⭐
37
The core parallel and shared memory library used by Hack, Flow, and Pyre
Inferno Lab
⭐
36
Inferno Lab - experiments in Inferno OS and Limbo -
Foldscuda.jl
⭐
36
Data-parallelism on CUDA using Transducers.jl and for loops (FLoops.jl)
Pyxtension
⭐
36
Pure Python extensions library that includes Scala-like streams, Json with attribute access syntax, and other common use stuff
Hadoop Guide
⭐
36
🐘 关于 HDFS,Yarn,MapReduce,HBase,Hive,Pig,Sqoop,Flume,Zoo 等大数据框架的学习笔记
Replephant
⭐
35
A Clojure library to interactively analyze Hadoop cluster usage via REPL
Kassandramrhelper
⭐
35
Library for processing Cassandra SSTables with Hadoop MapReduce.
Nfldata
⭐
35
Combining datasets with MapReduce on NFL play by play data.
Intellij Hadoop
⭐
35
Run Hadoop program using Intellij
Cc Warc Examples
⭐
35
CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop
Terraform Aws Emr Cluster
⭐
35
A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.
Hadoop_exporter
⭐
35
A hadoop exporter for prometheus, scrape hadoop metrics (including HDFS, YARN, MAPREDUCE, HBASE. etc.) from hadoop components jmx url.
Accumulo Examples
⭐
34
Apache Accumulo Examples
Remap
⭐
34
MapReduce platform in python
Data Infra Projects
⭐
34
List of some interesting projects
Riak_mapreduce_utils
⭐
34
Library containing map/reduce utility functions for Riak implemented in erlang.
Haskell_hadoop
⭐
34
Haskell module for streaming hadoop MapReduce jobs
Avro2parquet
⭐
33
Hadoop MapReduce tool to convert Avro data files to Parquet format.
Cc Helloworld
⭐
33
CommonCrawl Hello World example
Pyspark Algorithms
⭐
33
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Lrmr
⭐
33
Less-Resilient MapReduce framework for Go
Mossrose
⭐
32
Light Distributed Job Framework
Skynet
⭐
32
Marklogic Contentpump
⭐
32
MarkLogic Contentpump (mlcp)
Related Searches
Hadoop Mapreduce (851)
Java Mapreduce (759)
Python Mapreduce (383)
101-200 of 671 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.