Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for java mapreduce
java
x
mapreduce
x
447 search results found
Redisson
⭐
21,222
Redisson - Easy Redis Java client with features of In-Memory Data Grid. Over 50 Redis based Java objects and services: Set, Multimap, SortedSet, Map, List, Queue, Deque, Semaphore, Lock, AtomicLong, Map Reduce, Publish / Subscribe, Bloom filter, Spring Cache, Tomcat, Scheduler, JCache API, Hibernate, MyBatis, RPC, local cache ...
Bigdata Notes
⭐
13,291
大数据入门指南 ⭐️
Powerjob
⭐
5,288
Enterprise job scheduling middleware with distributed computing ability.
Hive
⭐
4,833
Apache Hive
Mongo Hadoop
⭐
1,511
MongoDB Connector for Hadoop
Data Algorithms Book
⭐
973
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Cdap
⭐
707
An open source framework for building data analytic applications.
Elephantdb
⭐
540
Distributed database specialized in exporting key/value data from Hadoop
Bigdata
⭐
358
💎🔥大数据学习笔记
Cascading
⭐
330
Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.
Behemoth
⭐
284
Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
Hadoop Connectors
⭐
267
Libraries and tools for interoperability between Hadoop-related open-source software and Google Cloud Platform.
Incubator Uniffle
⭐
254
Uniffle is a high performance, general purpose Remote Shuffle Service.
Firestorm
⭐
240
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers
Commoncrawl Crawler
⭐
208
The Common Crawl Crawler Engine and Related MapReduce code (2008-2012)
Hadoop Pcap
⭐
202
Hadoop library to read packet capture (PCAP) files
Wonderdog
⭐
193
Bulk loading for elastic search
Sharding Method
⭐
169
分表分库的新思路——服务层Sharding框架,全SQL、全数据库兼容,ACID特性与原生数据库一致
Terrapin
⭐
168
Serving system for batch generated data sets
Bigdata In Practice
⭐
154
大数据实践项目 Hadoop、Spark、Kafka、Hbase、Flink.....
Mr.lda
⭐
153
Scalable Topic Modeling using Variational Inference in MapReduce
Spatialhadoop2
⭐
148
The second generation of SpatialHadoop that ships as an extension
Hipi
⭐
128
HIPI: Hadoop Image Processing Interface
Hadoopdemo
⭐
128
Hadoop简单应用案例,包括MapReduce、单词统计、HDFS基本操作、web日志分析、Zoo
Sequenceiq Samples
⭐
119
SequenceIQ Hadoop examples
Aliyun Emapreduce Demo
⭐
116
Hackathon
⭐
114
Library and resources for hack/reduce Hackathon events
Asakusafw
⭐
113
Asakusa Framework
Avro Hadoop Starter
⭐
111
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Gora
⭐
110
The Apache Gora open source framework provides an in-memory data model and persistence for big data.
Dynamometer
⭐
110
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Introtohadoopandmr__udacity_course
⭐
103
🐘 Source code for assignments of Udacity course "Introduction to Hadoop and MapReduce"
Babar
⭐
101
Profiler for large-scale distributed java applications (Spark, Scalding, MapReduce, Hive,...) on YARN.
Crunch
⭐
100
Mirror of Apache Crunch (Incubating)
Guitar
⭐
85
A Simple and Efficient Distributed Multidimensional BI Analysis Engine.
Mapreduce
⭐
83
MapReduce by examples
Chukwa
⭐
78
Mirror of Apache Chukwa
Bespin
⭐
74
Reference implementations of data-intensive algorithms in MapReduce and Spark
Guagua
⭐
72
An iterative computing framework for both Hadoop MapReduce and Hadoop YARN.
Hbase Orm
⭐
72
A production-grade HBase ORM library that makes accessing HBase clean, fast and fun (Can also be used as Bigtable ORM)
Hadoop Map Reduce Patterns
⭐
71
Hadoop Map-Reduce Design Patterns
Hadoop Bam
⭐
66
Hadoop-BAM is a Java library for the manipulation of files in common bioinformatics formats using the Hadoop MapReduce framework
Hadoop Java Example
⭐
66
A very simple example of using Hadoop's MapReduce functionality in Java.
K Means
⭐
65
K-Means Clustering using MapReduce
Hive Io Experimental
⭐
62
Hive I/O Library
Notes
⭐
58
This is a learning note | Java基础,JVM,源码,大数据,面经
Odps_book
⭐
54
source code for my book on odps
Clickhouse Hdfs Loader
⭐
54
loading hdfs data to clickhouse
Mlhadoop
⭐
53
This repository contains Machine-Learning MapReduce codes for Hadoop which are written from scratch (without using any package or library). E.g. Prediction (Linear and Logistic Regression), Clustering (K-Means), Classification (KNN) etc.
Mapreduce Demo
⭐
53
Hadoop,MapReduce编程学习练手实例
Json Mapreduce
⭐
48
InputFormat that can split multi-line JSON
Hadoop Sstable
⭐
47
Splittable Input Format for Reading Cassandra SSTables Directly
Hadoop
⭐
46
A Hanborq optimized Hadoop Distribution, especially with high performance of MapReduce. It's the core part of HDH (Hanborq Distribution with Hadoop for Big Data Engineering).
Simr
⭐
45
Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure
P3
⭐
42
An open source pcap packet and NetFlow file analysis tool using Hadoop MapReduce and Hive.
Sizzle
⭐
39
A compiler and runtime for Google's Sawzall language, optimized for Hadoop
Pattern Matching
⭐
39
Hadoop MapReduce over Hive based implementation of attributed network pattern matching.
Csds Material
⭐
38
Course material for the Computer Systems for Data Science class at Columbia
Unoexample
⭐
38
MapReduce/Hadoop example that uses regular playing cards to show mapping and reducing.
163 Bigdate Note
⭐
38
bigdata note
Kassandramrhelper
⭐
35
Library for processing Cassandra SSTables with Hadoop MapReduce.
Cc Warc Examples
⭐
35
CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop
Intellij Hadoop
⭐
35
Run Hadoop program using Intellij
Nfldata
⭐
35
Combining datasets with MapReduce on NFL play by play data.
Marklogic Contentpump
⭐
34
MarkLogic Contentpump (mlcp)
Avro2parquet
⭐
33
Hadoop MapReduce tool to convert Avro data files to Parquet format.
Cc Helloworld
⭐
33
CommonCrawl Hello World example
Mossrose
⭐
32
Light Distributed Job Framework
Accumulo Examples
⭐
32
Apache Accumulo Examples
Bigfatlm
⭐
30
Hadoop MapReduce training of modified Kneser-Ney smoothed language models
Emr S3 Io
⭐
29
Hadoop IO for Amazon S3
Mongoreduce
⭐
29
Hadoop Input and Ouput formats for MongoDB
Mongo Deep Mapreduce
⭐
28
Use Hadoop MapReduce directly on Mongo data
Cloudsimex
⭐
28
A set of extensions for the CloudSim simulator
Nyyellowtaxiproject
⭐
27
Big Data project using Hadoop (MapReduce, spark, Hive)
Kiji Bento
⭐
26
Kiji BentoBox: Developer SDK for Kiji including a standalone zero-configuration HBase micro-cluster
Ankus
⭐
26
Interview Refresher Java Bigdata
⭐
26
a one-stop repo to lookup for code snippets of core java concepts, sql, data structures as well as big data. It also consists of interview questions asked in real-life.
Sculptor
⭐
26
Hive Mr3
⭐
25
Hive for MR3
Flink Mongodb Test
⭐
25
Flink 0.7 MongoDB example (for Hadoop2)
Maven Archetype Hadoop
⭐
24
Provides a simple archetype to create MapReduce jobs with Maven.
Azure Documentdb Hadoop
⭐
22
Ffmpeg Mr
⭐
22
MapReduce FFmpeg
Book Examples
⭐
21
Examples from Learning Hadoop 2 (Packt Publishing, 2015)
Hiveunit Mr2
⭐
21
A library to test Hive scripts with YARN and MR2
Cascading.multitool
⭐
21
Cascading.Multitool is a sed and grep command line tool for Apache Hadoop.
Hadoopcv
⭐
21
HadoopCV Hadoop,Spark Reader Video!
Zephyr
⭐
21
Zephyr is a big data, platform agnostic ETL API, with Hadoop MapReduce, Storm, and other big data bindings.
Hadoop Binary Analysis
⭐
20
Framework that makes processing arbitrary binary data in Hadoop easier
Spring Data Mongodb
⭐
20
spring-data-mongodb 操作示例
Cdh Maven Archetype
⭐
20
Cloudera Maven Archetypes
Weatherpipe
⭐
19
A MapReduce pipeline for the analysis of the NEXRAD data set in S3 - Purdue CS307 Project
Bigdata Project
⭐
19
Analyzing Uber Movement Dataset
Ooso
⭐
19
Java library for running Serverless MapReduce jobs
Data Pipeline Project
⭐
18
Data pipeline project
Bigdata Practice
⭐
18
🤘 常用大数据工具学习实战,包含Hadoop、HBase、Kafka、ClickHouse、Hive、R
Mapreduce Knn
⭐
17
(java) K nearest neighbour implementation for Hadoop MapReduce
Mapreduce Kmeans
⭐
17
Naive K-Means clustering with MapReduce
Qs Hadoop
⭐
17
大数据生态圈学习
Related Searches
Java Spring (21,495)
Java Spring Boot (11,982)
Java Gradle (8,072)
Java Game (7,956)
Java Docker (6,180)
Java Sdk (6,021)
Javascript Java (6,016)
Java Mysql (4,593)
Java Algorithms (4,524)
Java Apache (4,331)
1-100 of 447 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2023 Awesome Open Source. All rights reserved.