Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for mapreduce emr
emr
x
mapreduce
x
28 search results found
Mrjob
⭐
2,584
Run MapReduce jobs on Hadoop or Amazon Web Services
Mongo Hadoop
⭐
1,511
MongoDB Connector for Hadoop
Learning Hadoop And Spark
⭐
160
Companion to Learning Hadoop and Learning Spark courses on Linked In Learning
Lemur
⭐
85
Lemur is a tool to launch hadoop jobs locally or on EMR, based on a configuration file, referred to as a jobdef. The jobdef file describes your EMR cluster, local environment, pre- and post-actions and zero or more "steps".
Rail
⭐
70
Scalable RNA-seq analysis
Elasticrawl
⭐
50
Launch AWS Elastic MapReduce jobs that process Common Crawl data.
Csds Material
⭐
38
Course material for the Computer Systems for Data Science class at Columbia
Terraform Aws Emr Cluster
⭐
35
A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.
Kassandramrhelper
⭐
35
Library for processing Cassandra SSTables with Hadoop MapReduce.
Webarchive Indexing
⭐
30
Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.
Emrio
⭐
30
Elastic MapReduce instance optimizer
Common_crawl_types
⭐
28
A simple Ruby example of how to process Common Crawl files using Elastic MapReduce
Bigdata Project
⭐
19
Analyzing Uber Movement Dataset
Weatherpipe
⭐
19
A MapReduce pipeline for the analysis of the NEXRAD data set in S3 - Purdue CS307 Project
Spark And Mllib Projects
⭐
18
This repository contains Spark, MLlib, PySpark and Dataframes projects
Cs205_ga
⭐
16
How deep does Google Analytics go? Efficiently tackling Common Crawl using AWS & MapReduce
Googleplay Web Crawler
⭐
15
Mapreduce project by Hadoop, Nutch, AWS EMR, Pig, Tez, Hive
Emrmonitoring
⭐
12
Command line tool for monitoring Amazon Elastic MapReduce (Amazon EMR) jobflows and analyze past jobflows.
Big Data Architecture
⭐
10
国外互联网公司大数据技术架构研究
Hive Hbase Rdf
⭐
10
An implementation of Hive+HBase for RDF
Emr Demo
⭐
10
Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
Common_crawl
⭐
8
Simple Python MapReduce jobs for processing the Common Crawl plus command-line utilities
Flint
⭐
5
Main repository of the Flint project for Spark and Amazon EMR.
Rail Dbgap
⭐
5
Protocol for analyzing dbGaP-protected data from SRA with Amazon Elastic MapReduce
Geotweet
⭐
5
Store Twitter Streaming API output into Amazon S3 Buckets and process with EMR
Mrgob
⭐
5
Golang wrapper for Hadoop streaming
Hadoop Mrutils
⭐
5
Utility/starter/example scripts to get started with Hadoop MapReduce
Lein Emr
⭐
5
Lein plugin that servers as a wrapper over Amazon's ruby elastic-mapreduce client. Used to create job flows on Amazon Elastic MapReduce.
Related Searches
Hadoop Mapreduce (851)
Java Mapreduce (759)
Python Mapreduce (383)
1-28 of 28 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.