Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Data Science Ipython Notebooks | 25,668 | | | | 6 months ago | | | 34 | other | Python |
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. | ||||||||||
Dev Setup | 5,802 | | | | 2 years ago | | | 34 | other | Python |
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults. | ||||||||||
Mrjob | 2,584 | | 112 | 2 | a year ago | 62 | December 15, 2021 | 211 | other | Python |
Run MapReduce jobs on Hadoop or Amazon Web Services | ||||||||||
Corral | 652 | | | | 3 years ago | 1 | May 28, 2021 | 4 | mit | Go |
🐎 A serverless MapReduce framework written for AWS Lambda | ||||||||||
Lambda Refarch Mapreduce | 355 | | | | 5 years ago | | | 7 | other | JavaScript |
This repo presents a reference architecture for running serverless MapReduce jobs. This has been implemented using AWS Lambda and Amazon S3. | ||||||||||
Learning Hadoop And Spark | 160 | | | | 5 months ago | | | | apache-2.0 | HTML |
Companion to the Learning Hadoop and Learning Spark courses on LinkedIn Learning | ||||||||||
Rail | 70 | | | | 3 years ago | | | 26 | other | Python |
Scalable RNA-seq analysis | ||||||||||
Elasticrawl | 50 | | 1 | | 7 years ago | 10 | February 15, 2017 | 1 | mit | Ruby |
Launch AWS Elastic MapReduce jobs that process Common Crawl data. | ||||||||||
Csds Material | 38 | | | | 6 years ago | | | 1 | | Java |
Course material for the Computer Systems for Data Science class at Columbia | ||||||||||
Terraform Aws Emr Cluster | 35 | | | | 4 years ago | | | 3 | apache-2.0 | HCL |
A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster. |