Cc Mrjob

Demonstration of using Python to process the Common Crawl dataset with the mrjob framework
Alternatives To Cc Mrjob
| Project Name | Stars | Downloads / Repos Using This / Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language | Description |
|---|---|---|---|---|---|---|---|---|---|
| Data Science Ipython Notebooks | 25,668 | | 9 months ago | | | 34 | other | Python | Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. |
| Bigdata Notes | 14,872 | | 6 months ago | | | 39 | | Java | A getting-started guide to big data :star: |
| | | | 6 months ago | | | 111 | apache-2.0 | | The Data Engineering Cookbook |
| | | | 6 months ago | | | 89 | apache-2.0 | Java | Apache Hive |
| Scalding | 3,433 | 3740 | a year ago | 43 | September 14, 2016 | 319 | apache-2.0 | Scala | A Scala API for Cascading |
| Mrjob | 2,584 | 1122 | 2 years ago | 62 | December 15, 2021 | 211 | other | Python | Run MapReduce jobs on Hadoop or Amazon Web Services |
| | | | 7 years ago | | | 9 | bsd-3-clause | Go | A search engine which can hold 100 trillion lines of log data. |
| Mongo Hadoop | 1,511 | 7810 | 2 years ago | 14 | January 27, 2017 | 16 | | Java | MongoDB Connector for Hadoop |
| Bigdata Interview | 1,397 | | 3 years ago | | | null | | | :dart: :star2: [Big data interview questions] A collection of big data interview questions gathered online, together with the author's own answers and summaries. Currently covers interview notes for the Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper frameworks. |
| Bigdata Growth | 1,256 | | 3 months ago | | | 1 | mit | Shell | |

