Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language |
---|---|---|---|---|---|---|---|---|---|---|
Data Science Ipython Notebooks | 24,829 | 17 hours ago | 34 | other | Python | |||||
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines. | ||||||||||
Bigdata Notes | 13,291 | 2 months ago | 33 | Java | ||||||
大数据入门指南 :star: | ||||||||||
Cookbook | 11,362 | 3 months ago | 108 | apache-2.0 | ||||||
The Data Engineering Cookbook | ||||||||||
Hive | 4,733 | 15 hours ago | 92 | apache-2.0 | Java | |||||
Apache Hive | ||||||||||
Scalding | 3,358 | 37 | 40 | a year ago | 43 | September 14, 2016 | 318 | apache-2.0 | Scala | |
A Scala API for Cascading | ||||||||||
Mrjob | 2,584 | 112 | 1 | 6 months ago | 62 | September 17, 2020 | 211 | other | Python | |
Run MapReduce jobs on Hadoop or Amazon Web Services | ||||||||||
Poseidon | 1,543 | 6 years ago | 9 | bsd-3-clause | Go | |||||
A search engine which can hold 100 trillion lines of log data. | ||||||||||
Mongo Hadoop | 1,511 | 78 | 9 | a year ago | 14 | January 27, 2017 | 16 | Java | ||
MongoDB Connector for Hadoop | ||||||||||
Bigdata Interview | 1,237 | 2 years ago | n,ull | |||||||
:dart: :star2:[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结 | ||||||||||
Data Algorithms Book | 973 | a year ago | 14 | other | Java | |||||
MapReduce, Spark, Java, and Scala for Data Algorithms Book |
Brandyn White [email protected] Andrew Miller [email protected]
Source https://github.com/bwhite/hadoopy/ Issues https://github.com/bwhite/hadoopy/issues Docs http://bwhite.github.com/hadoopy/
IRC: #hadoopy @ freenode.net
Requirements python development headers (python-dev), build tools (build-essential)
Optional cython (>=.13) (without this it falls back to the pregenerated .c files)
Features
Limitations
Used in
Ubuntu Install (others are similar) sudo apt-get install python-dev build-essential sudo python setup.py install