Cascading

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster.
Alternatives To Cascading
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Data Science Ipython Notebooks25,668
9 months ago34otherPython
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Bigdata Notes14,872
6 months ago39Java
大数据入门指南 :star:
Cookbook12,557
6 months ago111apache-2.0
The Data Engineering Cookbook
Hive5,222
6 months ago89apache-2.0Java
Apache Hive
Scalding3,4333740a year ago43September 14, 2016319apache-2.0Scala
A Scala API for Cascading
Mrjob2,58411222 years ago62December 15, 2021211otherPython
Run MapReduce jobs on Hadoop or Amazon Web Services
Poseidon1,543
7 years ago9bsd-3-clauseGo
A search engine which can hold 100 trillion lines of log data.
Mongo Hadoop1,51178102 years ago14January 27, 201716Java
MongoDB Connector for Hadoop
Bigdata Interview1,397
3 years agon,ull
:dart: :star2:[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结
Bigdata Growth1,256
3 months ago1mitShell
大数据知识仓库涉及到数据仓库建模、实时计算、大数据、数据中台、系统设计、Java、算法等。
Alternatives To Cascading
Select To Compare


Alternative Project Comparisons
Popular Hadoop Projects
Popular Mapreduce Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Java
Dsl
Hadoop
Mapreduce