Learn Hadoop And Spark

This repository gathers a curated list of resources for learning Hadoop for free.
Alternatives To Learn Hadoop And Spark
Project Name | Stars | Downloads | Repos Using This | Packages Using This | Most Recent Commit | Total Releases | Latest Release | Open Issues | License | Language
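Spark | Stars: 37,661 | Downloads / Repos Using This / Packages Using This: 2,394939 | Most Recent Commit: 3 months ago | Total Releases: 46 | Latest Release: May 09, 2021 | Open Issues: 186 | License: apache-2.0 | Language: Scala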
Apache Spark - A unified analytics engine for large-scale data processing
Data Science Ipython Notebooks | Stars: 25,668 | Most Recent Commit: 6 months ago | Open Issues: 34 | License: other | Language: Python
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Bigdata Notes | Stars: 14,872 | Most Recent Commit: 4 months ago | Open Issues: 39 | Language: Java
A Beginner's Guide to Big Data :star:
Cookbook | Stars: 12,557 | Most Recent Commit: 4 months ago | Open Issues: 111 | License: apache-2.0
The Data Engineering Cookbook
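Trino | Stars: 9,118 | Downloads / Repos Using This / Packages Using This: 29 | Most Recent Commit: 3 months ago | Total Releases: 83 | Latest Release: November 30, 2023 | Open Issues: 2,496 | License: apache-2.0 | Language: Java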
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
God Of Bigdata | Stars: 8,483 | Most Recent Commit: 9 months ago | Open Issues: 3
Focused on big data learning and interview preparation; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
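H2o 3 | Stars: 6,618 | Downloads / Repos Using This / Packages Using This: 6233 | Most Recent Commit: 3 months ago | Total Releases: 49 | Latest Release: August 09, 2023 | Open Issues: 2,746 | License: apache-2.0 | Language: Jupyter Notebook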
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Hive | Stars: 5,222 | Most Recent Commit: 3 months ago | Open Issues: 89 | License: apache-2.0 | Language: Java
Apache Hive
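Ignite | Stars: 4,626 | Downloads / Repos Using This / Packages Using This: 153 | Most Recent Commit: 3 months ago | Total Releases: 36 | Latest Release: May 04, 2023 | Open Issues: 729 | License: apache-2.0 | Language: Java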
Apache Ignite
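Calcite | Stars: 4,216 | Downloads / Repos Using This / Packages Using This: 390128 | Most Recent Commit: 3 months ago | Total Releases: 1,714 | Latest Release: November 07, 2023 | Open Issues: 315 | License: apache-2.0 | Language: Java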
Apache Calcite