Awesome Open Source
Search results for spark mapreduce
77 search results found
Data Science Ipython Notebooks
⭐
25,668
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Bigdata Notes
⭐
14,872
A getting-started guide to big data ⭐
Cookbook
⭐
12,557
The Data Engineering Cookbook
Dev Setup
⭐
5,802
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools like Vim, Sublime Text, Bash, iTerm, Python data analysis, Spark, Hadoop MapReduce, AWS, Heroku, JavaScript web development, Android development, common data stores, and dev-based OS X defaults.
Dpark
⭐
2,637
Python clone of Spark, a MapReduce-like framework in Python
Bigdata Interview
⭐
1,397
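🎯 🌟 [Big data interview questions] Big data interview questions I collected from around the web, with my own summarized answers. Currently includes Hadoop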
Bigdata Growth
⭐
1,256
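A big data knowledge repository covering data warehouse modeling, real-time computing, big data, data middle platforms, system design, Java, algorithms, and more.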
Data Algorithms Book
⭐
973
MapReduce, Spark, Java, and Scala for Data Algorithms Book
Mobius
⭐
942
C# and F# language binding and extensions to Apache Spark
Cdap
⭐
735
An open source framework for building data analytic applications.
Compass
⭐
284
Compass is a task diagnosis platform for bigdata
Firestorm
⭐
240
Firestorm is a Remote Shuffle Service, and provides the capability for Apache Spark and Apache Hadoop MapReduce applications to store shuffle data on remote servers
Hadoop Docker
⭐
210
A Docker-based Hadoop development and test environment, including Hadoop, Hive, HBase, and Spark
Juicy Bigdata
⭐
162
🎉🎉🐳 Datawhale Introduction to Big Data Processing tutorial | The opening course of the big data technology track 🎉🎉
Learning Hadoop And Spark
⭐
160
Companion to the Learning Hadoop and Learning Spark courses on LinkedIn Learning
Bigdata In Practice
⭐
154
Hands-on big data projects: Hadoop, Spark, Kafka, HBase, Flink, ...
Data Algorithms With Spark
⭐
151
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Huaweicloud Mrs Example
⭐
150
Examples for HUAWEI CLOUD MRS.
Big Data Mapreduce Course
⭐
135
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
Python Bigdata
⭐
128
Data science and Big Data with Python
Aliyun Emapreduce Demo
⭐
123
Asakusafw
⭐
113
Asakusa Framework
Distributed Statistical Computing
⭐
99
Teaching Materials for Distributed Statistical Computing (big data distributed computing)
Spark With Python
⭐
98
Fundamentals of Spark with Python (using PySpark), code examples
Focusbigdata
⭐
89
[The road to big data mastery: learning path + interview experiences + résumés]
Bespin
⭐
74
Reference implementations of data-intensive algorithms in MapReduce and Spark
Pybigdata
⭐
56
Working with various big data components from Python
Big_data
⭐
55
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Connected Component
⭐
52
MapReduce implementation of Connected Components on Apache Spark
Simr
⭐
45
Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure
Examples
⭐
44
A SimRank algorithm implementation using Spark
Code Of Spark Big Data Business Trilogy
⭐
42
This is code of book "Spark Big Data Business Trilogy"
Devops
⭐
40
DevOps
Hanhan Spark Python
⭐
40
Uses Spark Core (Python), Spark SQL, Spark MLlib, and Spark Streaming
Hadoop Guide
⭐
36
🐘 Study notes on big data frameworks such as HDFS, Yarn, MapReduce, HBase, Hive, Pig, Sqoop, Flume, and Zoo
Data Infra Projects
⭐
34
List of some interesting projects
Pyspark Algorithms
⭐
33
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Nyyellowtaxiproject
⭐
27
Big Data project using Hadoop (MapReduce, Spark, Hive)
K Medoid
⭐
25
Bigdata Doc
⭐
25
Big data study notes, learning roadmaps, and a collection of technical case studies.
Mlbd
⭐
20
Materials for "Machine Learning on Big Data" course
Data Pipeline Project
⭐
18
Data pipeline project
Spark And Mllib Projects
⭐
18
This repository contains Spark, MLlib, PySpark and Dataframes projects
Lectures Hse Spark
⭐
17
Scalable machine learning and big data analysis with Apache Spark
Qs Hadoop
⭐
17
Learning the big data ecosystem
Spark
⭐
16
Python 2.7 code and learning notes for Spark 2.1.1
Yandex Big Data Engineering
⭐
15
Pyspark
⭐
15
Spark (Scala and Python)
Big Data Course
⭐
14
Practice course on Big Data
Rasppi Cluster
⭐
14
An efficient quick-start tool for building a Raspberry Pi (or Debian-based) cluster with popular ecosystem tools like Hadoop and Spark
Bigdata Projects
⭐
14
Student projects in Big Data field.
Copybookinputformat
⭐
14
Using JRecord to build a mapred and mapreduce inputformat for HDFS, MAPREDUCE, PIG, HIVE, Spark, ...
Bigdata Learning
⭐
14
Big data learning, mainly covering Kafka, ZooKeeper, Hive, HBase, and Spark
Obd
⭐
13
Tools of Big Data (Outils de Big Data)
Blog
⭐
13
Bigdataguide
⭐
11
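I landed a job in the autumn recruitment season through self-study. Self-study is hard, so I want to compile a detailed set of big data development materials, covering fundamentals | architecture | source code, to help other self-learners avoid detours. For related questions, follow the WeChat official account 大数据老刘 and contact Lao Liu!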
Mongodb Hadoop Workshop
⭐
11
MongoDB-Hadoop Workshop Exercises
Big_data_course_rimini_2021
⭐
11
This repository contains all the teaching material used during the "Laboratorio Big Data" course, run in collaboration with the municipality of Rimini.
Dijkstra Hadoop Spark
⭐
10
Dijkstra Algorithm - Python Hadoop Streaming and Pyspark
Ddu
⭐
10
good good study, day day up (recording every bit of what I learn)
Aadhaar Dataset Analysis
⭐
10
An analysis of the Aadhaar dataset using MapReduce and Spark
Emr Demo
⭐
10
Project files for the post: Running PySpark Applications on Amazon EMR: Methods for Interacting with PySpark on Amazon Elastic MapReduce.
Cloud Computing
⭐
9
Web repository for the "Cloud Programming Models" module
Spark Dbscan
⭐
9
DBSCAN clustering algorithm implemented in Apache Spark (MapReduce Framework).
Coursera_bigdata_ucsd
⭐
8
UCSD Big Data Specialization General Materials and my Capstone Project.
Hands On Hadoop
⭐
8
Hadoop, MapReduce, HDFS, Spark, Pig, Hive, HBase, MongoDB, Cassandra, Flume - the list goes on! Over 25 technologies.
Bigdata
⭐
8
Coding exercises and research on the component technologies of big data pipelines
Hadoop Hands On
⭐
8
Learning how to tame Big Data with Hadoop and related technologies
Sparklearning_notebook
⭐
8
Spark learning notebook
Bigdata Essentials
⭐
7
All big-data-related tools/frameworks in one central repo.
Inverted Index
⭐
7
An implementation of an inverted index in MapReduce and Spark
Spark_ver_bigdatahw_jiuzhang
⭐
7
Homework for the Big Data course at Jiuzhang, re-written in Python and Spark!
Spark For Noobs By A Noob
⭐
7
Jupyter notebooks for learning PySpark
Big Data Knowledge
⭐
6
📖 A collection of big data knowledge
Scache
⭐
6
A distributed memory cache system for shuffle in map-reduce
Star Join Spark
⭐
6
Two efficient algorithms to process Star Joins using the Spark framework: Spark Bloom-Filtered Cascade Join (SBFCJ) and the Spark Broadcast Join (SBJ)
Map_reduce Ntua
⭐
6
Lab exercise on MapReduce for the Advanced Topics in Database Systems course at NTUA
Flint
⭐
5
Main repository of the Flint project for Spark and Amazon EMR.
Youtubedataanalysis
⭐
5
Large-scale data analysis on a YouTube dataset using Spark and Hadoop
Veloxmr
⭐
5
Data processing component of the Velox Big Data Framework (VBDF)
Related Searches
Scala Spark (3,279)
Python Spark (2,053)
Java Spark (1,587)
Apache Spark (1,207)
Spark Hadoop (1,188)
Jupyter Notebook Spark (1,151)
Spark Kafka (985)
Hadoop Mapreduce (847)
Spark Streaming (817)
Spark Pyspark (812)
1-77 of 77 search results
Copyright 2018-2024 Awesome Open Source. All rights reserved.