Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for data science hadoop
data-science
x
hadoop
x
23 search results found
Data Science Ipython Notebooks
⭐
25,668
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Trino
⭐
9,118
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
H2o 3
⭐
6,618
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Dist Keras
⭐
611
Distributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Griffon Vm
⭐
129
Griffon Data Science Virtual Machine
Ni
⭐
81
Say "ni" to data of any size
Big Data
⭐
37
Python tools for big data
Awesome Tools
⭐
32
curated list of awesome tools and libraries for specific domains
Learning Spark
⭐
29
Tidy up Spark and Hadoop tutorials.
Practical Data Science With Hadoop And Spark
⭐
23
Springboard Data Science Immersive
⭐
23
Data Science Ebooks
⭐
19
Data Science E-books, Interview Resources and Cheat-sheets
Etl Starter Kit
⭐
18
📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Multidim
⭐
18
Visualising Multi Dimensional Data
Interview Notes
⭐
15
有关Python、大数据、MySQL的总结
Cheatsheets For Ai
⭐
14
Cheatsheets on numerous topics ranging from DataScience | ML | DL | AI | Big Data.
Big_data_course_rimini_2021
⭐
11
Questa repository contiene tutto il materiale didattico utilizzato durante il corso di "Laboratorio Big Data" in collaborazione con il comune di Rimini.
Conch Bigdata
⭐
10
Big Data
Pstl
⭐
10
Parallel Streaming Transformation Loader
Coursera_bigdata_ucsd
⭐
8
UCSD Big Data Specialization General Materials and my Capstone Project.
Sparklyclean
⭐
6
Optimal distributed data deduplication and supervised learning pipeline using Apache Spark
Datascience Playground
⭐
6
A scalable, cloud-ready environment for Data Science using Docker
Learning Scala For Data Science
⭐
6
Data Science: Scala for brave and impatient
Data Engineer Portfolio
⭐
6
This is a repository to demonstrate my details, skills, projects and to keep track of my progression in Data Analytics and Data Science topics.
Open_data_science_for_healthcare
⭐
5
Everanalyzer
⭐
5
EverAnalyzer is my thesis in the Department of Digital Systems of the University of Piraeus. EverAnalyzer is a platform for collecting, preprocessing, processing and analyzing Big Data from the Twitter platform.
Related Searches
Python Data Science (6,905)
Machine Learning Data Science (5,390)
Jupyter Notebook Data Science (4,053)
Java Hadoop (2,117)
Spark Hadoop (1,188)
R Data Science (1,164)
Hadoop Hdfs (1,082)
Deep Learning Data Science (1,039)
Html Data Science (872)
Hadoop Mapreduce (851)
1-23 of 23 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.