Awesome Open Source
Search results for jupyter notebook spark
777 search results found
Data Engineering Zoomcamp
⭐
14,007
Free Data Engineering course!
Ds Cheatsheets
⭐
11,535
List of Data Science Cheatsheets to rule the world
H2o 3
⭐
6,485
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Bigdl
⭐
4,383
Accelerating LLM with low-bit (INT3 / INT4 / NF4 / INT5 / INT8) optimizations using bigdl-llm
Helk
⭐
3,511
The Hunting ELK
Analytics Zoo
⭐
2,580
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
Almond
⭐
1,544
A Scala kernel for Jupyter
Spark Py Notebooks
⭐
1,515
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Cube Studio
⭐
1,466
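Cube Studio: an open-source, cloud-native, one-stop machine learning / deep learning AI platform; supports SSO login, multi-tenancy / multiple project groups, data asset...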
Caffeonspark
⭐
1,272
Distributed deep learning on Hadoop and Spark clusters.
Sparkmagic
⭐
1,254
Jupyter magics and kernels for working with remote Spark clusters
Pixiedust
⭐
1,030
Python Helper library for Jupyter Notebooks
Pyspark Tutorial
⭐
959
PySpark-Tutorial provides basic algorithms using PySpark
Spark Nlp Workshop
⭐
939
Public runnable examples of using John Snow Labs' NLP for Apache Spark.
Spark Scala Tutorial
⭐
922
A free tutorial for Apache Spark.
Spark Movie Lens
⭐
757
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Incubator Toree
⭐
719
Mirror of Apache Toree (Incubating)
Justenoughscalaforspark
⭐
643
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Elasticsearch Spark Recommender
⭐
603
Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch
Enterprise_gateway
⭐
574
A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.
Streaming Benchmarks
⭐
560
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...
Popmon
⭐
461
Monitor the stability of a Pandas or Spark dataframe ⚙︎
Agile_data_code_2
⭐
435
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Learningpyspark
⭐
409
Code base for the Learning PySpark book (in preparation)
Machinelearning
⭐
406
Machine Learning
Zat
⭐
402
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Spark Syntax
⭐
391
This is a repo documenting the best practices in PySpark.
Miscellaneous
⭐
373
Includes notes on Apache Spark, Spark for Physics, Jupyter notebook examples for Spark, Oracle and other DB systems.
Synapse
⭐
348
Samples for Azure Synapse Analytics
Data Engineering Projects
⭐
322
Personal Data Engineering Projects
Spark Standalone Cluster On Docker
⭐
311
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. ⚡️
Learning Pyspark
⭐
294
Code repository for Learning PySpark by Packt
Spark Jupyter Aws
⭐
255
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Kamu Cli
⭐
239
New generation decentralized data lake and a streaming data pipeline
Installations_mac_ubuntu_windows
⭐
233
Installations for Data Science. Anaconda, RStudio, Spark, TensorFlow, AWS (Amazon Web Services).
Data_science_blogs
⭐
232
A repository to keep track of all the code that I end up writing for my blog posts.
Bigdata_docker
⭐
226
Big Data Ecosystem Docker
Ngods Stocks
⭐
217
New Generation Opensource Data Stack Demo
Spark Fm Parallelsgd
⭐
212
Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (python and scala)
Rasterframes
⭐
208
Geospatial Raster support for Spark DataFrames
Data Science
⭐
206
Projects and awesome list for all Data Science fields
Bigdl Tutorials
⭐
201
Step-by-step Deep Learning Tutorials on Apache Spark using BigDL
Azure Cosmosdb Spark
⭐
194
Apache Spark Connector for Azure Cosmos DB
Rumble
⭐
194
⛈️ RumbleDB 1.21.0 "Hawthorn blossom" 🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Jupyter Spark
⭐
192
Jupyter Notebook extension for Apache Spark integration
Spark On Kubernetes Helm
⭐
187
Spark on Kubernetes infrastructure Helm charts repo
Book Resources
⭐
181
Cloud Dataproc
⭐
173
Cloud Dataproc: Samples and Utils
Mydatascienceportfolio
⭐
172
Applying Data Science and Machine Learning to Solve Real World Business Problems
Sparkmonitor
⭐
164
Monitor Apache Spark from Jupyter Notebook
Scalable Data Science Platform
⭐
153
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Spark Practice
⭐
153
Apache Spark (PySpark) Practice on Real Data
Geopyspark
⭐
151
GeoTrellis for PySpark
Spark Style Guide
⭐
145
Spark style guide
Sparknotebook
⭐
142
An example of running Apache Spark using Scala in an IPython notebook
Csdn Code
⭐
138
No longer maintained --> moved to https://github.com/vbay/tutorials
Mastering Apache Spark
⭐
130
This is the repository for my YouTube course on end-to-end Apache Spark on the AIEngineering YouTube channel
Handyspark
⭐
129
HandySpark - bringing pandas-like capabilities to Spark dataframes
Python Bigdata
⭐
128
Data science and Big Data with Python
Rikai
⭐
127
Parquet-based ML data format optimized for working with unstructured data
Beymani
⭐
126
Hadoop, Spark and Storm based anomaly detection implementations for data quality, cyber security, fraud detection etc.
Data Science Tutorials
⭐
124
Python Tutorials for Data Science
Mastering Big Data Analytics With Pyspark
⭐
118
Mastering Big Data Analytics with PySpark, Published by Packt
Spark Df Profiling
⭐
115
Create HTML profiling reports from Apache Spark DataFrames
Spark R Notebooks
⭐
109
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Spark Tpc Ds Performance Test
⭐
104
Use the TPC-DS benchmark to test Spark SQL performance
Distributed Statistical Computing
⭐
100
Teaching Materials for Distributed Statistical Computing (teaching materials for distributed big-data computing)
Spark Nlp Models
⭐
100
Models and Pipelines for the Spark NLP library
Spark With Python
⭐
98
Fundamentals of Spark with Python (using PySpark), code examples
Medium Articles
⭐
97
Repo for all the code from the articles I post on Medium
Eclairjs Nashorn
⭐
94
JavaScript API for Apache Spark
Big Data Engineering Coursera Yandex
⭐
91
Big Data for Data Engineers Coursera Specialization from Yandex
Kafka Sparkstreaming Cassandra
⭐
86
Docker container for Kafka - Spark Streaming - Cassandra
Pyspark Predictive Maintenance
⭐
85
Predictive Maintenance using Pyspark
Spark.samples
⭐
81
Tutorials and samples that show you how to get the most out of IBM Analytics for Apache Spark
Data Engineering Nanodegree
⭐
76
Projects done in the Data Engineering Nanodegree by Udacity.com
Spark Ocr Workshop
⭐
74
Public runnable examples of using John Snow Labs' OCR for Apache Spark.
Python Spark Streaming
⭐
73
Jupyterlab Sparkmonitor
⭐
72
JupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Sit742
⭐
72
SIT742: Modern Data Science
Jupyterlab Integration
⭐
72
DEPRECATED: Integrating Jupyter with Databricks via SSH
Learning Deep
⭐
65
All my deep learning notes: contains v1 of machine learning with Jeremy Howard, and v1 of Fastai Deep Learning 2018 part 1
Taxiprediction
⭐
65
Airflow Spark
⭐
64
Docker with Airflow and Spark standalone cluster
Lighter
⭐
64
REST API for Apache Spark on K8S or YARN
Ammonium
⭐
64
Impatient fork of Ammonite
Building Spark Applications Live Lessons
⭐
63
Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable Spark applications for predictive analytics in the context of a data scientist's standard workflow.
W2v
⭐
62
Word2Vec models with Twitter data using Spark. Blog:
W261 Environment
⭐
62
Sparkml
⭐
61
Spark ML with pyspark
Spark
⭐
60
Apache Spark (Scala, PySpark, SparkR) Code, Tricks, and References
Visualize Data With Python
⭐
60
A Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.
Pysparkgeoanalysis
⭐
60
🌐 Interactive Workshop on GeoAnalysis using PySpark
Udacity Data Engineer Nanodegree
⭐
59
Classwork projects and homework completed for the Udacity Data Engineering Nanodegree
Clinical Notes Diagnosis Dl Nlp
⭐
59
Nlp_spark
⭐
58
Natural Language Processing with Spark's MLlib
Mylearningnotes
⭐
58
Because it's never too late to start taking notes and making them public...
Learning
⭐
57
Walkthrough notebooks for Deep Learning, Machine Learning, Reinforcement Learning, Spark, Statistics, Algorithms, Scala, Python
Kafka Streaming Click Analysis
⭐
56
Use Kafka and Apache Spark streaming to perform click stream analytics
Bigdataanalytics_infoh515
⭐
56
Material for the Big Data Analytics exercise classes - INFOH515 - Big Data : Distributed Data Management and Scalable Analytics - Université Libre de Bruxelles
1-100 of 777 search results
Copyright 2018-2023 Awesome Open Source. All rights reserved.