Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for jupyter notebook spark
jupyter-notebook
x
spark
x
382 search results found
Data Pipeline Project
⭐
18
Data pipeline project
Notebooks
⭐
18
Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archives Unleashed Toolkit.
Ds30_5
⭐
18
Data Science in 30 Minutes #5: Spark
Spark And Mllib Projects
⭐
18
This repository contains Spark, MLlib, PySpark and Dataframes projects
Arc Starter
⭐
18
A starter project to create Arc jobs using the Jupyter Notebook interface
Apache Spark In 7 Days
⭐
17
Apache Spark in 7 Days [Video], by Packt Publishing
Example Health Machine Learning
⭐
17
This code pattern shows you how to train a machine learning model to predict type 2 diabetes using synthesized patient health records.
Spark Structured Streaming
⭐
17
A short course on the new, experimental features by The Data Incubator and O'Reilly Strata.
Lectures Hse Spark
⭐
17
Масштабируемое машинное обучение и анализ больших данных с Apache Spark
Prace Spark For Data Scientists
⭐
17
Course materials for PRACE Introduction to Spark for Data Scientists.
Pyspark_dl_pipeline
⭐
17
Pytd
⭐
17
Treasure Data Driver for Python
Amazon Emr With Delta Lake
⭐
17
Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR
Spark Dev Env Docker
⭐
17
Spark development environment for kubernetes, spark-submit and jupyter notebook
Berkeleyx Apache Spark Labs
⭐
17
Big Data Recommender Systems
⭐
17
Télécom Paris | MS Big Data | SD 701 | Big Data Mining Course Project using Spark and Google Colab for building Scalable Recommender Systems
Pyspark For Data Processing
⭐
16
Code for my presentation: Using PySpark to Process Boat Loads of Data
Datascience Environment
⭐
16
Docker Environment for data science
Advanced Factorization Of Machine Systems
⭐
16
GSOC 2017 - Apache Organization - # Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (python and scala)
T Watch
⭐
15
Real Time Twitter Sentiment Analysis Product
Yandex Big Data Engineering
⭐
15
Machine Learning With Apache Spark Quick Start Guide
⭐
15
Machine Learning with Apache Spark Quick Start Guide, published by Packy
Hdinsight Pyspark Cntk Integration
⭐
14
Instructions and examples for installing CNTK on an HDInsight cluster and running CNTK-Pyspark applications from Jupyter notebooks.
Recommender Systems For Implicit Feedback Datasets
⭐
14
Matrix Factorization augmented with customer item meta data
Bunsen Tutorial
⭐
14
Tutorial for exploring FHIR data with Apache Spark in an interactive notebook
Knowledge_search
⭐
14
a graph-based knowledge search engine powered by Wikipedia
Ghcn D
⭐
14
Data Pipeline from the Global Historical Climatology Network DataSet
Lambda Architecture Demo
⭐
14
Developing a Lambda Architecture pipeline using Apache Kafka, Spark Structured Streaming, Redshift, S3, Python
Pipeasy Spark
⭐
14
an easy way to define preprocessing data pipeline (similar to sklean-pandas but for Spark ML)
Spark Vs Dataflow
⭐
14
Demo code contrasting Google Dataflow (Apache Beam) with Apache Spark
Big Data Course
⭐
14
Practice course on Big Data
Machine Learning Notes
⭐
14
Jupyter notebooks for Machine Learning practice
Tedsds
⭐
14
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Notebooks
⭐
14
Interactive Notebooks that support the book
Pyspark Ml Examples
⭐
13
Spark ML Tutorial and Examples for Beginners
Nyc_taxi_trip_duration
⭐
13
Develop ML models predict taxi trip duration in NYC. Ranked : Top 6% | RMSLE : 0.377 (Kaggle) | #DS
Storeitemdemand
⭐
13
(117th place - Top 26%) Deep learning using Keras and Spark for the "Store Item Demand Forecasting" Kaggle competition.
Hdinsight Spark Scala Kafka
⭐
13
A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight
Bigdata_docker
⭐
13
Big Data Docker Data Science Spark Spark3 Hadoop HDFS Scala Python Artificial Intelligence Machine Learning Jupyter Lab Notebook
Blog
⭐
13
Ox Clo
⭐
13
Materials for Oxford Software Engineering Programme CLO course
Obd
⭐
13
Tools of Big Data (Outils de Big Data)
Spark Janelia
⭐
13
scripts for using spark on janelia's cluster
Workshop Spark
⭐
13
Código para workshops Spark com ambiente de desenvolvimento em docker
Docker Datascience Ultimate
⭐
13
Customized Jupyter Spark Docker images with everything you need
Ibis Demo
⭐
12
Demo notebook of Ibis for "Spark + Python + Dita science Festival"
Relk
⭐
12
RELK -- The Research Elastic Stack (Kafka, Beats, Zookeeper, Logstash, ElasticSearch, Kibana, Spark, & Jupyter -- All in Docker)
Pyspark For Beginners
⭐
12
PySpark for Beginners by Packt Pyblishing
Infoflow
⭐
12
An Apache Spark implementation of the InfoMap community detection algorithm
Hands On Great Expectations With Spark
⭐
12
How to evaluate the Quality of your Data with Great Expectations and Spark.
Hse_spark_course
⭐
12
Репозиторий учебных материалов для ДПО от ВШЭ (https://cs.hse.ru/dpo/) и курсов по Apache Spark
Spark_tutorial
⭐
11
Code for the Spark tutorial at the Pydata conference in London June 2015
Hackntu_x_cathay_2017
⭐
11
This repo is prepared for the HackNTU X Cathay 2017 Hackathon
Sparknow
⭐
11
Deploy Spark on OpenStack. Now!
Dsx Twitter Auto Analysis
⭐
11
WARNING: This repository is no longer maintained ⚠️ The Insights for Twitter service from IBM Cloud has been sunset. This repository will not be updated. This repository will be kept available in read-only mode. Refer to https://github.com/ibm/cognitive-social-crm for a similar example.
Big_data_course_rimini_2021
⭐
11
Questa repository contiene tutto il materiale didattico utilizzato durante il corso di "Laboratorio Big Data" in collaborazione con il comune di Rimini.
Greenplum Spark Connector
⭐
11
Example of using greenplum-spark connector
Spark On Zos
⭐
11
In this journey we demonstrate running an analytics application using Spark on z/OS. Apache Spark on z/OS is in-place, optimized abstraction and real-time analysis of structured and unstructured enterprise data
Literate Computing Hadoop
⭐
11
Literate Computing for Reproducible Infrastructure - Hadoop Practice
Dcos Jupyterlab Service
⭐
11
JupyterLab Notebook for Mesosphere DC/OS
Thesparkbox
⭐
11
TheSparkBox is an all-in-one Spark deployment that you can use to fire up a local cluster.
D Pandisim
⭐
11
distributed pandemics simulator, uses the power of spark to generate huge bulks of contact-tracing data.
Wikibrain
⭐
11
Wikipedia graph mining: dynamic structure of collective memory
Pyspark_amld2019
⭐
11
Workshop materials for AMLD2019 on PySpark.
Analyze Customer Data Spark Pixiedust
⭐
11
An introductory IBM Developer Code Pattern on how to use PixieDust to visualize customer data
Workshop Data Lakehouse
⭐
11
Repositório dedicado a Workshop de Data Lakehouse com Delta Lake
Python And Spark For Data Analysis
⭐
11
A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course given by Patrick Varilly to one of our clients in December 2015
Pyspark Tutorial
⭐
10
A short tutorial notebook on PySpark
Pyspark Dataframe Made Easy
⭐
10
pyspark dataframe made easy
Pyspark Tutorial
⭐
10
A learning journey into the Python API of Apache Spark from an ETL-developer perspective
Azure Databricks
⭐
10
Aprender análise de dados no Azure Databricks
Spark Etl Atlas
⭐
10
A small project to show how to add lineage to Atlas when using Spark as ETL tool
Big Data Student Resources
⭐
10
These are the Jupyter notebooks for the Big Data specialization in the Data Science Program.
Bigdata Profiler
⭐
10
Profiles the data, validates the schema and runs data quality checks and produces a report
Bsql Demos
⭐
10
Example notebooks using BlazingSQL with the RAPIDS AI ecoystem.
Twittertrends
⭐
10
Get Twitter trends with twitter4j, stream it to a Kafka topic, save it to MongoDB and visualize in Google Maps
Betfair Data Analysis
⭐
10
Explore, analyse and visualise Betfair Historical Data Feed using PySpark.
Predictive Maintenance In Pyspark
⭐
10
Dataquest
⭐
10
My solutions to problems on Dataquest
Data_ai_for_all
⭐
10
Data Analysis, Analytics, Science, AI & ML, LLM etc.
Bigdata20180301
⭐
10
巨量資料導論 上課資料
Micro Cluster Lab
⭐
10
A micro cluster lab to experiment Dask and Spark (Python and Scala) based on Docker
Sparktraining
⭐
9
Training material for the course "Introduction to Apache Spark APIs for Data Processing" https://sparktraining.web.cern.ch/
Docker Jupyter Spark
⭐
9
Docker image for Jupyter notebooks with PySpark
Docker Blog Example
⭐
9
Example of Docker configuration for an entry at the BBVA Data & Analytics blog
Bde
⭐
9
Blossom development environment, pre-build
Bigdatademo
⭐
9
The demo of using Kafka, Spark, Hive, Cassandra, etc by using Docker. It produces the production ready environment for any kinds of big data project relates to Hadoop ecosystem
Spark2 H2o R Zeppelin
⭐
9
A stack for data mining using Spark2, H2O, R and Zeppelin running on Cloudera Hadoop Distribution
Hdinsight Spark Scala Kafka Cosmosdb
⭐
9
An example of using Spark Structured Streaming to read data from Kafka on HDInsight and store it into Azure Cosmos DB
A Simple Neural Network On Spark
⭐
9
A simple implementation of an artificial neural network based with Apache Spark and python. this is another implementation of my toy program https://github.com/lzhengchun/step-by-step-neural-
Tensorflow Spark Docker
⭐
9
contains Tensorflow + HADOOP + SPARK, to make it easy to running TensorFlow on Spark via Docker.
Sparkmagic On Hdp
⭐
9
Spark Funds Investments Assignment
⭐
9
Spark Funds wants to make investments in a few companies. The CEO of Spark Funds wants to understand the global trends in investments so that she can take the investment decisions effectively.
Ml On Code
⭐
9
"Introduction to ML-on-Code" workshop materials 2018
Timeseriesgan
⭐
9
GANs for Time series analysis (Synthetic data generation, anomaly detection and interpolation), Hypertuning using Optuna, MLFlow and Databricks
Roadmap Data Scientist
⭐
9
The basic roadmap to become a data scientist
Cloud Computing
⭐
9
Web repository for the "Cloud Programming Models" module
Mleap Demo
⭐
9
MLeap demo repository for use with MLeap blog posts
Ansible Pydatalab
⭐
9
Ansible playbook for creating a python based datalab
Sparkdefinitiveguide
⭐
9
The Example Codes of "Spark, The Definitive Guide"
Related Searches
Python Jupyter Notebook (12,976)
Jupyter Notebook Deep Learning (9,967)
Jupyter Notebook Machine Learning (8,463)
Jupyter Notebook Dataset (6,824)
Jupyter Notebook Tensorflow (4,771)
Jupyter Notebook Convolutional Neural Networks (4,218)
Jupyter Notebook Classification (3,939)
Jupyter Notebook Neural (3,926)
Jupyter Notebook Pytorch (3,877)
Jupyter Notebook Data Science (3,734)
201-300 of 382 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.