Spark Notebook

Interactive and Reactive Data Science using Scala and Spark.
Alternatives To Spark Notebook
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Spark Notebook3,145
7 months ago207apache-2.0JavaScript
Interactive and Reactive Data Science using Scala and Spark.
Spark As Service Using Embedded Server30
6 years ago2Scala
This application comes as Spark2.1-as-Service-Provider using an embedded, Reactive-Streams-based, fully asynchronous HTTP server
Rendactive.js10
10 years agomitJavaScript
Experimental reactive template rendering with Bacon.js + Meteor Spark
Creating 2d 3d Colliders With Scripting Spark Ar7
3 years agomit
This tutorial was created for 2020 Facebook Developer Circles Community Challenge
Deprecated Clinical Overlays7
8 years ago2December 08, 2015JavaScript
Meteorite package to add image and tutorial overlays, using reactive spark templates.
Spark Play Activator7
8 years ago1apache-2.0Scala
Spark Social6
9 years agoScala
Reactive Social Trending
Reactive Datapipe4
8 years agoScala
A reactive data analysis machine, consumes a stream of data from different, passes it through a user defined custom chain of transformations, translations, and processing, then it pushes the outcome to a configurable set of sinks
Realtimemusicrecommendationapp2
7 years agoScala
Reactive Big Data project based on Apache Spark and Scala
Reactive Spark2
8 years agomitScala
Dockerized Spark streaming example using Kafka and Spark back-pressure
Alternatives To Spark Notebook
Select To Compare


Alternative Project Comparisons
Readme

Spark Notebook

Gitter

The Spark Notebook is the open source notebook aimed at enterprise environments, providing Data Scientists and Data Engineers with an interactive web-based editor that can combine Scala code, SQL queries, Markup and JavaScript in a collaborative manner to explore, analyse and learn from massive data sets.

notebook intro

The Spark Notebook allows performing reproducible analysis with Scala, Apache Spark and the Big Data ecosystem.

Features Highlights

Apache Spark

Apache Spark is available out of the box, and is simply accessed by the variable sparkContext or sc.

Multiple Spark Context Support

One of the top most useful feature brought by the Spark Notebook is its separation of the running notebooks. Each started notebook will spawn a new JVM with its own SparkSession instance. This allows a maximal flexibility for:

  • dependencies without clashes
  • access different clusters
  • tune differently each notebook
  • external scheduling (on the roadmap)

Metadata-driven configuration

We achieve maximum flexibility with the availability of multiple sparkContexts by enabling metadata driven configuration.

Scala

The Spark Notebook supports exclusively the Scala programming language, the Unpredicted Lingua Franca for Data Science and extensibly exploits the JVM ecosystem of libraries to drive an smooth evolution of data-driven software from exploration to production.

The Spark Notebook is available for *NIX and Windows systems in easy to use ZIP/TAR, Docker and DEB packages.

Reactive

All components in the Spark Notebook are dynamic and reactive.

The Spark Notebook comes with dynamic charts and most (if not all) components can be listened for and can react to events. This is very helpful in many cases, for example:

  • data entering the system live at runtime
  • visually plots of events
  • multiple interconnected visual components Dynamic and reactive components mean that you don't have write the html, js, server code just for basic use cases.

Quick Start

Go to Quick Start for our 5-minutes guide to get up and running with the Spark Notebook.

C'mon on to Gitter to discuss things, to get some help, or to start contributing!

Learn more

Testimonials

Skymind - Deeplearning4j

Spark Notebook gives us a clean, useful way to mix code and prose when we demo and explain our tech to customers. The Spark ecosystem needed this.

Vinted.com

It allows our analysts and developers (15+ users) to run ad-hoc queries, to perform complex data analysis and data visualisations, prototype machine learning pipelines. In addition, we use it to power our BI dashboards.

Adopters

Name Logo URL Description
Kensu Kensu website Lifting Data Science to the Enterprise level
Agile Lab Agile Lab website The only Italian Spark Certified systems integrator
CloudPhysics CloudPhysics website Data-Driven Inisghts for Smarter IT
Aliyun Alibaba - Aliyun ECS product Spark runtime environment on ECS and management tool of Spark Cluster running on Aliyun ECS
EMBL European Bioinformatics Institute EMBL - EBI website EMBL-EBI provides freely available data from life science experiments, performs basic research in computational biology and offers an extensive user training programme, supporting researchers in academia and industry.
Metail Metail website The best body shape and garment fit company in the world. To create and empower everyones online body identity.
kt NexR kt NexR website the kt NexR is one of the leading BigData company in the Korea from 2007.
Skymind website At Skymind, were tackling some of the most advanced problems in data analysis and machine intelligence. We offer start-of-the-art, flexible, scalable deep learning for industry.
Amino website A new way to get the facts about your health care choices.
Vinted Vinted website Online marketplace and a social network focused on young womens lifestyle.
Vingle Vingle website Vingle is the community where you can meet someone like you.
47 Degrees website 47 Degrees is a global consulting firm and certified Typesafe & Databricks Partner specializing in Scala & Spark.
Barclays Barclays website Barclays is a British multinational banking and financial services company headquartered in London.
Swisscom Swisscom website Swisscom is the leading mobile service provider in Switzerland.
Knoldus knoldus website Knoldus is a global consulting firm and certified "Select" Lightbend & Databricks Partner specializing in Scala & Spark ecosystem.
Popular Spark Projects
Popular Reactive Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Scala
Data Science
Spark
Reactive
Apache Spark