Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for docker spark
docker
x
spark
x
278 search results found
Docker_practice
⭐
23,279
Learn and understand Docker&Container technologies, with real DevOps practice!
Data Engineering Zoomcamp
⭐
19,461
Free Data Engineering course!
Pipeline
⭐
4,158
PipelineAI
Helk
⭐
3,633
The Hunting ELK
Szt Bigdata
⭐
2,055
深圳地铁大数据客流分析系统🚇🚄🌟
Docker Spark
⭐
1,783
Apache Spark docker image
Seldon Server
⭐
1,420
Machine Learning Platform and Recommendation Engine built on Kubernetes
Dockerfiles
⭐
1,171
50+ DockerHub public images for Docker & Kubernetes - DevOps, CI/CD, GitHub Actions, CircleCI, Jenkins, TeamCity, Alpine, CentOS, Debian, Fedora, Ubuntu, Hadoop, Kafka, ZooKeeper, HBase, Cassandra, Solr, SolrCloud, Presto, Apache Drill, Nifi, Spark, Consul, Riak
Spark Scala Tutorial
⭐
922
A free tutorial for Apache Spark.
Docker Spark
⭐
769
Devops Python Tools
⭐
709
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Functions, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Justenoughscalaforspark
⭐
643
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Docker Spark
⭐
626
Docker build for Apache Spark
Docker Hadoop Spark Workbench
⭐
503
[EXPERIMENTAL] This repo includes deployment instructions for running HDFS/Spark inside docker containers. Also includes spark-notebook and HDFS FileBrowser.
Piflow
⭐
498
πflow is a big data flow engine with spark support
Docker Spark Cluster
⭐
413
A simple spark standalone cluster for your testing environment purposses
Learning Resource
⭐
351
列出一些优秀的程序员学习资源
Eclairjs Node
⭐
340
Node.js API for Apache Spark with Remote Client
Elasticluster
⭐
334
Create clusters of VMs on the cloud and configure them with Ansible.
Spark Standalone Cluster On Docker
⭐
311
Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker. ⚡
Data Accelerator
⭐
295
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Sparklint
⭐
293
A tool for monitoring and tuning Spark jobs for efficiency.
Tidb Docker Compose
⭐
278
Beginner_de_project
⭐
276
Beginner data engineering project - batch edition
Docker Scripts
⭐
261
Dockerfiles and scripts for Spark and Shark Docker images
Hydro Serving
⭐
248
MLOps Platform
Spark
⭐
236
Firely and Incendi's open source FHIR server
Bigdata_docker
⭐
226
Big Data Ecosystem Docker
Hadoop Docker
⭐
210
基于Docker构建的Hadoop开发测试环境,包含Hadoop,Hive,HBase,Spark
Docker Zeppelin
⭐
210
Docker build for Zeppelin, a web-based Spark notebook
Airflow Pipeline
⭐
168
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Aztk
⭐
152
AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure
Ipython Spark Docker
⭐
151
Docker Spark
⭐
118
Docker image for general apache spark client
Mastering Big Data Analytics With Pyspark
⭐
118
Mastering Big Data Analytics with PySpark, Published by Packt
Movalytics Data Warehouse
⭐
117
Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow
Hanshu Note
⭐
116
所有的学习笔记都在这里了,地主家也没有余粮了。
Gvc
⭐
112
Geek's valuable creation.
De Zoomcamp Ui
⭐
107
🎨 UI for the Free Data Engineering Zoomcamp 2023 Course provided by DataTalksClub
Spark Mllib Twitter Sentiment Analysis
⭐
103
🌟 ✨ Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Docs
⭐
102
《数据采集从入门到放弃》源码。内容简介:爬虫介绍、就业情况、爬虫工程师面试题 ;HTTP协议介绍; Requests使用 ;解析器Xpath介绍; MongoDB与MySQL; 多线程爬虫; Scrapy介绍 ;Scrapy-redis介绍; 使用docker部署; 使用nomad管理docker集群; 使用EFK查询docker日志
Spark Kubernetes
⭐
96
spark on kubernetes
Spark Dashboard
⭐
79
This repo provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.
Docker Spark
⭐
77
🚢 Docker image for Apache Spark
Generator Mitosis
⭐
75
A micro-service infrastructure generator based on Yeoman/Chatbot, Kubernetes/Docker Swarm, Traefik, Ansible, Jenkins, Spark, Hadoop, Kafka, etc.
Labs
⭐
73
Research on distributed system
Jupyterlab Integration
⭐
72
DEPRECATED: Integrating Jupyter with Databricks via SSH
Openshift Spark
⭐
70
Kafka Spark Streaming Zeppelin Docker
⭐
68
One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)
Mltoolkits
⭐
65
learningOrchestra is a distributed Machine Learning integration tool that facilitates and streamlines iterative processes in a Data Science project.
Airflow Spark
⭐
64
Docker with Airflow and Spark standalone cluster
Dockers
⭐
64
Docker hello world templates
W261 Environment
⭐
62
Awesome Ai Kubernetes
⭐
62
❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Hive Metastore Docker
⭐
61
Example for article Running Spark 3 with standalone Hive Metastore 3.0
Pysparkgeoanalysis
⭐
60
🌐 Interactive Workshop on GeoAnalysis using PySpark
Data Processing Pipeline
⭐
59
Real-Time Data Processing Pipeline & Visualization with Docker, Spark, Kafka and Cassandra
Platys Modern Data Platform
⭐
58
Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....
Spark The Definitive Guide
⭐
57
한빛미디어에서 출간한 스파크 완벽 가이드 1판의 소스코드 저장소
Big_data
⭐
55
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Spark Docker
⭐
55
Apache Spark Docker Image
Spark Integration Tests
⭐
55
Integration tests for Spark
Cowait
⭐
54
Containerized distributed programming framework for Python
Books
⭐
53
A collection of online books for data science, computer science and coding!
Aiopen
⭐
52
AIOpen是一个按人工智能三要素(数据、算法、算力)进行AI开源项目分类的汇集项目,项目致力于跟踪
Docker Hadoop
⭐
51
A Docker container with a full Hadoop cluster setup with Spark and Zeppelin
Spark Build
⭐
48
Used to build the mesosphere/spark docker image and the DC/OS Spark package
Casper
⭐
47
A compiler for automatically re-targeting sequential Java code to Apache Spark.
Zemberek Nlp Server
⭐
46
Zemberek Türkçe NLP Java Kütüphanesi üzerine REST Docker Sunucu
Spark Mail
⭐
45
Tutorial on parsing Enron email to Avro and then explore the email set using Spark.
Hadoop Spark Hive Cluster Docker
⭐
45
hadoop-spark-hive-cluster-docker
Docker Hadoop Workbench
⭐
44
A Hadoop cluster based on Docker, including Hive and Spark.
Docker Spark Cluster
⭐
44
A Spark cluster setup running on Docker containers
Spark Neo4j
⭐
43
A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine
Searchhub
⭐
43
Fusion demo app searching open-source project data from the Apache Software Foundation
Smv
⭐
41
Spark Modularized View
Devops
⭐
40
DevOps
Etl Light
⭐
38
A light Kafka to HDFS/S3 ETL library based on Apache Spark
Big Data
⭐
37
Python tools for big data
Hive Metastore
⭐
36
Apache Hive Metastore as a Standalone server in Docker
Notes
⭐
36
My notes about Openstack,Docker,etc.
Getting_started_with_pyspark
⭐
34
Materials for class Getting Started with Pyspark
Docker Spark Submit
⭐
32
Docker image to submit Spark applications
Snowtire
⭐
32
A Snowflake Sandbox for Data Science
Building Data Lakehouse
⭐
32
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
Spark Docker Swarm
⭐
31
running apache spark with docker swarm
Bigdata Docker
⭐
30
docker构建大数据开发学习环境
Kafka Spark Streaming Druid
⭐
29
Takes a kafka stream into spark, apply transformations and sink into Druid. Everything Dockerised.
Kafka Compose
⭐
28
🎼 Docker compose files for various kafka stacks
Docker Pyspark
⭐
28
Docker image of Apache Spark with its Python interface, pyspark.
Flokkr
⭐
28
Documentation placeholder and utilities for all the other containers.
Docker Sparkdev
⭐
27
Docker image for Apache Spark (scala) applications
Mesos Spark Docker
⭐
27
Document and showcase how you can create Spark Applications which run inside Docker Containers using Apache Mesos.
Geodocker Cluster
⭐
27
[NOT MAINTAINED] GeoDocker Cluster is a Docker environment with Apache Accumulo and Apache Spark environment.
Kraps Haskell
⭐
26
Experimental Haskell bindings to Spark Datasets and DataFrames
Bigdata Docker
⭐
26
Docker images for Open Source bigdata/hadoop projects
Docker Spark
⭐
26
Apache Spark docker container image (Standalone mode)
Kafka Spark Flink Example
⭐
26
Kafka streaming with Spark and Flink example
Runtime Compose
⭐
25
Examples to run Hadoop/Spark clusters locally with docker-compose.
Sansa Notebooks
⭐
25
Interactive Spark Notebooks for running SANSA examples.
Related Searches
Shell Docker (20,041)
Docker Dockerfile (16,395)
Python Docker (16,341)
Javascript Docker (10,426)
Golang Docker (7,702)
Php Docker (6,192)
Java Docker (6,071)
Docker Nginx (5,238)
Typescript Docker (4,630)
Docker Postgresql (4,363)
1-100 of 278 search results
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.