Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for docker big data
big-data
x
docker
x
49 search results found
Kafka Ui
⭐
7,779
Open-Source Web UI for Apache Kafka Management
Pachyderm
⭐
6,035
Data-Centric Pipelines and Data Versioning
Docker Spark Cluster
⭐
413
A simple spark standalone cluster for your testing environment purposses
Smooks
⭐
377
Extensible data integration Java framework for building XML and non-XML fragment-based applications
Arvados
⭐
354
An open source platform for managing and analyzing biomedical big data
Data Accelerator
⭐
295
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Couchdb Docker
⭐
242
Semi-official Apache CouchDB Docker images
Bigdata_docker
⭐
226
Big Data Ecosystem Docker
Bigdata Playground
⭐
154
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Storm Doc Zh
⭐
143
Apache Storm 官方文档中文版
Incubator Liminal
⭐
131
Apache Liminals goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
Amas
⭐
77
Amas is recursive acronym for “Amas, monitor alert system”.
Airflow_multi_dagrun
⭐
76
triggering a DAG run multiple times
Labs
⭐
73
Research on distributed system
Awesome Ai Kubernetes
⭐
62
❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++ etc
Docker Kafka Alpine
⭐
59
Alpine Linux based Kafka Docker Image
Big_data
⭐
55
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Docker Hadoop Workbench
⭐
44
A Hadoop cluster based on Docker, including Hive and Spark.
Docker Spark Cluster
⭐
44
A Spark cluster setup running on Docker containers
Rtdl
⭐
39
rtdl makes it easy to build and maintain a real-time data lake
Esper Tv
⭐
37
Esper instance for TV news analysis
Big Data
⭐
37
Python tools for big data
Pretzel
⭐
36
Javascript full-stack framework for Big Data visualisation and analysis
Bigdata In Docker
⭐
31
Make it easier to learn big data
Bigdata Docker
⭐
30
docker构建大数据开发学习环境
Flokkr
⭐
28
Documentation placeholder and utilities for all the other containers.
Airavata Django Portal
⭐
27
Apache Airavata Django Portal Framework
Bigdata Docker
⭐
26
Docker images for Open Source bigdata/hadoop projects
Clusterdock
⭐
26
clusterdock is a framework for creating Docker-based container clusters
Twitter Sentiment Analysis Using Hadoop
⭐
26
A Project where one can fetch and read tweets and show the analysis like who is most influential
Pyspark Setup Demo
⭐
21
Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Bftkv
⭐
20
A distributed key-value storage that's tolerant to Byzantine fault.
Simple Kafka Mqtt Connector
⭐
19
This application receives messages from a mqtt broker and sends the messages to a kafka cluster. Topic mapping is configurable.
Accumulo Docker
⭐
18
Apache Accumulo Docker
Docker Bigdata
⭐
14
Sillot
⭐
13
Sillot (汐洛)致力于服务智慧新彖乄
Bigdata_docker
⭐
13
Big Data Docker Data Science Spark Spark3 Hadoop HDFS Scala Python Artificial Intelligence Machine Learning Jupyter Lab Notebook
Snowplow Pipeline
⭐
11
End-to-end Snowplow Analytics Pipeline for real time events
Metamodel Membrane
⭐
11
Mirror of Apache MetaModel Membrane
Blazegraph Docker
⭐
11
Blazegraph docker container for deploying to Container Cluster Platforms (OpenShift, Kubernetes, etc)
Spark On Yarn Cluster
⭐
10
A Procedure To Create A Yarn Cluster Based on Docker, Run Spark, And Do TPC-DS Performance Test.
Masterdatcom_bdcc_practice
⭐
10
Practice and Workshop on BigData and Cloud Computing using Docker Containers and OpenNebula. HDFS, hadoop and spark+R
Open Kubeyard
⭐
10
A collection of helm charts for big data
Big Data Open Os
⭐
10
The definitive open source big data operating system.
Bde
⭐
9
Blossom development environment, pre-build
Couchdb Ci
⭐
9
Apache CouchDB Continuous Integration (CI) support repository
Panoptes_docker
⭐
9
Containerized version of Panoptes for testing and experimentation.
Bigdatademo
⭐
9
The demo of using Kafka, Spark, Hive, Cassandra, etc by using Docker. It produces the production ready environment for any kinds of big data project relates to Hadoop ecosystem
Kibana Plugin Builder
⭐
9
Malice Kibana Plugin Builder
Pygmql
⭐
9
Python Library for data analysis based on GMQL
K8s Bigdata
⭐
8
Apache Spark with HDFS cluster within Kubernetes
Tech Faqs
⭐
8
Easy introductions to few important, simple, tech topics.
Coderepo
⭐
8
Code for all the projects I create.
Real Time Stock Analysis System
⭐
7
Stock Analysis System based on Spark, Kafka.
Sentimento
⭐
7
An intelligent stock market sentiment analytic engine 🚀
Quickref
⭐
7
Quick references to notes on specific topics and their basic introductions
Sturdy Robot
⭐
7
smartkit.ai
Transport
⭐
7
data transportation tool, from one to another.such as,file, kafka, hdfs etc.
Hive Metastore Docker
⭐
6
Containerized Apache Hive Metastore for horizontally scalable Hive Metastore deployments
Community Detection Lastfm
⭐
6
Python3, NetworkX, Java, MLlib, Spark, Cassandra, Neo4j 3.0, Gephi, Docker
Knowledge Repository
⭐
6
To keep forgetfulness at bay
Bigdata Platform
⭐
6
End to end big data project, that aims to show how to implement different big data layers, from the infrastructure layer to the end user one. [HADOOP][Spark][Kafka][Cassandra][Ansible][Jupyter
Bigbox
⭐
6
Sunlab big data training container
Rump
⭐
5
A Reproducible Untargeted Metabolomics Data Processing Pipeline
Cloudera Quickstart Dockers
⭐
5
Useful information for the article about setting a cloudera quickstart images with docker
St Kilda Pier
⭐
5
Docker Cluster based Data Science Platform for Big Data
Related Searches
Shell Docker (20,660)
Docker Dockerfile (16,395)
Python Docker (16,341)
Javascript Docker (10,426)
Golang Docker (7,702)
Php Docker (6,192)
Java Docker (6,071)
Docker Nginx (5,238)
Typescript Docker (4,630)
Docker Postgresql (4,363)
1-49 of 49 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.