Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for hadoop hdfs
hadoop-hdfs
x
18 search results found
Seaweedfs
⭐
21,541
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
Data Engineering Interview Questions
⭐
554
More than 2000+ Data engineer interview questions.
Morphl Community Edition
⭐
233
MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services with the end goal of increasing KPIs (click-through rates, conversion rates, etc.) through personalization
Dynamometer
⭐
110
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Sparksql For Hbase
⭐
66
Learn how to use Spark SQL and HSpark connector package to create / query data tables that reside in HBase region servers
Datapipelines Essentials Python
⭐
45
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Console
⭐
23
Open source data infrastructure platform. Designed for developers, built for speed.
Marayarn
⭐
13
Marathon on yarn
Travelwebsite_bigdataanalysis
⭐
12
旅游网站(携程网部分数据)大数据分析-hadoop课程设计(本科级别)
Twitter Hashtag Graph
⭐
9
Twitter + Flume + Hadoop (HDFS, MapReduce) + Neo4j + Pyhton
Hdfsclient
⭐
8
A Java Hdfs client example and full Kerberos example for call hadoop commands directly in java code or on your local machine.
Hadoop Sandbox
⭐
8
A fully-functional Hadoop Yarn cluster as docker-compose deployment.
Distributable_docker_sql_on_hadoop
⭐
6
Toy Hadoop cluster combining various SQL-on-Hadoop variants
Ansible Hadoop Hdfs
⭐
6
Ansible Playbook For Setup Hadoop HDFS
Data Engineering Project With Hdfs And Kafka
⭐
5
Data Engineering Project with Hadoop HDFS and Kafka
Hadoop Installation
⭐
5
Instructions on setting up Hadoop, HDFS, java, sbt, kafka, scala, spark and flume on Ubuntu 18.04
Mammoth
⭐
5
Mammoth is a container based hadoop distributed system log analyzer. Sponsed by Mantech and Naver Cloud Platform.
Hadoop Spark Cluster
⭐
5
Repository containing Docker images for create a cluster Spark on Hadoop Yarn.
1-18 of 18 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.