Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for java hadoop
hadoop
x
java
x
800 search results found
Hcatalog
⭐
61
Mirror of Apache HCatalog
Hdfs Netdisc
⭐
61
基于Hadoop的分布式云存储系统 🌴
Coopr
⭐
61
A template-based cluster provisioning system
Storm Hdfs
⭐
61
Storm components for interacting with HDFS file systems
Hydrator Plugins
⭐
61
Cask Hydrator Plugins Repository
Incubator Tez
⭐
60
Mirror of Apache Tez (Incubating)
Stormtweetssentimentanalysis
⭐
60
Computes sentiment analysis of tweets of US States in real-time using Storm.
Hadoop S3a
⭐
59
An AWS SDK-backed FileSystem driver for Hadoop
Hadoopusc
⭐
59
USC Version of Hadoop that includes HDFS-RAID. Erasure codes like Locally Repairable Codes (aka Simple Regenerating Code), Reed Solomon Code and XOR code are supported
Likelike
⭐
57
An implementation of locality sensitive hashing with Hadoop
Cascading_ext
⭐
56
cascading_ext is a collection of tools built on top of the Cascading platform which make it easy to build, debug, and run simple and high-performance data workflows.
Elasticsearch Hdfs
⭐
56
Hadoop Plugin for ElasticSearch
Katta
⭐
54
Katta - distributed Lucene
Clickhouse Hdfs Loader
⭐
54
loading hdfs data to clickhouse
Mlhadoop
⭐
53
This repository contains Machine-Learning MapReduce codes for Hadoop which are written from scratch (without using any package or library). E.g. Prediction (Linear and Logistic Regression), Clustering (K-Means), Classification (KNN) etc.
Mapreduce Demo
⭐
53
Hadoop,MapReduce编程学习练手实例
Bestconf
⭐
53
A tool automatically improving the performance of large-scale systems by finding better configuration settings
Presto Yarn
⭐
52
Cascading Flink
⭐
52
Cascading on Apache Flink®
Iotdb
⭐
51
This repository is ReadOnly now. please go to https://github.com/apache/incubator-iotdb
Til
⭐
51
Today I Learned
Clickstream Tutorial
⭐
51
Code for Tutorial on designing clickstream analytics application using Hadoop
Hadoop Training
⭐
50
Hadoop training material from free MapR courses.
Arabesque
⭐
50
Scalable Graph Mining
Lzo Java
⭐
50
Pure Java implementation of the liblzo2 LZO compression algorithm
Splout Db
⭐
50
A web-latency SQL spout for Hadoop.
Pentaho Hadoop Shims
⭐
49
Hadoop Configurations
Mavuno
⭐
48
Mavuno: A Hadoop-Based Text Mining Toolkit
Lingual
⭐
48
Stand-alone ANSI SQL for Cascading on Apache Hadoop
Scalding Workshop
⭐
48
A half-day workshop on Scalding, the Scala API for Cascading
Distributedcomputingexamples
⭐
47
Example codes for my Distributed Computing course at Hefei University.
Hadoop Sstable
⭐
47
Splittable Input Format for Reading Cassandra SSTables Directly
Jsr203 Hadoop
⭐
47
A Java NIO file system provider for HDFS
Cc Warc Examples
⭐
46
CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop
Hadoop
⭐
46
A Hanborq optimized Hadoop Distribution, especially with high performance of MapReduce. It's the core part of HDH (Hanborq Distribution with Hadoop for Big Data Engineering).
Cascading.hbase
⭐
46
HBase adapters for Cascading
Graph Analytics Triangle Counting
⭐
46
Use Big data tools such as Vertica, Hadoop and PIG to count triangles in a graph. Experimentally compare their performance.
Simr
⭐
45
Spark In MapReduce (SIMR) - launching Spark applications on existing Hadoop MapReduce infrastructure
Lzo Java
⭐
45
Pure Java implementation of the liblzo2 LZO compression algorithm
Hadoop Unit
⭐
45
Hadoop-Unit is a project which allow testing projects which need hadoop ecosysteme like kafka, solr, hdfs, hive, hbase, ...
Fraud Detection Tutorial
⭐
44
Hfsa
⭐
44
Hadoop FSImage Analyzer (HFSA)
Ades
⭐
43
An analysis of adverse drug event data using Hadoop, R, and Gephi
Knittingboar
⭐
43
Parallel Iterative Algorithm (SGD) on Hadoop's YARN framework
Datasketches Hive
⭐
42
Sketch adaptors for Hive.
P3
⭐
42
An open source pcap packet and NetFlow file analysis tool using Hadoop MapReduce and Hive.
Sqoop On Spark
⭐
42
Sqoop on Apache Spark Engine
Cephfs Hadoop
⭐
42
cephfs-hadoop
Yuzhouwan
⭐
42
Code Library for My Blog
Mongodb Spark Demo
⭐
41
Spark app that demonstrates reading and writing data to from MongoDB and BSON files
Hadoop Crypto
⭐
41
Library for per-file client-side encyption in Hadoop FileSystems such as HDFS or S3.
Gzinga
⭐
40
Hadoop Dns Mining
⭐
40
Wikireverse
⭐
39
Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.
Sizzle
⭐
39
A compiler and runtime for Google's Sawzall language, optimized for Hadoop
Garmadon
⭐
39
Java event logs collector for hadoop and frameworks
Hbase Meta Repair
⭐
39
Repair hbase metadata table from hdfs.
Big Data Parent
⭐
39
大数据体系,存储,计算,相关组件,分析引擎等
Java Study Learn Video Resources Tutorial
⭐
38
java,python,hadoop,spring,php学习视频资源分享
Csds Material
⭐
38
Course material for the Computer Systems for Data Science class at Columbia
Vertica Hadoop Connector
⭐
38
Vertica Hadoop Connector
Hia Examples
⭐
38
Hadoop In Action Examples
Unoexample
⭐
38
MapReduce/Hadoop example that uses regular playing cards to show mapping and reducing.
Hive Jdbc Driver
⭐
38
An alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Big Data Lite
⭐
38
Samples to the Oracle Big Data Lite VM
Swordfish
⭐
37
Open-source distribute workflow schedule tools, also support streaming task.
Recommendationengine
⭐
37
[Deprecated] An optimized MapReduce for item‐based collaborative filtering recommendation algorithm with empirical analysis
Big Data Exploration
⭐
37
[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Bigdata Getting Started
⭐
37
大数据相关框架实战项目(Hadoop, Spark, Storm, Flink)
Accumulo Recipes
⭐
37
Recipes & cookbooks for Accumulo.
Xxhadoop
⭐
37
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Cloudgene
⭐
37
A framework to build Software As A Service (SaaS) platforms for data analysis pipelines.
Weblogsanalysissystem
⭐
37
A big data platform for analyzing web access logs
Hadoop Dns Checker
⭐
36
Hadoop Snappy
⭐
36
Snappy compression for Hadoop
Paraflow
⭐
36
A real-time analytical system for ID-associated data
Logistic
⭐
36
Experimental logistic regression code supporting multiple result categories, many levels of categorical modeling variables, good optimization, L2 regularization and more.
Hbase Secondary Index
⭐
35
Several implementation for building hbase secondary index.
Nectar
⭐
35
Open source framework for predictive modeling on Apache Hadoop
Cc Warc Examples
⭐
35
CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop
Kmeans_mapreduce
⭐
35
K-means MapReduce implementation
Intellij Hadoop
⭐
35
Run Hadoop program using Intellij
Kassandramrhelper
⭐
35
Library for processing Cassandra SSTables with Hadoop MapReduce.
Flume Logs
⭐
34
Apache Flume to process log files on Hadoop cluster
Csvinputformat
⭐
34
Input format for hadoop able to read multiline CSVs
Ambari Metrics
⭐
34
Apache Ambari Metrics is a sub project of Apache Ambari.
Avro Maven Plugin
⭐
34
Maven 2 Plugin for processing Apache Avro files. Avro is a subproject of Apache Hadoop.
Sparkdemo
⭐
34
spark全示例代码(java、scala) Spark most full instance code DEMO (java、scala)
Hadoop
⭐
34
Public hadoop release repository
Visitante
⭐
34
Set of Hadoop, Spark and Storm based tools for web and customer analytic
Riak Hadoop
⭐
33
Riak data as input to hadoop m/r and output of hadoop m/r
Cc Helloworld
⭐
33
CommonCrawl Hello World example
Cipher
⭐
33
基于hdfs spark的视频非结构化数据计算
Avro2parquet
⭐
33
Hadoop MapReduce tool to convert Avro data files to Parquet format.
Hadoop_framework
⭐
32
This is a prototype system that uses Hadoop to process hard drive images.
Msgpack Hadoop
⭐
32
MessagePack-Hadoop integration provides an efficient schema-free data representation for Hadoop and Hive.
Marklogic Contentpump
⭐
32
MarkLogic Contentpump (mlcp)
Thrax
⭐
32
Offline extractor of synchronous context-free grammars for machine translation.
Mastering Scala Machine Learning
⭐
32
Mastering-Scala-Machine-Learning
Main
⭐
32
The main - so far, only - repository for the SmileWide project.
Related Searches
Java Spring (21,350)
Java Jar (7,924)
Java Testing (7,163)
Java Database (6,015)
Java Mysql (5,954)
Javascript Java (5,468)
Java Algorithms (4,705)
Java Apache (4,283)
Java Cloud Computing (4,240)
Java Json (3,692)
201-300 of 800 search results
< Previous
Next >
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.