Awesome Open Source
Search
Programming Languages
Languages
All Categories
Categories
About
Search results for hadoop pig
hadoop
x
pig
x
88 search results found
Scalding
⭐
3,433
A Scala API for Cascading
Elasticsearch Hadoop
⭐
1,914
🐘 Elasticsearch real-time search and analytics natively integrated with Hadoop
Mongo Hadoop
⭐
1,511
MongoDB Connector for Hadoop
Elephant Bird
⭐
1,100
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
Oozie
⭐
687
Mirror of Apache Oozie
Pig
⭐
659
Mirror of Apache Pig
Bigdata Ecosystem
⭐
536
BigData Ecosystem Dataset
Shifu
⭐
235
An end-to-end machine learning and data mining framework on Hadoop
Hadoop Tutorials Examples
⭐
228
Source, data and turotials of the blog post video series of Hue, the Web UI for Hadoop.
Crunch
⭐
196
A fast to develop, fast to run, Go based toolkit for ETL and feature extraction on Hadoop.
Pignlproc
⭐
160
Apache Pig utilities to build training corpora for machine learning / NLP out of public Wikipedia and DBpedia dumps.
Logparser
⭐
153
Easy parsing of Apache HTTPD and NGINX access logs with Java, Hadoop, Hive, Flink, Beam, Storm, Drill, ...
Spatialanalytics
⭐
134
Where 2.0 Workshop Code: Spatial Analysis of Tweets using Hadoop, Pig, Python & Mechanical Turk. Slides here: http://www.slideshare.net/kevinweil/spatial-analyt
Binarypig
⭐
133
Scalable Binary Data Extraction in Hadoop
Gora
⭐
111
The Apache Gora open source framework provides an in-memory data model and persistence for big data.
Avro Hadoop Starter
⭐
111
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Datafu
⭐
110
Mirror of Apache DataFu
Linkedin Gradle Plugin For Apache Hadoop
⭐
106
Spork
⭐
84
Pig on Apache Spark
Akela
⭐
78
A bunch of utility classes for Java, Hadoop, HBase, Pig, etc.
Splittablegzip
⭐
66
Splittable Gzip codec for Hadoop
Graph Analytics Triangle Counting
⭐
46
Use Big data tools such as Vertica, Hadoop and PIG to count triangles in a graph. Experimentally compare their performance.
Ades
⭐
43
An analysis of adverse drug event data using Hadoop, R, and Gephi
Barclamp Pig
⭐
41
[UNMAINTAINED] Hadoop Pig: Mapreduce Programming component
Hia Examples
⭐
38
Hadoop In Action Examples
Vertica Hadoop Connector
⭐
38
Vertica Hadoop Connector
Hadoop Guide
⭐
36
🐘 关于 HDFS,Yarn,MapReduce,HBase,Hive,Pig,Sqoop,Flume,Zoo 等大数据框架的学习笔记
Hadoop For Hpcers Tutorial
⭐
28
Introduction to Hadoop for those from an HPC simulation background
Data_science_fun_pack
⭐
28
Meta-repository of big data tools -- source and essential plugins for hadoop, pig, wukong, storm, kafka etc.
Rosetta Scone
⭐
25
A collection of MapReduce tasks translated (from Pig, Hive, MapReduce streaming, Cascalog, etc.) into Scalding.
Dse230_data_analysis_using_hadoop_and_spark_ucsd
⭐
24
Map-reduce, streaming analysis, and external memory algorithms and their implementation using the Hadoop and its eco-system: HBase, Hive, Pig and Spark. The class will include assignment of analyzing large existing databases.
Presentations
⭐
24
Public Presentations
Serengeti Pantry
⭐
23
Cookbooks and roles used by Serengeti
Learn Hadoop And Spark
⭐
22
This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.
Azure Documentdb Hadoop
⭐
22
Hadoop Scripts
⭐
20
Collocations
⭐
20
bigram / trigram analysis of wikipedia; mainly mutual info
Sandcrawler
⭐
19
Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki
Tbana
⭐
19
Big Data Connectors for Splunk
Solutions Apache Hive And Pig On Google Compute Engine
⭐
19
This sample app will get up and running quickly with Hive and/or Pig on a Hadoop cluster on Google Compute Engine. For more information on running Hadoop on GCE, read the papers at https://cloud.google.com/resources/.
Guineapig
⭐
19
Pure python PIG-like language
Etl Starter Kit
⭐
18
📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage and especially in data warehousing. This repository contains a starter kit featuring ETL related work.
Hadoop Python Tutorial
⭐
18
Exercises and examples developed for the Hadoop with Python tutorial
Wikipediaphilosophy
⭐
18
do all first links on wikipedia _really_ lead to philosophy?
Hadoop Docker Lite
⭐
15
Docker build project to setup a lightweight hadoop cluster containing hadoop, pig, zookeeper, hbase, phoenix, storm, kafka, kafka manager
Googleplay Web Crawler
⭐
15
Mapreduce project by Hadoop, Nutch, AWS EMR, Pig, Tez, Hive
Github Explorer
⭐
15
Recommender system for Github projects using the github archive data
Pig Data Mining Talk
⭐
15
Notes and resources for my talk at the Hadoop UK Users' Group in June 2012
Glue
⭐
14
BigData Workflow Engine for Hadoop, Hbase, Netezza, Pig, Hive, Redis ...
Cheatsheets For Ai
⭐
14
Cheatsheets on numerous topics ranging from DataScience | ML | DL | AI | Big Data.
Hadoop.client
⭐
14
.NET client for Hadoop
Hadoop Scripting
⭐
13
Scripting Languages on Hadoop: Jaql vs. Pig Latin (MapReduce stuff)
Hadoop.net
⭐
13
.NET port of hadoop core
Enron Node Mongo
⭐
12
Building a simple Node application with Pig, MongoDB, Node.js and the Enron Emails
Mongodb Hadoop Workshop
⭐
11
MongoDB-Hadoop Workshop Exercises
Druid Hadoop Utils
⭐
10
Read druid segments from hadoop
Hadoop Examples
⭐
10
Hadoop Examples
Big Data Pipeline
⭐
9
Big Data
Bigdatademos
⭐
9
Demo programs for Hadoop etc.
Cassandra Summit Demo
⭐
9
Hadoop integration demo for the Cassandra Summit
Trending
⭐
9
testing out some trending algorithms, mostly written in hadoop pig
Hadoop_ctakes
⭐
9
Hadoop integration code for working with with Apache cTAKES
Elasticsearch Hadoop
⭐
9
elasticsearch-hadoop connector for elassandra
Datasketches Pig
⭐
9
Sketch adaptors for Pig.
Hadoop Ansible
⭐
8
This big data distro contains ansible provisioning for: Apache Hadoop, Apache Spark, Apache Hive, Apache Pig, Apache Storm, Apache Zookeeper, Apache Kafka, Apache Cassandra, ElasticSearch, Kibana, Logstash, Apache Hbase, Apache Zeppelin, Apache Flink
Vagrant Jilla Hadoop
⭐
8
Vagrant setup to spin up vm hadoop cluster
Hands On Hadoop
⭐
8
Hadoop, MapReduce, HDFS, Spark, Pig, Hive, HBase, MongoDB, Cassandra, Flume - the list goes on! Over 25 technologies.
Hadoop Scripts
⭐
8
Scripts for installing Hadoop, HBase, Hive, Pig & Spark. 📄
Talks Hadoop Getting Started
⭐
8
Talk on 'Getting Started with Hadoop'
Hdpcd
⭐
8
This repository contains all the documents related to HDPCD certification.
Vagrant Hadoop Hortonworks Tutorial Centos7
⭐
7
Multinode Hortonworks Hadoop cluster automated with Ambari Blueprints under Vagrant
Hadoop Tutorials
⭐
6
hadoop-tutorials
Awesome Oss Data Analytics
⭐
6
A curated list of open source alternatives for data analytics start-up products.
Hadoop Examples
⭐
6
Examples for Hadoop newbies
Big Data Stack
⭐
6
Hadoop-based Big Data stack (hdfs, yarn, spark, etc)
Hog
⭐
6
Supercharge your Ruby Pig UDFs with Hog.
Jaglion
⭐
6
Tools for doing hybrid cloud hadoop jobs with Azure and Cloudera.
Oss Transform Processing Comparison
⭐
6
Spork Streaming
⭐
6
Pig on Spark Streaming
Dda
⭐
5
S4pigexample
⭐
5
Example of using S4PigWrapper to run S4 PEs using Pig on Hadoop
Vim Pig
⭐
5
Try to Create Multi Programming Language VIM IDE
Stepic_hadoop
⭐
5
Code for Hadoop - Big Data Processing Systems MOOC by stepic.org
Hadoop Example1 Wordcount
⭐
5
Hadoop example1 WordCount
Darkweb
⭐
5
Using Hadoop MapReduce, Pig to generate reports using crawled data from multiple dark net markets and forums.
Mongo Hadoop Test Harness
⭐
5
Test Harness for MongoDB Hadoop project
Pig Hive Wordcount
⭐
5
Wordcount is the "Hello World" for Hadoop, yet most of the Pig and Hive wordcount examples I've seen either require UDFs, external scripts, or they just don't do a very good job of counting words. Here are my Wordcount hacks.
Hadoop Mrutils
⭐
5
Utility/starter/example scripts to get started with Hadoop MapReduce
Related Searches
Java Hadoop (2,117)
Spark Hadoop (1,188)
Hadoop Hdfs (1,082)
Hadoop Mapreduce (851)
Shell Hadoop (772)
Python Hadoop (761)
Hadoop Hive (703)
Apache Hadoop (514)
Scala Hadoop (479)
Hadoop Big Data (388)
1-88 of 88 search results
Privacy
|
About
|
Terms
|
Follow Us On Twitter
Copyright 2018-2024 Awesome Open Source. All rights reserved.