Cc Helloworld

CommonCrawl Hello World example

This is a simple library that demonstrates analysis of the CommonCrawl dataset by implementing the canonical Hadoop Hello World program: a simple word counter.
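For readers new to the word-count pattern, here is a minimal sketch of the idea in plain Java, with no Hadoop dependency. It is not this project's code: the class name WordCountSketch and its methods are hypothetical, and the "map" and "reduce" steps only mimic, in memory, what a Hadoop job would do across a cluster.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical illustration only; not part of the cc-helloworld code base.
public class WordCountSketch {

    // "Map" step: emit a (word, 1) pair for every whitespace-separated token.
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String token : line.trim().split("\\s+")) {
            if (!token.isEmpty()) {
                pairs.add(Map.entry(token.toLowerCase(), 1));
            }
        }
        return pairs;
    }

    // "Reduce" step: sum the counts emitted for each distinct word.
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> totals = new TreeMap<>();
        for (Map.Entry<String, Integer> pair : pairs) {
            totals.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return totals;
    }

    // Run both steps over a list of input lines.
    static Map<String, Integer> countWords(List<String> lines) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        for (String line : lines) {
            pairs.addAll(map(line));
        }
        return reduce(pairs);
    }

    public static void main(String[] args) {
        // prints {commoncrawl=1, hello=2, world=1}
        System.out.println(countWords(List.of("hello world", "Hello CommonCrawl")));
    }
}
```

In a real Hadoop job the framework groups the emitted pairs by key between the two steps; the sketch folds that grouping into the reduce method for brevity.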

To build

You'll need to have Apache Ant (http://ant.apache.org/manual/install.html) installed. Once you do, run:

ant dist

This step compiles the libraries and Hadoop code into an Elastic MapReduce-friendly JAR at dist/lib/HelloWorld.jar, suitable for use as a custom JAR-based Elastic MapReduce workflow.

To run locally

You'll need a running Hadoop installation. If you don't have one, Cloudera provides a useful set of OS-specific Hadoop packages that make setup easy. Check out their site:

https://ccp.cloudera.com/display/SUPPORT/Downloads

Once you've got Hadoop installed, you can use the 'hadoop jar' command to execute the tutorial code. Here's the pattern:

hadoop jar /dist/lib/HelloWorld.jar org.commoncrawl.tutorial.HelloWorld
