Terraform Aws Emr Cluster

A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.
Alternatives To Terraform Aws Emr Cluster
Project NameStarsDownloadsRepos Using ThisPackages Using ThisMost Recent CommitTotal ReleasesLatest ReleaseOpen IssuesLicenseLanguage
Elastic Mapreduce Ruby86
9 years ago8apache-2.0Ruby
Amazon's elastic mapreduce ruby client. Ruby 1.9.X compatible
Lemur85
6 years ago8apache-2.0Clojure
Lemur is a tool to launch hadoop jobs locally or on EMR, based on a configuration file, referred to as a jobdef. The jobdef file describes your EMR cluster, local environment, pre- and post-actions and zero or more "steps".
Rail70
3 years ago26otherPython
Scalable RNA-seq analysis
Social Graph Analysis56
12 years agootherPython
Social Graph Analysis using Elastic MapReduce and PyPy
Elasticrawl50
17 years ago10February 15, 20171mitRuby
Launch AWS Elastic MapReduce jobs that process Common Crawl data.
Terraform Aws Emr Cluster35
4 years ago3apache-2.0HCL
A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.
Cc Helloworld33
9 years ago1Java
CommonCrawl Hello World example
Emrio30
9 years agoPython
Elastic MapReduce instance optimizer
Ceteri Mapred19
12 years agoPython
MapReduce examples
Spark Emr17
10 years ago9Scala
Spark Elastic MapReduce bootstrap and runnable examples.
Alternatives To Terraform Aws Emr Cluster
Select To Compare


Alternative Project Comparisons
Readme

terraform-aws-emr-cluster

A Terraform module to create an Amazon Web Services (AWS) Elastic MapReduce (EMR) cluster.

Usage

data "template_file" "emr_configurations" {
  template = "${file("configurations/default.json")}"
}

module "emr" {
  source = "github.com/azavea/terraform-aws-emr-cluster?ref=0.1.0"

  name          = "DatarpocCluster"
  vpc_id        = "vpc-20f74844"
  release_label = "emr-5.9.0"

  applications = [
    "Hadoop",
    "Ganglia",
    "Spark",
    "Zeppelin",
  ]

  configurations = "${data.template_file.emr_configurations.rendered}"
  key_name       = "hector"
  subnet_id      = "subnet-e3sdf343"

  instance_groups = [
    {
      name           = "MasterInstanceGroup"
      instance_role  = "MASTER"
      instance_type  = "m3.xlarge"
      instance_count = "1"
    },
    {
      name           = "CoreInstanceGroup"
      instance_role  = "CORE"
      instance_type  = "m3.xlarge"
      instance_count = "1"
      bid_price      = "0.30"
    },
  ]

  bootstrap_name = "runif"
  bootstrap_uri  = "s3://elasticmapreduce/bootstrap-actions/run-if"
  bootstrap_args = []
  log_uri        = "s3n://.../"

  project     = "Something"
  environment = "Staging"
}

Variables

  • name - Name of EMR cluster
  • vpc_id - ID of VPC meant to house cluster
  • release_label - EMR release version to use (default: emr-5.8.0)
  • applications - A list of EMR release applications (default: ["Spark"])
  • configurations - JSON array of EMR application configurations
  • key_name - EC2 Key pair name
  • subnet_id - Subnet used to house the EMR nodes
  • instance_groups - List of objects for each desired instance group (see section below)
  • bootstrap_name - Name for the bootstrap action
  • bootstrap_uri - S3 URI for the bootstrap action script
  • bootstrap_args - A list of arguments to the bootstrap action script (default: [])
  • log_uri - S3 URI of the EMR log destination, must begin with s3n:// and end with trailing slashes
  • project - Name of project this cluster is for (default: Unknown)
  • environment - Name of environment this cluster is targeting (default: Unknown)

Instance Group Example

[
    {
      name           = "MasterInstanceGroup"
      instance_role  = "MASTER"
      instance_type  = "m3.xlarge"
      instance_count = 1
    },
    {
      name           = "CoreInstanceGroup"
      instance_role  = "CORE"
      instance_type  = "m3.xlarge"
      instance_count = "1"
      bid_price      = "0.30"
    },
]

Outputs

  • id - The EMR cluster ID
  • name - The EMR cluster name
  • master_public_dns - The EMR master public FQDN
  • master_security_group_id - Security group ID of the master instance/s
  • slave_security_group_id - Security group ID of the slave instance/s
Popular Mapreduce Projects
Popular Elastic Projects
Popular Data Processing Categories
Related Searches

Get A Weekly Email With Trending Projects For These Categories
No Spam. Unsubscribe easily at any time.
Amazon Web Services
Aws
Hcl
Terraform
Amazon
Elastic
Mapreduce