Amazon Elastic MapReduce

  • EMR stands for "Elastic MapReduce"
  • EMR helps create [[Hadoop]] clusters ([[Big Data]]) to analyse and process vast amounts of data
  • The clusters can be made of hundreds of AWS EC2 instances
  • Also supports [[Apache Spark]], [[HBase]], [[Presto]], [[Flink]]
  • EMR takes care of all the provisioning and configuration
  • Supports [[Auto-scaling]] and integrates with EC2 Spot Instances
  • Use cases: [[data processing]], [[machine learning]], [[web indexing]], [[big data]]
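
As a rough illustration of the points above (a cluster of EC2 instances, Spark alongside Hadoop, Spot Instances), here is a minimal sketch that only builds the request parameters for the EMR `RunJobFlow` API and makes no AWS call. The helper name `build_cluster_request`, the cluster name, the release label `emr-6.15.0`, the instance type `m5.xlarge`, and the node counts are all assumptions chosen for illustration; the two role names are the EMR defaults and may not exist in a given account.

```python
def build_cluster_request(name, core_count, spot_task_count,
                          release_label="emr-6.15.0",   # assumed release, adjust as needed
                          instance_type="m5.xlarge"):   # assumed instance type
    """Build keyword arguments for the EMR RunJobFlow API (no AWS call is made)."""

    def group(role, market, count):
        # One instance group per node role: MASTER, CORE or TASK.
        return {
            "Name": f"{role.lower()}-nodes",
            "InstanceRole": role,
            "Market": market,            # "ON_DEMAND" or "SPOT"
            "InstanceType": instance_type,
            "InstanceCount": count,
        }

    groups = [
        group("MASTER", "ON_DEMAND", 1),
        group("CORE", "ON_DEMAND", core_count),
    ]
    if spot_task_count > 0:
        # Task nodes hold no HDFS data, so they are a common fit for Spot capacity.
        groups.append(group("TASK", "SPOT", spot_task_count))

    return {
        "Name": name,
        "ReleaseLabel": release_label,
        "Applications": [{"Name": "Hadoop"}, {"Name": "Spark"}],
        "Instances": {
            "InstanceGroups": groups,
            # Terminate the cluster once all submitted steps have finished.
            "KeepJobFlowAliveWhenNoSteps": False,
        },
        "JobFlowRole": "EMR_EC2_DefaultRole",  # default EC2 instance profile name
        "ServiceRole": "EMR_DefaultRole",      # default EMR service role name
    }


request = build_cluster_request("demo-cluster", core_count=2, spot_task_count=4)
```

With `boto3` installed and AWS credentials configured, a dictionary like this could be passed as `boto3.client("emr").run_job_flow(**request)`; EMR then handles provisioning and configuring the instances.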