Amazon Elastic MapReduceΒΆ
- EMR stands for "Elastic MapReduce"
- EMR helps creating [[Hadoop]] clusters ([[Big Data]]) to analyse and process vast amounts of data
- The clusters can be made of hundreds of AWS EC2 instances
- Also supports [[Apache Spark]], [[HBase]], [[Presto]], [[Flink]]
- EMR takes care of all the provisioning and configuration
- [[Auto-scaling]] and integrated with EC2 Spot Instancess
- Use cases: [[data processing]], [[machine learning]], [[web indexing]], [[big data]]