This document discusses using Amazon Elastic MapReduce (EMR) for scalable data processing. EMR allows running Apache Hadoop on the scalable resources of Amazon EC2 and storing data in Amazon S3. It provides a simple web interface and command line tools to define MapReduce jobs that can process large amounts of data across many servers. Examples shown include using EMR to perform word counting on text data and single-nucleotide polymorphism analysis on genomic sequencing data stored in S3.