Support for Spark 2.3.0 on Amazon EMR Release 5.13.0

Posted on: Apr 10, 2018

You can now use Apache Spark 2.3.0, Apache HBase 1.4.2, and Presto 0.194 on Amazon EMR release 5.13.0. Spark 2.3.0 adds several new features and updates, including continuous processing mode in Structured Streaming for lower end-to-end latency, an improved ORC file format reader that supports vectorized reads and improves scan throughput, PySpark and Pandas interoperability improvements. improvements. HBase 1.4.2 and Presto 0.194 includes various bug fixes and improvements. Additionally, the AWS SDK included on your Amazon EMR clusters is now updated to 1.11.297. 

You can create an Amazon EMR cluster with the release 5.13.0 by choosing the release label “emr-5.13.0” from the AWS Management Console, AWS CLI, or SDK. You can select Spark, HBase, and Presto to install these applications when you launch your EMR cluster. Please visit the Amazon EMR documentation for more information about EMR release 5.13.0, HBase 1.4.2, and Presto 0.194.

Amazon EMR release 5.13.0 is now available in all supported regions for Amazon EMR.