From the course: AWS Certified Data Engineer Associate (DEA-C01) Cert Prep

Introduction

- Hey guys, and welcome to this section on data transformation. In this section, you'll learn how to transform data and make it usable by downstream applications. When you do this, you need to consider the challenges that arise when you process data at scale: high volume, high velocity, and wide variety. You're going to learn about Amazon EMR (Elastic MapReduce), a service hosted on AWS that is designed to meet these requirements and challenges. It's a widely used big data platform and supports frameworks such as Hadoop and Spark. You'll learn how to handle streaming data versus batch data, and you'll see some low-code or no-code options on AWS, like Glue DataBrew, which lets you transform data using a visual interface. And you're going to get plenty of hands-on experience with both EMR and DataBrew in this section.
