From the course: AWS Certified Data Engineer Associate (DEA-C01) Cert Prep

Unlock this course with a free trial

Join today to access over 25,300 courses taught by industry experts.

AWS Glue

AWS Glue

- [Instructor] AWS Glue is one of the most important services for data pipelines. In this lesson we'll introduce AWS Glue and some of its capabilities. AWS Glue is a fully managed serverless service that makes it easy to extract, transform, and load your data in your data pipeline. It supports many use cases for ingestion, including ETL, ELT, batch and streaming. You could use it for preparing data for analytics, and it runs Glue jobs on a fully managed and scalable Apache Spark environment. Glue Studio features a drag and drop editor, so you can create ETL jobs without writing code. Glue Crawlers can discover and classify data in schemas and then store the metadata in the Glue data catalog for you. This could be done for a variety of data stores, including S3, RedShift, RDS, DynamoDB, MongoDB, and other stores over at JDBC connection.

Contents