AWS Glue UPDATED ACTUAL Exam Questions and CORRECT
Answers
How do I ETL my data? - FOUR STEPS:
1. CRAWL
2. Map - the process of going from the source schema to the target schema
3. Edit and Explore
4. Schedule
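The Map step can be sketched in plain Python. This is only an illustration, not Glue code: the (source field, source type, target field, target type) tuple shape is modeled on what Glue's ApplyMapping transform accepts, while the field names, the CASTS table and the apply_mapping helper are assumptions made up for the example.

```python
from typing import Any

# Each entry: (source_field, source_type, target_field, target_type).
Mapping = tuple[str, str, str, str]

# Hypothetical cast table for the handful of types used in this sketch.
CASTS = {"string": str, "int": int, "long": int, "double": float}


def apply_mapping(record: dict[str, Any], mappings: list[Mapping]) -> dict[str, Any]:
    """Rename and cast the mapped fields; unmapped fields are dropped."""
    out: dict[str, Any] = {}
    for src, _src_type, dst, dst_type in mappings:
        if src not in record:
            continue  # source field absent from this record
        out[dst] = CASTS[dst_type](record[src])
    return out


# Hypothetical source-to-target schema mapping.
MAPPINGS: list[Mapping] = [
    ("id", "string", "customer_id", "long"),
    ("amt", "string", "amount", "double"),
    ("name", "string", "customer_name", "string"),
]

row = {"id": "42", "amt": "19.90", "name": "Ada", "unused": "x"}
mapped = apply_mapping(row, MAPPINGS)
print(mapped)
```

In a real job the same list of tuples would typically be handed to the transform rather than applied by hand; the sketch only shows what "source to target schema" means for a single record.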
Glue Data Catalog: Crawlers - 1. Automatically discover new data and extract schema
definitions
a. Detect schema changes and version tables
b. Detect Apache Hive style partitions on Amazon S3
2. Built-in "classifiers" for popular data types
a. Custom classifiers using Grok expressions
3. Run on demand or on a schedule; SERVERLESS - pay only while the crawler runs
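The crawler features above can be sketched as a request builder. This is a minimal sketch under assumptions: the crawler name, role ARN, database, bucket path, classifier name and Grok pattern are all hypothetical, and the dictionary keys follow the parameters of the boto3 Glue client's create_crawler and create_classifier calls as best understood. The actual AWS call is isolated in a function that is never invoked here, so the rest runs without credentials.

```python
import re


def parse_hive_partitions(key: str) -> dict[str, str]:
    """Pull Hive-style key=value folder segments out of an S3 object key."""
    return dict(re.findall(r"([^/=]+)=([^/]+)/", key))


def build_grok_classifier(name: str, classification: str, pattern: str) -> dict:
    """Keyword arguments for a custom Grok classifier."""
    return {
        "GrokClassifier": {
            "Name": name,
            "Classification": classification,
            "GrokPattern": pattern,
        }
    }


def build_crawler_request(name, role_arn, database, s3_path,
                          schedule=None, classifiers=()):
    """Keyword arguments for a crawler over one S3 path."""
    request = {
        "Name": name,
        "Role": role_arn,
        "DatabaseName": database,
        "Targets": {"S3Targets": [{"Path": s3_path}]},
        # Update the catalog table when the crawler detects a schema change.
        "SchemaChangePolicy": {
            "UpdateBehavior": "UPDATE_IN_DATABASE",
            "DeleteBehavior": "LOG",
        },
    }
    if schedule:  # omit Schedule to run the crawler on demand only
        request["Schedule"] = schedule
    if classifiers:
        request["Classifiers"] = list(classifiers)
    return request


def create_crawler(request: dict) -> None:
    """Send the request to AWS (not called in this sketch)."""
    import boto3  # imported lazily so the builders run without AWS access
    boto3.client("glue").create_crawler(**request)


# All names below are hypothetical.
classifier = build_grok_classifier(
    "app-log-classifier", "app_logs",
    "%{TIMESTAMP_ISO8601:ts} %{LOGLEVEL:level} %{GREEDYDATA:message}",
)
request = build_crawler_request(
    "sales-crawler",
    "arn:aws:iam::123456789012:role/ExampleGlueRole",
    "sales_db",
    "s3://example-bucket/sales/",
    schedule="cron(0 2 * * ? *)",  # every day at 02:00 UTC
    classifiers=["app-log-classifier"],
)
partitions = parse_hive_partitions("sales/year=2024/month=01/part-0000.parquet")
print(partitions)
```

The parse_hive_partitions helper only imitates what the crawler infers from folder names such as year=2024/month=01; the crawler itself does this detection without any user code.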
Crawlers: Classifiers - 1. JDBC connection
a. Databases (Amazon RDS, others)
b. Data Warehouse (Amazon Redshift)
2. Object connection
a. Data Lakes
(i). Amazon S3
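The two connection types can be sketched as crawler targets. This assumes the Targets structure used by the boto3 Glue client's create_crawler call (JdbcTargets and S3Targets); the connection name, include path and bucket are hypothetical, and build_targets is a helper invented for the example.

```python
def build_targets(s3_paths=(), jdbc=()):
    """Build a crawler Targets dict.

    s3_paths: iterable of S3 URIs (object connection, e.g. a data lake).
    jdbc: iterable of (connection_name, include_path) pairs, where the
          connection is a Glue connection to a database or data warehouse.
    """
    targets = {}
    if s3_paths:
        targets["S3Targets"] = [{"Path": p} for p in s3_paths]
    if jdbc:
        targets["JdbcTargets"] = [
            {"ConnectionName": name, "Path": path} for name, path in jdbc
        ]
    return targets


# Hypothetical targets: one Redshift connection and one S3 data lake prefix.
targets = build_targets(
    s3_paths=["s3://example-bucket/lake/"],
    jdbc=[("example-redshift-connection", "dev/public/%")],
)
print(targets)
```

The JDBC include path follows a database/schema/table form, with % as a wildcard, so "dev/public/%" would cover every table in the public schema of a database named dev.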