AWS Glue interview questions

21. How does AWS Glue monitor dependencies ?
AWS Glue manages dependencies between two or more jobs or dependencies on external events using triggers. Triggers can watch one or more jobs as well as invoke one or more jobs.

22. How to get metadata into the AWS Glue Data Catalog ?
Glue crawlers scan various data stores you own to automatically infer schemas and partition structure and populate the Glue Data Catalog with corresponding table definitions and statistics.

23. What is bookmarks in AWS glue ?
AWS Glue tracks data that has already been processed during a previous run of an ETL job by persisting state information from the job run. This persisted state information is called a job bookmark. Job bookmarks help AWS Glue maintain state information and prevent the reprocessing of old data.

Author: user

Leave a Reply