Co-Authored By:
Asked by: Xulian Delbrugge
technology and computing data storage and warehousingWhat is a glue job?
Hereof, can be set in AWS glue?
AWS Glue is serverless, so there's no infrastructure to set up or manage. You can also use the AWS Glue API operations to interface with AWS Glue services. Edit, debug, and test your Python or Scala Apache Spark ETL code using a familiar development environment.
Additionally, how does AWS glue work?
AWS Glue automatically discovers and profiles your data via the Glue Data Catalog, recommends and generates ETL code to transform your source data into target schemas, and runs the ETL jobs on a fully managed, scale-out Apache Spark environment to load your data into its destination.
AWS Glue supports two job types: Apache Spark and Python shell. Note: Libraries and extension modules for Spark jobs must be written in Python. Libraries such as pandas, which is written in C, are not supported.