Top / Amazon Web Service / AWS Glue / Crawler

AWS Glue Crawler

This page shows how to write Terraform and CloudFormation for AWS Glue Crawler and write them securely.

Review your .tf file for AWS best practices

Shisho Cloud, our free checker to make sure your Terraform configuration follows best practices, is available (beta).

aws_glue_crawler (Terraform)

The Crawler in AWS Glue can be configured in Terraform with the resource name aws_glue_crawler. The following sections describe how to use the resource and its parameters.

Example Usage from GitHub

An example could not be found in GitHub.

Review your Terraform file for AWS best practices

Shisho Cloud, our free checker to make sure your Terraform configuration follows best practices, is available (beta).

Parameters

arn optional computed - string
classifiers optional - list of string
configuration optional - string
database_name required - string
description optional - string
id optional computed - string
name required - string
role required - string
schedule optional - string
security_configuration optional - string
table_prefix optional - string
tags optional - map from string to string
catalog_target list block
- database_name required - string
- tables required - list of string
dynamodb_target list block
- path required - string
- scan_all optional - bool
- scan_rate optional - number
jdbc_target list block
- connection_name required - string
- exclusions optional - list of string
- path required - string
lineage_configuration list block
- crawler_lineage_settings optional - string
mongodb_target list block
- connection_name required - string
- path required - string
- scan_all optional - bool
recrawl_policy list block
- recrawl_behavior optional - string
s3_target list block
- connection_name optional - string
- exclusions optional - list of string
- path required - string
schema_change_policy list block
- delete_behavior optional - string
- update_behavior optional - string

>> from Terraform Registry

Explanation in Terraform Registry

Manages a Glue Crawler. More information can be found in the AWS Glue Developer Guide

>> from Terraform Registry

AWS::Glue::Crawler (CloudFormation)

The Crawler in Glue can be configured in CloudFormation with the resource name AWS::Glue::Crawler. The following sections describe 10 examples of how to use the resource and its parameters.

Example Usage from GitHub

GirijaRaniGavara/provision-codepipeline-glue-workflows-

glue-workflow-stack.yml#L351

    Type: AWS::Glue::Crawler
    DependsOn: GlueRole
    Properties:
      Name: c_aggregated
      Description: !Sub Crawl aggregated datasets at s3://${Covid19Bucket}/covid19/world-cases-deaths-aggregates/
      DatabaseName: !Ref GlueDatabaseName

duyhoang15/test

glue-workflow-stack.yml#L351

    Type: AWS::Glue::Crawler
    DependsOn: GlueRole
    Properties:
      Name: c_aggregated
      Description: !Sub Crawl aggregated datasets at s3://${Covid19Bucket}/covid19/world-cases-deaths-aggregates/
      DatabaseName: !Ref GlueDatabaseName

ozzyince/tv

tvdata-raw-crawler.yml#L3

    Type: AWS::Glue::Crawler
    Properties:
      Name: ${self:custom.stage}-tvdata-raw-crawler
      Role:
        Fn::GetAtt: [RatingRole, Arn]
      DatabaseName:

garystafford/athena-glue-quicksight-demo

smart-hub-athena-glue.yml#L255

    Type: AWS::Glue::Crawler
    Properties:
      Name: smart-hub-locations-csv
      Role: !GetAtt "CrawlerRole.Arn"
      Targets:
        CatalogTargets:

yike5460/quickstart-aws-utility-meter-data-analytics-platform

glue.yml#L23

    Type: "AWS::Glue::Crawler"
    Properties:
      Name: "meter-data-business-aggregated-daily"
      Role: !Sub "service-role/${IAMRole}"
      Targets:
        S3Targets:

gridu/AMAZONBIGDATA_FOR_STUDENTS

glue_cf_template.json#L83

            "Type": "AWS::Glue::Crawler",
            "Properties": {
                "Name": {"Fn::Sub": "${AWS::StackName}-views-crawler"},
                "Role": {"Ref": "glueRole"},
                "DatabaseName": {
                    "Ref": "viewDatabase"

aws-cloudformation/cfn-lint

aws_glue.json#L4

    "path": "/ResourceTypes/AWS::Glue::Crawler/Properties/Role/Value",
    "value": {
      "ValueType": "AWS::IAM::Role.NameOrArn"
    }
  },
  {

vkhadela1985/cfn-lint-test

aws_glue.json#L4

    "path": "/ResourceTypes/AWS::Glue::Crawler/Properties/Role/Value",
    "value": {
      "ValueType": "AWS::IAM::Role.NameOrArn"
    }
  },
  {

ritwik-singh-yash/data-lake

glue.json#L204

            "Type": "AWS::Glue::Crawler",
            "Properties": {
                "Role": {
                    "Ref": "AWSGlueCuratedDatasetsCrawlerRoleName"
                },
                "DatabaseName": {

akashkatakam/INFO-7374-production-data-pipelines

twitter_analytics.json#L102

      "Type": "AWS::Glue::Crawler",
      "Properties": {
        "Name": "raw_crawler",
        "Role": {
          "Fn::GetAtt": [
            "GlueRole",

Parameters

Classifiers optional - List
Description optional - String
SchemaChangePolicy optional - SchemaChangePolicy
Configuration optional - String
RecrawlPolicy optional - RecrawlPolicy
DatabaseName optional - String
Targets required - Targets
CrawlerSecurityConfiguration optional - String
Name optional - String
Role required - String
Schedule optional - Schedule
TablePrefix optional - String
Tags optional - Json

>> from AWS CloudFormation Documentation

Explanation in CloudFormation Registry

The AWS::Glue::Crawler resource specifies an AWS Glue crawler. For more information, see Cataloging Tables with a Crawler and Crawler Structure in the AWS Glue Developer Guide.

>> from AWS CloudFormation Documentation

The Other Related AWS Glue Resources

AWS Glue Catalog Database

AWS Glue Catalog Table

AWS Glue Classifier

AWS Glue Connection

AWS Glue Data Catalog Encryption Settings

AWS Glue Dev Endpoint

AWS Glue Job

AWS Glue Ml Transform

AWS Glue Partition

AWS Glue Partition Index

Frequently asked questions

What is AWS Glue Crawler?

AWS Glue Crawler is a resource for Glue of Amazon Web Service. Settings can be wrote in Terraform and CloudFormation.

Where can I find the example code for the AWS Glue Crawler?

For CloudFormation, the GirijaRaniGavara/provision-codepipeline-glue-workflows-, duyhoang15/test and ozzyince/tv source code examples are useful. See the CloudFormation Example section for further details.

Automate config file reviews on your commits

Fix issues in your infrastructure as code with auto-generated patches.

aws_glue_crawler
AWS::Glue::Crawler
Frequently asked questions

AWS Glue Crawler

Review your .tf file for AWS best practices

aws_glue_crawler (Terraform)

Example Usage from GitHub

Review your Terraform file for AWS best practices

Parameters

Explanation in Terraform Registry

AWS::Glue::Crawler (CloudFormation)

Example Usage from GitHub

Parameters

Explanation in CloudFormation Registry

The Other Related AWS Glue Resources

Frequently asked questions

What is AWS Glue Crawler?

Where can I find the example code for the AWS Glue Crawler?

Automate config file reviews on your commits

Table of Contents