class Aws::Glue::Types::Crawler

Specifies a crawler program that examines a data source and uses classifiers to try to determine its schema. If successful, the crawler records metadata concerning the data source in the Glue Data Catalog.

@!attribute [rw] name

The name of the crawler.
@return [String]

@!attribute [rw] role

The Amazon Resource Name (ARN) of an IAM role that's used to access
customer resources, such as Amazon Simple Storage Service (Amazon
S3) data.
@return [String]

@!attribute [rw] targets

A collection of targets to crawl.
@return [Types::CrawlerTargets]

@!attribute [rw] database_name

The name of the database in which the crawler's output is stored.
@return [String]

@!attribute [rw] description

A description of the crawler.
@return [String]

@!attribute [rw] classifiers

A list of UTF-8 strings that specify the custom classifiers that are
associated with the crawler.
@return [Array<String>]

@!attribute [rw] recrawl_policy

A policy that specifies whether to crawl the entire dataset again,
or to crawl only folders that were added since the last crawler run.
@return [Types::RecrawlPolicy]

@!attribute [rw] schema_change_policy

The policy that specifies update and delete behaviors for the
crawler.
@return [Types::SchemaChangePolicy]

@!attribute [rw] lineage_configuration

A configuration that specifies whether data lineage is enabled for
the crawler.
@return [Types::LineageConfiguration]

@!attribute [rw] state

Indicates whether the crawler is running, or whether a run is
pending.
@return [String]

@!attribute [rw] table_prefix

The prefix added to the names of tables that are created.
@return [String]

@!attribute [rw] schedule

For scheduled crawlers, the schedule when the crawler runs.
@return [Types::Schedule]

@!attribute [rw] crawl_elapsed_time

If the crawler is running, contains the total time elapsed since the
last crawl began.
@return [Integer]

@!attribute [rw] creation_time

The time that the crawler was created.
@return [Time]

@!attribute [rw] last_updated

The time that the crawler was last updated.
@return [Time]

@!attribute [rw] last_crawl

The status of the last crawl, and potentially error information if
an error occurred.
@return [Types::LastCrawlInfo]

@!attribute [rw] version

The version of the crawler.
@return [Integer]

@!attribute [rw] configuration

Crawler configuration information. This versioned JSON string allows
users to specify aspects of a crawler's behavior. For more
information, see [Include and Exclude Patterns][1].

[1]: https://docs.aws.amazon.com/glue/latest/dg/define-crawler.html#crawler-data-stores-exclude
@return [String]

@!attribute [rw] crawler_security_configuration

The name of the `SecurityConfiguration` structure to be used by this
crawler.
@return [String]

@see docs.aws.amazon.com/goto/WebAPI/glue-2017-03-31/Crawler AWS API Documentation

Constants

SENSITIVE