class Aws::Glue::Types::Crawler
Specifies a crawler program that examines a data source and uses classifiers to try to determine its schema. If successful, the crawler records metadata concerning the data source in the Glue
Data Catalog.
@!attribute [rw] name
The name of the crawler. @return [String]
@!attribute [rw] role
The Amazon Resource Name (ARN) of an IAM role that's used to access customer resources, such as Amazon Simple Storage Service (Amazon S3) data. @return [String]
@!attribute [rw] targets
A collection of targets to crawl. @return [Types::CrawlerTargets]
@!attribute [rw] database_name
The name of the database in which the crawler's output is stored. @return [String]
@!attribute [rw] description
A description of the crawler. @return [String]
@!attribute [rw] classifiers
A list of UTF-8 strings that specify the custom classifiers that are associated with the crawler. @return [Array<String>]
@!attribute [rw] recrawl_policy
A policy that specifies whether to crawl the entire dataset again, or to crawl only folders that were added since the last crawler run. @return [Types::RecrawlPolicy]
@!attribute [rw] schema_change_policy
The policy that specifies update and delete behaviors for the crawler. @return [Types::SchemaChangePolicy]
@!attribute [rw] lineage_configuration
A configuration that specifies whether data lineage is enabled for the crawler. @return [Types::LineageConfiguration]
@!attribute [rw] state
Indicates whether the crawler is running, or whether a run is pending. @return [String]
@!attribute [rw] table_prefix
The prefix added to the names of tables that are created. @return [String]
@!attribute [rw] schedule
For scheduled crawlers, the schedule when the crawler runs. @return [Types::Schedule]
@!attribute [rw] crawl_elapsed_time
If the crawler is running, contains the total time elapsed since the last crawl began. @return [Integer]
@!attribute [rw] creation_time
The time that the crawler was created. @return [Time]
@!attribute [rw] last_updated
The time that the crawler was last updated. @return [Time]
@!attribute [rw] last_crawl
The status of the last crawl, and potentially error information if an error occurred. @return [Types::LastCrawlInfo]
@!attribute [rw] version
The version of the crawler. @return [Integer]
@!attribute [rw] configuration
Crawler configuration information. This versioned JSON string allows users to specify aspects of a crawler's behavior. For more information, see [Include and Exclude Patterns][1]. [1]: https://docs.aws.amazon.com/glue/latest/dg/define-crawler.html#crawler-data-stores-exclude @return [String]
@!attribute [rw] crawler_security_configuration
The name of the `SecurityConfiguration` structure to be used by this crawler. @return [String]
@see docs.aws.amazon.com/goto/WebAPI/glue-2017-03-31/Crawler AWS API Documentation
Constants
- SENSITIVE