The following table lists the worker processes:
Process | Description | Actions performed |
---|---|---|
Test Connection | Returns detailed success or failure information for each step of the test. Data Catalog starts this worker process when you configure or update a data source connection. Data Catalog marks the data source “OFFLINE” until a successful test completes. | |
Metadata Ingest | Ingests the metadata for one or more schemas. | |
Data Profiling | Generates a variety of statistics and intermediate data with a single pass through the source data. Typically, this is the first process you run on your data. | |
Data Identification | Identifies and tags columns and tables using ontology information (dictionaries, aliases), along with underlying data and metadata. | |
Key Discovery | Performs a variety of key discovery actions. Foreign key discovery requires that Data Profiling of the data sources has completed. | |
Data Quality | Performs a full data quality (DQ) analysis on the underlying data, using regular expressions and other configurable business rules. | |
Sensitive Data Discovery (SDD) | Performs the SDD tasks that go beyond data identification. This process uses flows, lineage, foreign keys, and more to assemble the items that make up PI and PII. | |
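
The table states two ordering constraints: a data source stays “OFFLINE” until a connection test succeeds, and foreign key discovery requires completed Data Profiling. The sketch below models those two rules so you can see how they interact. It is an illustration only and does not use any Data Catalog API. The `DataSource` class, the `PREREQUISITES` map, and the `ONLINE` status label are hypothetical names introduced here. Treating the whole Key Discovery process as dependent on profiling is also a simplification, because the table names only foreign key discovery.

```python
from dataclasses import dataclass, field

# Process names as listed in the table above.
TEST_CONNECTION = "Test Connection"
DATA_PROFILING = "Data Profiling"
KEY_DISCOVERY = "Key Discovery"

# Hypothetical prerequisite map. The single entry reflects the table's note
# that foreign key discovery needs Data Profiling to have completed.
PREREQUISITES = {KEY_DISCOVERY: {DATA_PROFILING}}


@dataclass
class DataSource:
    """Illustrative stand-in for a data source; not a Data Catalog class."""

    name: str
    completed: set = field(default_factory=set)

    @property
    def status(self) -> str:
        # The table says the source is OFFLINE until a test succeeds.
        # "ONLINE" is an assumed label for the opposite state.
        return "ONLINE" if TEST_CONNECTION in self.completed else "OFFLINE"

    def missing_prerequisites(self, process: str) -> set:
        """Return prerequisite processes that have not completed yet."""
        return PREREQUISITES.get(process, set()) - self.completed

    def mark_completed(self, process: str) -> None:
        """Record that a worker process finished successfully."""
        self.completed.add(process)
```

For example, a new `DataSource("sales_db")` reports `OFFLINE`, and `missing_prerequisites(KEY_DISCOVERY)` returns a set containing `"Data Profiling"` until that process is marked completed.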