When Parquet data is read, the table defines the fields to read as input from the Parquet file.
Enter the information for the Catalog Input step fields, as shown in the following table.
Column | Description |
---|---|
Path | The name of the field as it appears in the Parquet data file, and the Parquet data type. An associated PDI field type is provided in parentheses. |
Name | The name of the input field. |
Type | The type of the input field as detected by PDI. |
Format | Specify the Date formats when the Type specified is Date. |
Get Fields (button) | Click to retrieve a list of fields derived from the source file in Data Catalog. |
Provide a path to a Parquet data file and click Get Fields. When the fields are retrieved, the Parquet type is converted to an applicable PDI type, as shown in the PDI types table. You can change the type by using the Type drop-down menu or by entering the type manually.