Content tab

Pentaho Data Integration

Version
9.3.x
Audience
anonymous
Part Number
MK-95PDIA003-15


Content tab

In the Content tab, you can specify the format of the text files that are being read.
Option Description
Filetype Select either CSV or Fixed length. Based on this selection, the PDI client launches a different helper GUI when you click Get Fields in the Fields tab.
Separator One or more characters that separate the fields in a single line of text. Typically, this is a semicolon ( ; ) or tab.
Enclosure Some fields can be enclosed by a pair of strings to allow separator characters in fields. The enclosure string is optional.
Allow breaks in enclosed fields This field is either not used by the Spark engine or not implemented for Spark on AEL.
Escape Specify an escape character (or characters) if you have these types of characters in your data. If you have a backslash ( / ) as an escape character, the text Not the nine o\'clock news (with a single quote \[ ' \] as the enclosure) is parsed as Not the nine o'clock news.
Header & Number of header lines Select if your text file has a header row (first lines in the file). Set Header & Number of header lines to 1 (one).
Footer & Number of footer lines These fields are either not used by the Spark engine or not implemented for Spark on AEL.
Wrapped lines & Number of times wrapped These fields are either not used by the Spark engine or not implemented for Spark on AEL.
Paged layout (printout), Number of lines per page, & Document header lines These fields are either not used by the Spark engine or not implemented for Spark on AEL.
Compression This field is either not used by the Spark engine or not implemented for Spark on AEL.
No empty rows This field is either not used by the Spark engine or not implemented for Spark on AEL.
Include filename in output? This field is either not used by the Spark engine or not implemented for Spark on AEL.
Filename fieldname This field is either not used by the Spark engine or not implemented for Spark on AEL.
Rownum in output? This field is either not used by the Spark engine or not implemented for Spark on AEL.
Rownum fieldname & Rownum by file? These fields are either not used by the Spark engine or not implemented for Spark on AEL.
Format Select UNIX.
Encoding & Limit These fields are either not used by the Spark engine or not implemented for Spark on AEL.
Be lenient when parsing dates? This field is either not used by the Spark engine or not implemented for Spark on AEL.
The date format Locale This field is either not used by the Spark engine or not implemented for Spark on AEL.
Add filenames to result This field is either not used by the Spark engine or not implemented for Spark on AEL.