Use the Transformation menu

Pentaho Data Integration

Version
9.3.x
Audience
anonymous
Part Number
MK-95PDIA003-09

The Transformation menu contains transformation settings, options, and properties.

Right-click any step in the transformation canvas to view the Transformation menu. Each menu item is described in the following table:

Menu Item Description
New Hop Creates a new hop.
Edit Shows the configuration window for the step.
Description Allows you to add a description to the step.
Open Referenced Object Allows you to map a sub-transformation. Mapping a sub-transformation is covered in detail in Mapping.
Inspect Data Allows you to inspect the data stream of a step once the transformation has run.
Note: This option runs your transformation only if it was not previously executed.
Run and Inspect Data Runs the transformation up to the selected step, then lets you inspect your data.
Data Movement Describes the way data moves through the transformation when there is more than one hop. The following options are available:
  • Round Robin: Partitions the output stream and sends a portion of all output records down each hop.
  • Load Balance: Checks the output row sets to see how much room is left in the buffer. It selects the one that is most empty. If the rows are distributed to steps that take very little processing time per row (or the exact same amount of time for each step to process a row), Load Balance is identical to Round Robin. If the rows are sent down one path that takes a long time to process, such as Sort or Group By and down another path that processes rows more quickly, the "quick path" will likely have more rows sent to it, as it will empty its buffer before the "slow path" has a chance to empty its buffer. This is typically used for clustered transformations, where the same processing occurs on different nodes. The row buffer is set, by default, to 10000. To change the row buffer size, open the Transformation Settings window, then select Nr of rows in rowset on the Miscellaneous tab.
  • Copy Data to Next Steps: Copies the data to subsequent steps.
Change Number of Copies to Start Starts several instances of a step in parallel.
Copy Copies selected items to the clipboard.
Duplicate Makes a copy of the selected items, then pastes them to the canvas.
Delete Deletes selected items from the canvas.
Hide

Hides the step from the PDI client canvas.

CAUTION:

If you hide the step, you will need to open the transformation or job XML file and hand edit it to view it again. For more details, see the troubleshooting section.

Detach Detaches the step or entry from the transformation or job.
Input Fields Shows metadata, like the field name and type, for fields that come into the step.
Output Fields Shows metadata, like the field name and type, for fields that go out of the step.
Sniff Test During Execution The sniff test displays data as it travels from one step to another in the stream. To use this, right-click a step in the transformation as it runs and select Sniff Test During Execution. The following options are available:
  • Sniff test input rows: Shows the data inputted into the step.
  • Sniff test output rows: Shows the data outputted from the step.
  • Sniff test error handling: Shows error handling data.

For more information on how to use this tool, see the Sniff Test tool article.

Check Selected Step(s) Checks transformation steps for problems that could interfere with successfully running the transformation. Right-click the transformation step that you want to check and click Check Selected Step(s). Warnings and errors appear in the Results of transformation checks window.
Error Handling Indicates how to apply error handling for a step. When this is selected, the Step error handling settings window appears.
Preview Allows you to preview the results of the transformation. Launches the Transformation Debug Dialog.
Align/Distribute Arranges steps on the canvas so that they are aligned properly or distributed evenly. This helps create a visually pleasing transformation that is easier to read and digest.

Align refers to where the steps are permitted along the x (horizontal) or y (vertical) axis. Distribute makes the horizontal and vertical spacing between steps consistent. Typically, you turn on the grid, then move the different steps or entries on the canvas so that they form some sort of pattern, like a straight or branching line.

You select steps and apply the following options, as needed:

  • Align Left: Positions all steps so their left sides start on the same "x" (horizontal) coordinate as the left-most step. After applied, steps are arranged in a straight vertical line. No changes are made to the spaces between steps.
  • Align Right: Positions all steps so their right sides start on the same "x" (horizontal) coordinate as the right-most step. After applied, steps are arranged in a straight vertical line. No changes are made to the spaces between steps.
  • Align Top: Positions all steps so their top sides start on the same "y" (vertical) coordinate as the step positioned closest to the top of the canvas. After applied, steps are arranged in a straight horizontal line. No changes are made to the spaces between steps.
  • Align Bottom: Positions all steps so their bottom sides start on the same "y" (vertical) coordinate as the step positioned closest to the bottom of the canvas. After applied, steps are arranged in a straight horizontal line. No changes are made to the spaces between steps.
  • Distribute Horizontally: Positions all steps so that they are evenly spaced horizontally. After applied, steps are arranged evenly. No changes are made to the alignment.
  • Distribute Vertically: Positions all steps so that they are evenly spaced vertically. After applied, steps are arranged evenly. No changes are made to the alignment.
  • Snap to Grid: Aligns steps on the canvas to the grid. If grid markers do not appear on the canvas, select Tools > Options > Look & Feel > Show Canvas Grid. See PDI client options for more information on how to customize the PDI client.
Data Services Create, edit, delete, or test a Pentaho Data Service. The Pentaho Data Service allows others to obtain the results of a transformation, even if the person does not have the PDI client or Pentaho Server installed. The Pentaho Data Service is discussed in great detail in Pentaho Data Services.
Mapping Provides a way for you to map target fields from the step to source columns in a database. When selected, the Mapping window appears containing the following fields:
  • Source Fields: Lists the field names from the incoming stream.
  • Target Fields: Lists the column names in a target table.
  • Auto Target Selection: Automatically selects a matching table column if the target field is selected.
  • Auto Source Select: Automatically selects a matching target field if the table column is selected.
  • Add: Allows you to move the mapped target and source information to the mappings grid.
  • Guess: Makes mappings based on a computer algorithm.
  • Hide assigned source fields and Hide assigned target fields: Removes mappings from the Source Fields and Target Fields lists those fields are added to the mapping grid.
  • Delete: Removes mappings from the mapping grid so that they reappear in the Target Fields and Source Fields lists again.

When you click OK, the Mapping window closes and a Select / Rename Values step appears on the canvas. It is usually named after the step that right-clicked. The Select / Rename Values window contains the mappings. If you are not able to make mappings, the step still appears, but the properties are blank.

Partitions Splits data into subsets according to a rule that is applied on a row of data. Partitions are discussed in detail in the Partitioning data article.
Clusters Allows you to create Carte clusters. For more information, see Use Carte Clusters.