Tour the environment

Pentaho Data Integration

Version
9.3.x
Part Number
MK-95PDIA003-15

The following illustration shows selected data visualized as a Bar chart in Model View.


Data inspection features

The numbered items in the illustration above correspond to the sections of the inspection environment described in the table below.

Item Feature Description
1 Header bar

Use the header bar to access:

  • The title of the step being inspected.
  • The row count of the data that was sampled, up to the default maximum of 50,000 rows.
  • The Publish data source button, which creates a data source for later collaborative use through a data service.
  • The Return to transformation button, which returns you to your transformation.
2 Stream View / Model View

Toggle between the Stream View and Model View modes to inspect data and build visualizations based on the data sampled.

  • Use Stream View to inspect the data using data types and formats from the PDI data stream.
  • Use Model View to inspect the data using a dimensional model that can be adjusted with the Annotate Stream step.
Note: When a visualization mode is not supported, the unsupported view is disabled.
Search box

Use the Search box to find a specific field in the list of available fields. This feature is especially useful in Stream View, where the order of the fields is determined solely by the transformation.
Available fields panel

The available fields panel lists all available fields from the subset of data being inspected. Field types are automatically assigned as the step data are ingested, including:

  • Default fields, which contain default data depending upon the view:
    • In Stream View: data that are neither numeric nor date or timestamp values, including string, Boolean, and other types.
    • In Model View: data that are not measures and are not annotated as location or time hierarchies.
  • Date fields (marked with the date field icon), which contain date data. (Stream View only)
  • Numeric fields (marked with the numeric field icon), which contain numeric data. (Stream View only)
  • Geographic fields (marked with the geographic field icon), which contain location data. (Model View only)
  • Measure fields (marked with the measure field icon), which contain quantitative data. (Model View only)
  • Time fields (marked with the time field icon), which contain time data. (Model View only)

From this panel, you can select the specific fields you want to inspect and exclude others. Selected fields display with a blue disk icon to the left of their names. Click a field to select or clear it, or drag a field into the Layout panel.

  • Select Clear All to remove all fields from the Layout panel, clear all filters from the Filters panel, and clear the canvas.
  • For a flat table in Stream View, click Select All to include all fields in the flat table in the order they are listed.
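The Stream View field types described above can be pictured with a minimal Python sketch. It is illustrative only and is not how PDI assigns types internally. The function name `stream_view_field_type` and the sample field names are hypothetical, and the sketch assumes that one sampled value per field is enough to decide the type.

```python
from datetime import date, datetime
from numbers import Number


def stream_view_field_type(sample_value):
    """Return 'date', 'numeric', or 'default' for one sampled value.

    Hypothetical helper: it mirrors the Stream View categories in the
    text above, not the actual PDI implementation.
    """
    # bool is a subclass of int in Python, so test it first. The text
    # lists Boolean under default fields, not numeric fields.
    if isinstance(sample_value, bool):
        return "default"
    # datetime is a subclass of date, so both land in the date category.
    if isinstance(sample_value, (date, datetime)):
        return "date"
    if isinstance(sample_value, Number):
        return "numeric"
    # Strings and every other type fall back to the default category.
    return "default"


# Hypothetical sample row: field name -> one sampled value.
fields = {
    "customer": "Acme",
    "active": True,
    "order_date": date(2022, 5, 1),
    "total": 199.5,
}
field_types = {name: stream_view_field_type(value) for name, value in fields.items()}
```

In this sketch, `customer` and `active` fall into the default category, `order_date` is a date field, and `total` is a numeric field. Model View categories (geographic, measure, time) depend on the dimensional model and annotations, so they are not covered here.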
3 Visualization selector

Use the visualization selector to choose a visualization type. Selecting a visualization from the drop-down menu displays it on the canvas.
4 Layout panel

The Layout panel displays the available drop zones and the associated field types needed for the selected visualization. Click the header to collapse this panel and expand the Filters panel, if needed.
5 Filters panel

The Filters panel displays all filters applied to a visualization. Click the header to collapse this panel and expand the Layout panel, if needed. To apply a filter, drag a field from the available fields panel into the Filters panel. Keyboard shortcuts are available for many filter options. You can also apply some filtering actions by clicking the visualization. See the Use Filters to Explore Your Data article for more information.
6 Canvas

The canvas displays the visualization you are using for data inspection.
7 Tabs bar
Use the Tabs bar to manage and navigate the tabs:
  • The active tab is always indicated with a blue highlight.
  • Create a tab for another data visualization by duplicating an existing tab or by adding a new tab.
  • Rename a tab.
  • Scroll through multiple tabs.
  • Delete tabs you no longer need.
  • Display a menu (menu icon), which contains options for the selected tab (Duplicate, Delete, and Rename).