Query tab

Pentaho Data Integration

Version
9.3.x
Part Number
MK-95PDIA003-15
Query tab in the Query HCP step

In this tab, specify the VFS connection to connect to your HCP repository, and then refine your search with a query statement and other options.

Option Description
HCP VFS Connection From the drop-down list, select the VFS connection you created for this transformation to connect to your HCP repository.
Query statement Specify your HCP Metadata Query Engine (MQE) search statement for finding particular objects. For example, to find objects of 2000 MB or greater, specify +(size[2000 TO *]).

You can use the metadata query engine to generate a Query statement for PDI.

  1. In the HCP Namespace Browser, specify your filters in the Structured Query tab.
  2. Click Show as advanced query. The related query statement is generated under the Advanced Query tab in the HCP Namespace Browser.
  3. Copy the resulting HCP query statement into this PDI Query statement option.
See Working with structured searches and Working with advanced searches for more details.
Sort results by Select which object properties you want to use to sort the results in alphabetical order.

You can sort in either Ascending or Descending order.

Number of rows to skip (Optional) Specify the number of resulting objects to skip. This option is useful when setting up a paged query. See the "Paging through objects" section below for information about paged queries.
Number of results to return (Optional) Specify the maximum number of objects to return. The maximum is 10,000 objects. If you do not specify a value for Number of results to return, HCP returns the properties of up to 100 objects.
The following special values are also possible for this option:
  -1: Returns the properties of all the objects, up to 10,000 objects.
  0: Returns just the total count of objects in the HCP repository.

Hitachi Vantara recommends working with only a few thousand results at a time for the best performance.
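
The rules for Number of results to return can be summarized in a short Python sketch. This is only an illustration of the behavior described in the table: the function name effective_limit and its constants are hypothetical and are not part of PDI or HCP.

```python
from typing import Optional

MAX_RESULTS = 10_000   # upper bound stated for "Number of results to return"
DEFAULT_RESULTS = 100  # applies when the option is left empty


def effective_limit(value: Optional[int]) -> int:
    """Return the maximum number of objects whose properties come back.

    Hypothetical helper: it only mirrors the documented rules and does
    not call PDI or HCP.
    """
    if value is None:
        return DEFAULT_RESULTS   # option left empty
    if value == -1:
        return MAX_RESULTS       # all objects, capped at 10,000
    if value == 0:
        return 0                 # count-only query: no object properties
    if value < -1 or value > MAX_RESULTS:
        raise ValueError(f"value must be between -1 and {MAX_RESULTS}")
    return value
```

For example, effective_limit(None) gives 100 and effective_limit(-1) gives 10000, while a value such as 2500 is passed through unchanged.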

Paging through objects

You can use Number of rows to skip and Number of results to return to page through the objects in your HCP repository. First, query the repository with Number of results to return set to 0 to obtain the total count of objects. Then, query the repository for a specific number of objects at an offset location by setting Number of rows to skip. Within a PDI transformation, you can loop over the Query HCP step with a variable offset to page through the objects in the HCP repository. See Paged queries with object-based requests for more details on how to page through an HCP repository.
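
The paging arithmetic described above can be sketched in Python as follows. This is a minimal illustration, assuming you already obtained the total count from a first query with Number of results to return set to 0. The function page_offsets is a hypothetical helper, not a PDI or HCP API; in a real transformation the loop would be driven by a PDI variable rather than by Python.

```python
from typing import Iterator, Tuple

MAX_RESULTS = 10_000  # documented maximum for "Number of results to return"


def page_offsets(total_count: int, page_size: int) -> Iterator[Tuple[int, int]]:
    """Yield (rows_to_skip, results_to_return) pairs that cover all objects.

    Hypothetical helper: it only computes the values you would enter in
    the two options for each pass over the Query HCP step.
    """
    if total_count < 0:
        raise ValueError("total_count must not be negative")
    if not 1 <= page_size <= MAX_RESULTS:
        raise ValueError(f"page_size must be between 1 and {MAX_RESULTS}")
    for skip in range(0, total_count, page_size):
        # The last page may hold fewer objects than page_size.
        yield skip, min(page_size, total_count - skip)


# Example: 2,500 objects read in pages of 1,000.
pages = list(page_offsets(2500, 1000))
print(pages)  # [(0, 1000), (1000, 1000), (2000, 500)]
```

A page size of about 1,000 is used here only to stay within the recommendation of working with a few thousand results at a time.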