Configure query tab

Pentaho Data Integration

Version
9.3.x
Audience
anonymous
Part Number
MK-95PDIA003-15

Before a value can be read from HBase, you must specify the type and column family of the value, and the type of the table key. You must define a mapping to use a source table. You can output some or all of the fields defined in the mapping. Rows from the table may be deleted to select a subset of the fields. Clearing all rows from the table indicates that all fields defined in the mapping should be output.

This tab contains connection details and basic query information. You can configure a connection by using the Hadoop cluster properties, or by using an hbase-site.xmland (an optional) hbase-default.xml configuration file.


Configure query tab

This tab includes the following fields:
Option Description
Hadoop Cluster
Click the Hadoop Cluster drop-down menu to select an existing Hadoop cluster configuration.
  • Click to Edit an existing Hadoop cluster configuration.
  • Click New to add a new Hadoop cluster configuration.
Refer to Connecting to a Hadoop cluster with the PDI client for information on creating and editing a Hadoop cluster.
URL to hbase-site.xml Specify the address of the hbase-site.xml file by entering its path or clicking Browse.
URL to hbase-default.xml Specify the address of the hbase-default.xml file by entering its path or clicking Browse.
HBase table name Select the name of the source HBase table you want to read.
Get mapped table names (button) Click to retrieve a list of all existing table names for the HBase table name field. Only table names that have been mapped are retrieved. If you enter the namespace:tablename in the HBase table name field, then click Get mapped table names, only the mapped table names in that namespace display. If you do not enter a namespace, all HBase tables across all namespaces are displayed. See Namespaces.
Mapping name A mapping you can use to decode and interpret column values. Click Get mappings for the specified table to populate the drop-down list of available mappings.
Store mapping info in step meta data

Select this option to store mapping information in the step's metadata instead of loading it from HBase at runtime.

Start key value (inclusive) for table scan Specifies the starting key value of a partial scan, including the value entered.
Stop key value (exclusing) for table scan Specifies the stopping key value of a partial scan, excluding the value entered. The start key and stop key fields may be left blank. If the stop key field is left blank, then all rows beginning with and including the start key will be returned.
Scanner row cache size The number of rows to cache each time a fetch request is made. See the Performance considerations section below for more information.