The HBase Input and HBase Output steps can run on Spark with the Adaptive Execution Layer (AEL). These steps can be used with the supported versions of Cloudera Distribution for Hadoop (CDH) and Hortonworks Data Platform (HDP). See the Administer Pentaho Data Integration and Analytics document for the versions that are supported with AEL.

To read data from or write data to HBase, you must have an HBase target table on the cluster. If one does not exist, you can create one using HBase shell commands.
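As a minimal sketch of creating a target table, the following script writes a few HBase shell commands to a file and runs them if the `hbase` client is available on the machine. The table name `pentaho_demo`, the column family `cf`, and the file name `create_table.hbase` are hypothetical placeholders, not names required by Pentaho; substitute the values that match your transformation.

```shell
# Write the HBase shell commands to a command file.
# 'pentaho_demo' (table) and 'cf' (column family) are hypothetical names.
cat > create_table.hbase <<'EOF'
create 'pentaho_demo', 'cf'
describe 'pentaho_demo'
exit
EOF

# Run the command file only if the hbase client is on the PATH
# (typically on a cluster edge node); otherwise just report it.
if command -v hbase >/dev/null 2>&1; then
  hbase shell ./create_table.hbase
else
  echo "hbase client not found; run create_table.hbase on a cluster node"
fi
```

You can also type the same `create` and `describe` commands directly at an interactive `hbase shell` prompt instead of using a command file.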
Note: Due to Cloudera limitations, the HBase Input step fails when Spark runs in YARN mode with Kerberos.
This article explains how you can set up the Pentaho Server to run these steps.