To use the HBase Input and HBase Output steps with EMR 5.21, you must set the following Spark parameter:
spark.hadoop.validateOutputSpecs=false
You can set the parameter using any of these methods:
- In the properties file
- In Transformation properties
- As an environment variable in PDI
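As an illustration of the first method, the following minimal Python sketch adds or updates the parameter in the text of a properties file. The helper name ensure_property and the sample file contents are hypothetical and are not part of PDI. The sketch assumes simple key=value lines only and does not handle the ":" separator or line continuations that properties files also allow.

```python
KEY = "spark.hadoop.validateOutputSpecs"


def ensure_property(text: str, key: str, value: str) -> str:
    """Return properties text with key set to value (hypothetical helper).

    Assumes plain key=value lines; comments and blank lines are kept as-is.
    """
    entry = f"{key}={value}"
    lines = text.splitlines()
    replaced = False
    for i, line in enumerate(lines):
        stripped = line.strip()
        # Leave blank lines and comment lines untouched.
        if not stripped or stripped.startswith(("#", "!")):
            continue
        name = stripped.split("=", 1)[0].strip()
        if name == key:
            # Overwrite an existing entry for this key.
            lines[i] = entry
            replaced = True
    if not replaced:
        # Key not present yet, so append it at the end.
        lines.append(entry)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical existing contents of a properties file.
    sample = "# existing settings\nsome.other.setting=1\n"
    print(ensure_property(sample, KEY, "false"), end="")
```

Running the helper a second time on its own output leaves the text unchanged, so it is safe to apply repeatedly.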
For more information about the properties file and processing Spark parameters, see the Administer Pentaho Data Integration and Analytics document.