Set Kettle or Java environment variables in the Pentaho MapReduce job entry

Pentaho Data Integration

Version: 9.3.x
Part Number: MK-95PDIA003-15

Pentaho MapReduce jobs typically run in a distributed fashion, with the mapper, combiner, and reducer running on different nodes. If you need to set a Java or Kettle environment variable for those nodes, such as KETTLE_MAX_JOB_TRACKER_SIZE, set it in the Pentaho MapReduce job entry window.
Note: Values for Kettle environment variables set in the Pentaho MapReduce job entry window override the values of the same variables in the kettle.properties file.
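
The override rule in the note can be pictured with a minimal Java sketch. It is not Pentaho's implementation: the class OverrideSketch, the method resolve, and the values 5000 and 1000 are hypothetical, chosen only to show that a value set in the job entry window wins over the value in kettle.properties.

```java
import java.util.Properties;

// Illustrative only: models the documented precedence, not Pentaho internals.
public class OverrideSketch {

    // Returns the job entry value when one is set; otherwise falls back
    // to the value from kettle.properties (which may be null).
    static String resolve(Properties kettleProperties, Properties jobEntryValues, String name) {
        String fromJobEntry = jobEntryValues.getProperty(name);
        return fromJobEntry != null ? fromJobEntry : kettleProperties.getProperty(name);
    }

    public static void main(String[] args) {
        // Hypothetical value as it might appear in kettle.properties.
        Properties kettleProperties = new Properties();
        kettleProperties.setProperty("KETTLE_MAX_JOB_TRACKER_SIZE", "5000");

        // Hypothetical value typed into the Pentaho MapReduce job entry window.
        Properties jobEntryValues = new Properties();
        jobEntryValues.setProperty("KETTLE_MAX_JOB_TRACKER_SIZE", "1000");

        System.out.println(resolve(kettleProperties, jobEntryValues, "KETTLE_MAX_JOB_TRACKER_SIZE"));
    }
}
```

A variable that is not set in the job entry window keeps whatever value kettle.properties gives it.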

To set Kettle or Java environment variables, complete these steps:

  1. In the PDI client, double-click the Pentaho MapReduce job entry, then click the User Defined tab.
  2. In the Name field, enter the Kettle or Java environment variable you need:
    • For Kettle environment variables, type the name of the variable, like this: KETTLE_SAMPLE_VAR.
    • For Java environment variables, preface the variable name with the java.system. prefix, like this: java.system.SAMPLE_PATH_VAR.
  3. Enter the value of the variable in the Value field.
  4. Click the OK button.
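
The naming convention in step 2 can be sketched in Java as follows. This is an illustration under stated assumptions, not Pentaho's code: the class UserDefinedEntries and its methods are hypothetical, the sample values are made up, and the sketch assumes the java.system. prefix is stripped to obtain the Java variable name.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative only: classifies Name field entries by the documented prefix.
public class UserDefinedEntries {

    // Prefix that marks a Java environment variable in the Name field.
    static final String JAVA_PREFIX = "java.system.";

    // True when the entry name carries the java.system. prefix.
    static boolean isJavaEntry(String name) {
        return name.startsWith(JAVA_PREFIX);
    }

    // Assumption: the target variable name is the entry name without the prefix.
    // Kettle variable names are returned unchanged.
    static String targetName(String name) {
        return isJavaEntry(name) ? name.substring(JAVA_PREFIX.length()) : name;
    }

    public static void main(String[] args) {
        // Hypothetical Name/Value pairs as they might be typed on the User Defined tab.
        Map<String, String> entries = new LinkedHashMap<>();
        entries.put("KETTLE_SAMPLE_VAR", "sample-value");
        entries.put("java.system.SAMPLE_PATH_VAR", "/sample/path");

        for (Map.Entry<String, String> e : entries.entrySet()) {
            String kind = isJavaEntry(e.getKey()) ? "Java" : "Kettle";
            System.out.println(kind + ": " + targetName(e.getKey()) + " = " + e.getValue());
        }
    }
}
```

An entry without the prefix is treated as a Kettle variable, which matches the two cases listed in step 2.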