Pentaho MapReduce jobs
are typically run in distributed fashion, with the mapper, combiner, and reducer run on
different nodes. If you need to set a Java or Kettle environment variable for the
different nodes, such as the KETTLE_MAX_JOB_TRACKER_SIZE, set
them in the Pentaho MapReduce job
entry window.
Note: Values for Kettle
environment variables set in the Pentaho MapReduce window override the Kettle environment variable values in
the kettle.properties file.
To set kettle or java environment variables, complete these steps: