Settings tab

Pentaho Data Integration

Version
9.3.x
Audience
anonymous
Part Number
MK-95PDIA003-15


Settings tab, Hadoop Copy Files

Option Description
Include subfolders Select to copy all subdirectories in the chosen directory.
Destination is a file Select to specify the destination is a file.
Copy empty folders Select to copy empty directories. The Include Subfolders option must be selected for this option to be valid.
Create destination folder Select to create the specified destination directory if it does not exist.
Replace existing files Select to overwrite duplicate files in the destination directory.
Remove source files Select to remove the source files after copying them. This is equivalent to a move procedure.
Copy previous results to arguments Select to use previous step results as your sources and destinations.
Add files to result files name Select to create a list of files that were copied in this step.

If you are not using Kerberos security, this step sends the username of the logged-in user when copying the files regardless of the username entered in the connect field. To change the username, set the environment variable HADOOP_USER_NAME to the username you want to use. You can set the username by changing the OPT variable in the spoon.bat or spoon.sh file as shown in the following example:

OPT="$OPT .... -DHADOOP_USER_NAME=HadoopNameToSpoof"