Before you can add a named connection to a cluster, you must install a driver for the vendor and version of the Hadoop cluster that you are connecting to. Perform the following steps to install a driver for the PDI client:
-
In the PDI client, select the View tab of your transformation or job.
-
Right-click the Hadoop clusters folder and click Add driver.
The
Add driver dialog box
appears.

-
Click Browse
The Choose File to Upload dialog box appears.
-
Navigate to the location where you downloaded your driver file.
-
Select the driver (.kar file) you want to add, click
Open, and then click
Next.
The selected file name appears in the
Browse text field. The vendor distribution files
contain their abbreviations in the
.kar file names as shown
below:
- Amazon EMR (emr)
- Azure HDInsight (hdi)
- Cloudera (cdh)
- Cloudera Data Platform (cdp)
- Google Dataproc (dataproc)
- Hortonworks (hdp)
-
Click Next.
The Congratulations dialog box
appears, notifying you that you must restart the Pentaho Server and the
PDI client. The Driver field in the New cluster and Import
cluster dialog boxes now displays the driver you have
added.