Install a driver for the PDI client

Pentaho Data Integration

Version
9.3.x
Audience
anonymous
Part Number
MK-95PDIA003-15
Before you can add a named connection to a cluster, you must install a driver for the vendor and version of the Hadoop cluster that you are connecting to. Perform the following steps to install a driver for the PDI client:
This task assumes that you are not using the default Hadoop driver and that you have downloaded your vendor-specific driver from the Hitachi Vantara Lumada and Pentaho Support Portal.
  1. In the PDI client, select the View tab of your transformation or job.
  2. Right-click the Hadoop clusters folder and click Add driver.
    The Add driver dialog box appears.
    Add driver dialog box
  3. Click Browse
    The Choose File to Upload dialog box appears.
  4. Navigate to the location where you downloaded your driver file.
  5. Select the driver (.kar file) you want to add, click Open, and then click Next.
    The selected file name appears in the Browse text field. The vendor distribution files contain their abbreviations in the .kar file names as shown below:
    • Amazon EMR (emr)
    • Azure HDInsight (hdi)
    • Cloudera (cdh)
    • Cloudera Data Platform (cdp)
    • Google Dataproc (dataproc)
    • Hortonworks (hdp)
  6. Click Next.
    The Congratulations dialog box appears, notifying you that you must restart the Pentaho Server and the PDI client. The Driver field in the New cluster and Import cluster dialog boxes now displays the driver you have added.