This table shows the Big Data sources that are compatible with specific Pentaho tools.
Data Source | Versions | Analyzer | PIR/PDD | Pentaho Reporting | DSW | PDIServer/Client | PRD | PSW | PME |
---|---|---|---|---|---|---|---|---|---|
Amazon EMRa | 5.21, 5.24, 5.36 | Yes | Yes | No | No | Yes | Yes | No | Yes |
Cloudera Data Platform (CDP) Private Cloud | 7.1.x (for job execution) | No | No | No | No | Yes | Yes | No | Yes |
via Impalab (as data source) | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes | |
via Hive3c (as data source) | No | Yes | Yes | Yes | Yes | Yes | No | Yes | |
Datastax | 4.6, 4.8 | No | No | No | No | Yes | No | No | No |
Google BigQuery | 1.2.25d | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Google Dataproce (for job execution) | 1.4, 2.2f | No | No | No | No | Yes | Yes | No | No |
via Hive2 and Google BigQuery (as data source) | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes | |
Greenplum | 4.2, 4.3 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Microsoft Azure HDInsight | 4.0 | Yes | Yes | No | No | Yes | No | No | Yes |
MongoDB | 4.4 | No | No | Yes | No | Yes | Yes | No | No |
Netezza | 7.1, 7.2 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
SAP HANA | SPS | No | No | No | No | Yes | No | No | No |
Teradata | 16.20 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Vertica | 10 & 11 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
Notes: A generic Apache Hadoop driver is included in the Pentaho distribution for version 9.4: Other supported drivers can be downloaded from the Hitachi Vantara Lumada and Pentaho Support Portal. a Use the EMR 5.21 driver for your EMR 5.24 or EMR 5.36 cluster. The EMR 5.21 driver is certified to work for EMR 5.24. b You must have the current version of the Pentaho release to use the CDP 7.1.4 driver. The CDP 7.1.4 driver requires the Impala JDBC Connector 2.6.4 Cloudera driver. c Hive3 as a data source for CDP also supports Hive LLAP, and Hive3 on Tez. d The Simba driver required for Google BigQuery is the JDBC 4.2-compatible version. See https://storage.googleapis.com/simba-bq-release/jdbc/SimbaJDBCDriverforGoogleBigQuery42_1.2.25.1029.zip. e HBase is not supported with Google Dataproc. f Use the Google Dataproc 1.8 driver for your Google Dataproc 2.2 cluster. The Google Dataproc 1.8 driver is certified to work for Google Dataproc 2.2. |