Big Data Sources: Details

Try Pentaho Data Integration and Analytics

Version
9.4.x
Audience
anonymous
Part Number
MK-95PDIA000-12

This table shows the Big Data sources that are compatible with specific Pentaho tools.

Data Source Versions Analyzer PIR/PDD Pentaho Reporting DSW PDIServer/Client PRD PSW PME
Amazon EMRa 5.21, 5.24, 5.36 Yes Yes No No Yes Yes No Yes
Cloudera Data Platform (CDP) Private Cloud 7.1.x (for job execution) No No No No Yes Yes No Yes
via Impalab (as data source) Yes Yes Yes Yes Yes Yes No Yes
via Hive3c (as data source) No Yes Yes Yes Yes Yes No Yes
Datastax 4.6, 4.8 No No No No Yes No No No
Google BigQuery 1.2.25d Yes Yes Yes Yes Yes Yes Yes Yes
Google Dataproce (for job execution) 1.4, 2.2f No No No No Yes Yes No No
via Hive2 and Google BigQuery (as data source) Yes Yes Yes Yes Yes Yes No Yes
Greenplum 4.2, 4.3 Yes Yes Yes Yes Yes Yes Yes Yes
Microsoft Azure HDInsight 4.0 Yes Yes No No Yes No No Yes
MongoDB 4.4 No No Yes No Yes Yes No No
Netezza 7.1, 7.2 Yes Yes Yes Yes Yes Yes Yes Yes
SAP HANA SPS No No No No Yes No No No
Teradata 16.20 Yes Yes Yes Yes Yes Yes Yes Yes
Vertica 10 & 11 Yes Yes Yes Yes Yes Yes Yes Yes
Notes: A generic Apache Hadoop driver is included in the Pentaho distribution for version 9.4: Other supported drivers can be downloaded from the Hitachi Vantara Lumada and Pentaho Support Portal.

a Use the EMR 5.21 driver for your EMR 5.24 or EMR 5.36 cluster. The EMR 5.21 driver is certified to work for EMR 5.24.

b You must have the current version of the Pentaho release to use the CDP 7.1.4 driver. The CDP 7.1.4 driver requires the Impala JDBC Connector 2.6.4 Cloudera driver.

c Hive3 as a data source for CDP also supports Hive LLAP, and Hive3 on Tez.

d The Simba driver required for Google BigQuery is the JDBC 4.2-compatible version. See https://storage.googleapis.com/simba-bq-release/jdbc/SimbaJDBCDriverforGoogleBigQuery42_1.2.25.1029.zip.

e HBase is not supported with Google Dataproc.

f Use the Google Dataproc 1.8 driver for your Google Dataproc 2.2 cluster. The Google Dataproc 1.8 driver is certified to work for Google Dataproc 2.2.