Support statement for Analyzer on Impala

Try Pentaho Data Integration and Analytics

Version
10.2.x
Audience
anonymous
Part Number
MK-95PDIA000-16

These are the minimum requirements for Analyzer to work with Impala:

  • Pentaho 7.1 or later
  • Impala 1.3.x or later
  • Recommend using Parquet compressed file format for tables in Impala
  • Make sure that the JDBC driver is dropped into the Pentaho Server and Schema Workbench directories. See the Install Pentaho Data Integration and Analytics document for details.
  • Turn off connection pooling in Pentaho Server.
  • In Mondrian schemas, divide dimension tables with high cardinality into several levels
Note: As with any data source, the performance of Pentaho Analyzer on Impala will be dependent upon the data shape, Impala’s configuration, and the types of queries. See the best practice, "Pentaho Analyzer with Impala as a Data Source" located at: https://support.pentaho.com/hc/en-us/articles/208652846 or download the PDF.
There are some compiled Mondrian automated test suite results for Analyzer on Impala with OEM Simba, as well as the community Apache Hive driver: