Data Sources

Using Pentaho Data Storage Optimizer

Version
10.1.x
Part Number
MK-95PDSO000-02

With Data Storage Optimizer, you can tier and purge data from Hadoop Distributed File System (HDFS) clusters, file shares, and object stores. To process data from these sources, Data Storage Optimizer uses Data Catalog’s data source definitions, which hold the connection information for your sources. The following data sources are supported in Data Storage Optimizer:

Type Data source
File share
  • Network File System (NFS), including local file and file-sharing network systems. NFS support is vendor-agnostic and includes Hitachi Network Attached Storage (HNAS), NetApp, Dell/EMC, and any other vendor that uses the NFS protocol.
  • Server Message Block/Common Internet File System (SMB/CIFS), including local file and file-sharing network systems. SMB support is vendor-agnostic and includes Hitachi Network Attached Storage (HNAS), NetApp, Dell/EMC, and any other vendor that uses the SMB/CIFS protocol.
  • Cloud, including OneDrive and SharePoint.
  • Cloud Network Attached Storage (NAS): Azure Blob via NFS.
Object store
  • Amazon Web Services (AWS) S3.
  • Any S3-compatible platform.
  • Hitachi Content Platform (HCP).
Block storage
  • Hadoop Distributed File System (HDFS): Cloudera and Hortonworks data platform distributions.
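
If you keep your own inventory of sources outside the product, the support list above can be captured as a small lookup table. The following is only an illustrative sketch: the names `SUPPORTED_SOURCES` and `source_type` and the short labels used as entries are assumptions made for this example. They are not part of Data Storage Optimizer or Data Catalog.

```python
from typing import Optional

# Hypothetical lookup table transcribed from the support list above.
# The labels are illustrative shorthand, not identifiers used by any Pentaho API.
SUPPORTED_SOURCES = {
    "File share": ["NFS", "SMB/CIFS", "OneDrive", "SharePoint", "Azure Blob via NFS"],
    "Object store": ["AWS S3", "S3-compatible", "HCP"],
    "Block storage": ["HDFS"],
}


def source_type(data_source: str) -> Optional[str]:
    """Return the type a data source is listed under, or None if it is not listed."""
    wanted = data_source.strip().lower()
    for type_name, sources in SUPPORTED_SOURCES.items():
        # Compare case-insensitively so "hdfs" and "HDFS" are treated alike.
        if any(wanted == source.lower() for source in sources):
            return type_name
    return None
```

For example, `source_type("HCP")` returns the type that HCP is listed under, and an unlisted label such as `"FTP"` returns `None`.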

Click Imported on the Data Sources card to open the Data Sources page.


Data Sources page

You can use the Data Sources page to view existing data source names, status, types, and usages, and to import data sources into Data Storage Optimizer. See the following topics when working with data sources: