Architecture

Using Pentaho Data Storage Optimizer

Version
10.1.x
Part Number
MK-95PDSO000-02

Data Storage Optimizer is a data optimization product that integrates with Pentaho Data Catalog and combines metadata management, a rules engine, a scheduler, and a Virtual File Server for seamless file tiering and purging. Supported storage types include Hadoop Distributed File System (HDFS) DataNodes; cloud storage, including SharePoint, OneDrive, and Azure Blob; Network Attached Storage (NAS) using the Server Message Block (SMB) or Network File System (NFS) protocols; and S3 storage in Hitachi Content Platform (HCP), AWS, or any other S3-compliant datastore.
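To make the idea of rule-driven tiering and purging concrete, the following is a minimal conceptual sketch in Python. It is not Data Storage Optimizer code or its API: the names `FileMetadata`, `AgeRule`, `Action`, and `plan`, and the age thresholds used, are hypothetical and exist only to illustrate how a rules engine might map file metadata to a keep, tier, or purge decision.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from enum import Enum


class Action(Enum):
    """Possible outcomes of evaluating a rule (hypothetical names)."""
    KEEP = "keep"
    TIER = "tier"
    PURGE = "purge"


@dataclass(frozen=True)
class FileMetadata:
    """Minimal stand-in for the metadata a catalog might hold per file."""
    path: str
    last_accessed: datetime


@dataclass(frozen=True)
class AgeRule:
    """Hypothetical rule: act on a file based on how long it has been idle."""
    tier_after: timedelta
    purge_after: timedelta

    def evaluate(self, meta: FileMetadata, now: datetime) -> Action:
        idle = now - meta.last_accessed
        # Check the purge threshold first so it takes precedence over tiering.
        if idle >= self.purge_after:
            return Action.PURGE
        if idle >= self.tier_after:
            return Action.TIER
        return Action.KEEP


def plan(files, rule, now):
    """Return a mapping of file path to the action the rule selects."""
    return {f.path: rule.evaluate(f, now) for f in files}
```

In this sketch, a scheduler would call `plan` periodically with the current time, and a separate component would carry out the resulting actions. How the product actually models rules, schedules, and storage targets is not described in this section.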

As shown in the following figure, the system architecture for Data Storage Optimizer is conceptually defined by the following structures and behaviors.


System architecture