Assign data temperatures

Using Pentaho Data Storage Optimizer

Version
10.1.x
Part Number
MK-95PDSO000-02

Use Check Data Temperature to check the domain and classification of the files in the data source that you want to optimize.

To perform rules-based tiering or purging, Data Storage Optimizer requires a specific domain name and business terms within a hierarchy. If the domain and terms do not exist, you must create them in Data Catalog:
  • Create a domain named Data Temperature.
  • Create business terms within that domain. Data temperature terms identify the usage and age of your data. Consider the example data temperatures in the table below when creating or assigning terms.
Note: You must use the Data Temperature domain, but you can choose different terms to better fit your environment or workflow. A category is not required.
Data temperature    Criteria
Boiling             Regularly searched, regularly read, created less than 180 days ago, and last modified less than 30 days ago.
Hot                 Frequently searched, frequently read, created less than 366 days ago, and last modified less than 90 days ago.
Warm                Frequently searched, less frequently read, created less than 366 days ago, and last modified less than 180 days ago.
Cold                Rarely searched, rarely read, created more than 366 days ago, and last modified more than 366 days ago.
Frozen              Never searched, never read, created more than 732 days ago, and last modified more than 732 days ago.
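
The age thresholds in the example criteria can be read as a simple decision rule. The following Python sketch is illustrative only and is not part of Data Storage Optimizer or Data Catalog: the function name classify_temperature and its parameters are hypothetical, and it considers only the creation and last-modified ages, ignoring how often a file is searched or read. Files whose ages match none of the example rows are left unclassified.

```python
from typing import Optional


def classify_temperature(created_days_ago: int, modified_days_ago: int) -> Optional[str]:
    """Map file ages (in days) to one of the example data temperature terms.

    Hypothetical helper: uses only the age thresholds from the example
    criteria and ignores search and read frequency.
    Returns None when no example row matches.
    """
    c, m = created_days_ago, modified_days_ago

    # Most recent data first: each later rule has looser limits.
    if c < 180 and m < 30:
        return "Boiling"
    if c < 366 and m < 90:
        return "Hot"
    if c < 366 and m < 180:
        return "Warm"

    # Check Frozen before Cold, because every file that meets the
    # Frozen thresholds also meets the Cold thresholds.
    if c > 732 and m > 732:
        return "Frozen"
    if c > 366 and m > 366:
        return "Cold"

    # Ages fall between the example rows (for example, created 400 days
    # ago but modified 100 days ago).
    return None
```

Because the example rows do not cover every combination of ages, you may want to adjust the thresholds or add terms so that every file in your environment receives a classification.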

Click Check Data Temperature to open the Business Glossary in Data Catalog. To add a domain or term, click Add New. See Manage Business Glossary in Data Catalog for details.