Installing Data Storage Optimizer offline

Using Pentaho Data Storage Optimizer

You must have root privileges, or the permissions needed to run Docker, to complete the installation process.
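The prerequisite above can be checked before you begin. The following is a minimal sketch, not part of the release package: the function name `can_run_docker` is hypothetical, and it assumes that a successful `docker info` call means the current user is allowed to use the Docker daemon.

```shell
#!/bin/sh
# Sketch: report whether the current user is likely able to run Docker.
# can_run_docker is a hypothetical helper, not a script shipped with the product.
can_run_docker() {
    # The root user (UID 0) can always run Docker.
    if [ "$(id -u)" -eq 0 ]; then
        echo "running as root"
        return 0
    fi
    # A non-root user needs access to the Docker daemon; `docker info`
    # fails when the daemon is unreachable or permission is denied.
    if command -v docker >/dev/null 2>&1 && docker info >/dev/null 2>&1; then
        echo "current user can reach the Docker daemon"
        return 0
    fi
    echo "no root privileges and no access to the Docker daemon" >&2
    return 1
}
```

For example, run `can_run_docker || exit 1` at the top of your own wrapper script to stop early when the permissions are missing.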

Perform the following steps to install Data Storage Optimizer into Data Catalog:

  1. Open a terminal window on your dedicated Data Catalog deployment server.
  2. Save the Data Storage Optimizer release package on the Data Catalog server.
  3. Extract the files from the release package to the /opt directory using the following command:
    tar -xvf [name of release package].tar.gz -C /opt
  4. Load the required installation images that are packaged in the vendor directory into Docker using the following command:
    cd /opt/pentaho/pdc-docker-deployment
    ./load-images
  5. On a first-time installation, or if the following message appears, respond to the prompt as described below:
    GLOBAL_SERVER_HOST_NAME env is not set, please select an environment variable value from the list or type your own:
    1.	IP address
    2.	Hostname
    3.	Hostname.localhost.localdomain
    4.	Other 
    #?    1

    Set the GLOBAL_SERVER_HOST_NAME variable to the hostname or IP address of the server on which Data Catalog is being deployed. To set the variable, type the number of the option that you want to use, and then press Enter.

    In the example above, the user selected 1, so the script sets the GLOBAL_SERVER_HOST_NAME variable in the conf/.env file to the IP address.
  6. Start all the Docker containers using the following command:
    sh up
    The installation script uses the packaged Docker images for the Data Storage Optimizer release to create and run Docker containers on your dedicated server.
    The installation finishes when each Docker container has been successfully started.
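Step 5 above notes that the script records the selected value in the conf/.env file. The following is a minimal sketch for reading that value back after the prompt. The function name `get_server_host_name` is hypothetical, and the sketch assumes that the file uses plain `KEY=value` lines and that conf/ sits under /opt/pentaho/pdc-docker-deployment; neither detail is confirmed by this guide.

```shell
#!/bin/sh
# Sketch: print the value of GLOBAL_SERVER_HOST_NAME from an env file.
# Assumes KEY=value lines (the usual .env convention); quoting is not stripped.
get_server_host_name() {
    env_file=$1
    # Fail if the file does not exist.
    [ -f "$env_file" ] || return 1
    # Keep the text after the '='; if the key appears more than once,
    # use the last occurrence.
    value=$(sed -n 's/^GLOBAL_SERVER_HOST_NAME=//p' "$env_file" | tail -n 1)
    # Fail if the variable is missing or empty.
    [ -n "$value" ] || return 1
    printf '%s\n' "$value"
}
```

For example, `get_server_host_name /opt/pentaho/pdc-docker-deployment/conf/.env` prints the stored hostname or IP address, and returns a non-zero status if the variable has not been set yet.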

Access Data Catalog through your browser (the Chrome browser is recommended) using the hostname or IP address of the server, and then access Data Storage Optimizer at the following address:

[hostname or IP address]/pdso

Confirm that the application is successfully installed and running.
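In addition to checking in the browser, the address above can be probed from the command line. The following is a minimal sketch under stated assumptions: the function names `pdso_url` and `pdso_is_up` are hypothetical, the `https` scheme is a guess (pass `http` as the second argument if your deployment uses it), and `-k` skips certificate verification, which is only appropriate for a quick check on a trusted network.

```shell
#!/bin/sh
# Sketch: build the Data Storage Optimizer URL for a host.
# The scheme defaults to https, which is an assumption about the deployment.
pdso_url() {
    host=$1
    scheme=${2:-https}
    printf '%s://%s/pdso\n' "$scheme" "$host"
}

# Sketch: return 0 if the URL answers with an HTTP status from 200 to 399.
# -L follows redirects (for example, to a login page), -k skips TLS
# verification, and -w '%{http_code}' prints only the final status code.
pdso_is_up() {
    code=$(curl -k -s -L -o /dev/null -w '%{http_code}' "$(pdso_url "$@")") || return 1
    [ "$code" -ge 200 ] && [ "$code" -lt 400 ]
}
```

For example, `pdso_is_up 10.0.0.5 && echo "Data Storage Optimizer is responding"` reports success only when the server answers. Running `docker ps` on the deployment server lists the containers and their status if the check fails.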

Note: For new installations, you are redirected to the Data Catalog (PDC) login page.
To facilitate initial validation, especially in sandbox and demo environments, a set of users is created by default. These users are assigned specific roles. For more information, see Manage users and permissions in the Administer Pentaho Data Catalog document.
Role              Actions
Admin             A user who is able to configure the product
Data User         A user who is interested in leveraging Data Catalog to find data for use for a business operation
Data Steward      A user who will update and process data in Data Catalog for use for a business operation, including migrating data for Pentaho Data Storage Optimizer
Business User     A user who needs to view business-specific glossaries and dictionaries
Business Steward  A user who will maintain business-specific glossaries and dictionaries
Data Developer    A user who will create and update business rules in Data Catalog or metadata rules in Data Storage Optimizer

Refer to the installation package for credential details for the default users. This information is found in an encrypted file.

For development and production environments, it is a best practice to create your own users upon installation and deprecate these default users.