Important: It is a best practice before installing Data Catalog to save a copy of your conf/.env file to save any environment customizations you have made in case the file is overwritten during the installation process. During installation, Data Catalog checks for a PDC_DATA_ENCRYPTION_KEY environment variable in the conf/.env file. If the variable exists, the conf/.env file is retained. However, if the variable does not exist, Data Catalog generates a new .env file containing a PDC_DATA_ENCRYPTION_KEY environment variable. If needed, you can add any custom environment variable settings back in to the new .env file from your saved file.
Perform the following steps to install Data Catalog:
The installation is ready for use after all the Docker containers have successfully started.
Access Data Catalog through your browser (the Chrome browser is recommended) using the hostname name or IP address and confirm that the application is successfully installed and running.
Access Data Storage Optimizer, if installed, through your browser using the hostname or IP address: [hostname or IP]/pdso and confirm that the applications are successfully installed and running.
Note: For new installations, you are redirected to the PDC login page.
Data Catalog provides a set of default users for demonstrating and testing. These default users have the following specific roles assigned:
For more information, see Manage users and permissions in the Administer Pentaho Data Catalog document.
Role | Actions |
---|---|
Admin | A user who can configure the product |
Data User | A user who is interested in leveraging Data Catalog to find data for use for a business operation |
Data Steward | A user who will update and process data in Data Catalog for use for a business operation, including migrating data for Pentaho Data Storage Optimizer |
Business User | A user who needs to view business-specific glossaries and dictionaries |
Business Steward | A user who will maintain business-specific glossaries and dictionaries |
Data Developer | A user who will create and update business rules in Data Catalog or metadata rules in Data Storage Optimizer |
Refer to the installation package for credential details for the default users. This information is found in an encrypted file.
After installing Data Catalog, there may be other components you need to set up, depending on your environment. For more information, see the Administer Pentaho Data Catalog document.Important: For Development and Production environments, it is a best practice to create users upon installation and deprecate these default users.