Step 4: Clean the data

Try Pentaho Data Integration and Analytics

Version
9.3.x
Part Number
MK-95PDIA000-11

After completing Step 3: Resolve missing data, you can further cleanse and categorize the data into buckets before loading it into a relational database. In this section, you will cleanse the COUNTRY field data by mapping United States to USA using the Value mapper step. Cleaning the data ensures that the country is represented by a single value, USA, rather than by two different spellings.
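To make the idea concrete, the mapping can be sketched outside of PDI. The Python below is not PDI code and is not how the Value mapper step is implemented; it only illustrates the source-to-target replacement on hypothetical sample rows. The names `COUNTRY_MAP` and `map_country` are assumptions made for this illustration, and in this sketch values without a mapping pass through unchanged.

```python
# Hypothetical mapping table: source value -> target value.
COUNTRY_MAP = {"United States": "USA"}


def map_country(rows, mapping=COUNTRY_MAP):
    """Return copies of the rows with COUNTRY replaced according to the mapping.

    Values that have no entry in the mapping are kept as they are.
    """
    return [
        {**row, "COUNTRY": mapping.get(row["COUNTRY"], row["COUNTRY"])}
        for row in rows
    ]


# Illustrative sample rows (not taken from the tutorial data set).
rows = [
    {"COUNTRY": "United States"},
    {"COUNTRY": "USA"},
    {"COUNTRY": "France"},
]

cleaned = map_country(rows)
print(sorted({row["COUNTRY"] for row in cleaned}))  # ['France', 'USA']
```

After the mapping, both spellings collapse into the single value USA, which is the effect you configure in the Value mapper step.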

In addition, you will learn how to use the Number range step to categorize the SALES data into small, medium, and large buckets. You will insert these cleaning and categorizing steps into your transformation just before the Write to Database step on the canvas.
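The bucketing logic can be sketched in the same way. Again, this is Python rather than PDI, and the boundaries used here (3000 and 7000) are assumed values chosen only for illustration, not the values the tutorial uses. The sketch also assumes that each range includes its lower bound and excludes its upper bound; `RANGES` and `bucket_sales` are hypothetical names.

```python
# Assumed ranges: (lower bound inclusive, upper bound exclusive, label).
# None means the range is open on that side.
RANGES = [
    (None, 3000.0, "Small"),
    (3000.0, 7000.0, "Medium"),
    (7000.0, None, "Large"),
]


def bucket_sales(value, ranges=RANGES, default="unknown"):
    """Return the label of the first range that contains the value."""
    for lower, upper, label in ranges:
        above_lower = lower is None or value >= lower
        below_upper = upper is None or value < upper
        if above_lower and below_upper:
            return label
    return default  # Used only if no range matches.


print([bucket_sales(v) for v in (1500.0, 3000.0, 9999.99)])
# ['Small', 'Medium', 'Large']
```

A value that falls exactly on a boundary, such as 3000.0, lands in the higher bucket under the inclusive-lower-bound assumption, so it is worth checking boundary values when you define your own ranges.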