Jobs are used to coordinate ETL activities such as:
- Defining the flow and dependencies for what order transformations should be run.
- Preparing for execution by checking conditions such as, "Is my source file available?" or "Does a table exist?"
- Performing bulk load database operations.
- File management such as posting or retrieving files using FTP, copying files and deleting files.
- Sending success or failure notifications through email.
For this exercise, imagine that an external system is responsible for placing your sales_data.csv input in its source location every Saturday night at 9 p.m. You want to create a job that will check to see that the file has arrived and run your transformation to load the records into the database. In a subsequent exercise, you will schedule the job to be run every Sunday morning at 9 a.m.
To complete this exercise, you must have completed the exercises in the PDI Transformation Tutorial.