PDI job entries

Pentaho Data Integration

Version
9.3.x
Audience
anonymous
Part Number
MK-95PDIA003-15

Entries extend and expand the functionality of Pentaho Data Integration (PDI) entries. You can use the following steps in PDI.

Entries: A - F

Name Category Description
Abort job Utility Abort the job.
Add filenames to result File management Add filenames to result.
Amazon EMR Job Executor Big Data Execute MapReduce jobs in Amazon EMR.
Amazon Hive Job Executor Big Data Execute Hive jobs in Amazon EMR.
Bulk load from MySQL into file Bulk loading Load from a MySQL table into a file.
Bulk load into Amazon Redshift Bulk loading Bulk loads files located in S3 buckets into an Amazon Redshift database.
Bulk load into Azure SQL DB Bulk loading Load data into an Azure SQL database from Azure Data Lake Storage.
Bulk load into MSSQL Bulk loading Load data from a file into a MSSQL table.
Bulk load into MySQL Bulk loading Load data from a file into a MySQL table.
Bulk load into Snowflake Bulk loading Load data from a file into a Snowflake data warehouse
Check Db connections Conditions Check if we can connect to one or several databases.
Check files locked Conditions Check if one or several files are locked by another process.
Check if a folder is empty Conditions Check if a folder is empty.
Check if connected to repository Repository Return true if we are connected to a repository.
Check if XML file is well formed XML Check if one or several XML files is/are well formed.
Check webservice availability Conditions Check if a webservice is available.
Checks if files exist Conditions Checks if files exist.
Columns exist in a table Conditions Check if one or several columns exist in a table on a specified connection.
Compare folders File management Compare two folders (or two files).
Convert file between Windows and Unix File management Convert file content between Windows and Unix. Converting to Unix will replace CRLF (carriage return and line feed) by LF (line feed).
Copy Files File management Copy files.
Copy or Move result filenames File management Copy or move result filenames (since version 5.0, this job entry has been renamed to Process result filenames and it handles Delete as well).
Create a folder File management Create a folder.
Create file File management Create an empty file.
Create Snowflake warehouse Service management Create a Snowflake virtual warehouse.
Decrypt files with PGP File encryption Decrypt files encrypted with PGP (Pretty Good Privacy). This job entry needs GnuPG to work properly.
Delete file File management Delete a file.
Delete filenames from result File management Delete filenames from result.
Delete files File management Delete files.
Delete folders File management Delete specified folders. If a the folder contains files, PDI will delete them all.
Delete Snowflake warehouse Service management Drops a Snowflake warehouse.
Display Msgbox Info Utility Display a simple message information box.
DTD Validator XML Verify if an XML file corresponds to a certain structure or format.
Dummy General Use the Dummy job entry to do nothing in a job.
Encrypt files with PGP File encryption Encrypt files with PGP (Pretty Good Privacy). This job entry needs GnuPG to work properly.
Evaluate files metrics Conditions Evaluate files size or files count.
Evaluate rows number in a table Conditions Evaluate the content of a table. You can also specify an SQL query.
Example job (deprecated) Deprecated Is an example test job entry for a plugin.
Export repository to XML file Repository Export repository to XML file.
File Compare File management Compare two files.
File exists (Job Entry) Conditions Check if a file exists.
FTP Delete File transfer Delete files in a remote host.

Entries: G - L

Name Category Description
Get a file with FTP File transfer Get files using FTP (File Transfer Protocol).
Get a file with FTPS File transfer Get a file with FTP secure.
Get a file with SFTP File transfer Get files using SFTP (Secure File Transfer Protocol).
Get mails (POP3/IMAP) Mail Get mails (POP3/IMAP) server and save into a local folder.
Google BigQuery loader Big Data Load data into Google BigQuery from a Google Cloud Storage account.
Hadoop Copy Files Big Data Copies files in a Hadoop cluster from one location to another.
Hadoop job executor Big Data Execute a map/reduce job contained in a jar file.
HL7 MLLP Acknowledge Utility Acknowledge HL7 messages.
HL7 MLLP Input Utility Read data from HL7 data streams within a transformation.
HTTP File management Get or upload a file using HTTP (Hypertext Transfer Protocol).
JavaScript Scripting Evaluate the result of the execution of a previous job entry.
Job (job entry) General Execute a job.

Entries: M - R

Name Category Description
Mail Mail Send an email.
Mail validator Mail Check the validity of an email address (SNMP trap to a target host.)
Modify Snowflake warehouse Service management Modify a Snowflake virtual warehouse.
Move Files File management Move files.
MS Access bulk load (deprecated) Deprecated Load data into a Microsoft Access table from a CSV file format. Replaced by Microsoft Access Output Step.
Oozie Job Executor Big Data Execute Oozie workflows.
Palo cube create (deprecated) Deprecated Create a cube on a Palo server.
Palo cube delete (deprecated) Deprecated Delete a cube on a Palo server.
Pentaho MapReduce Big Data Execute transformation-based MapReduce jobs in Hadoop.
Pig Script Executor Big Data Execute a Pig script on a Hadoop cluster.
Ping a host Utility Ping a host.
Put a file with FTP File transfer Put a file with FTP.
Process result filenames File management Copy, move, or delete result filenames.
Put a file with SFTP File transfer Put files using SFTP (Secure File Transfer Protocol).

Entries: S - Z

Name Category Description
Send information using Syslog Utility Send information to another server using the Syslog protocol.
Send Nagios passive check Utility Send Nagios passive checks.
Send SNMP trap Utility Send SNMP trap to a target host.
Set variables General Set one or several variables.
Shell Scripting Execute a shell script.
Simple evaluation Conditions Evaluate one field or variable.
Spark Submit Big Data Submit Spark jobs to Hadoop clusters.
SQL Scripting Executes SQL on a certain database connection.
Sqoop Export Big Data Export data from the Hadoop Distributed File System (HDFS) into a relational database (RDBMS) using Apache Sqoop.
Sqoop Import Big Data Import data from a relational database (RDBMS) into the Hadoop Distributed File System (HDFS) using Apache Sqoop.
SSH2 Get (deprecated) Deprecated Get files using SSH2 (Deprecated in 5.0 in favor of the SFTP job entry).
SSH2 Put (deprecated) Deprecated Put files in a remote host using SSH2 (Deprecated in 5.0 in favor of the SFTP job entry).
Start General Defines the starting point for job execution. Every job must have one (and only one) Start.
Start a PDI Cluster on YARN Big Data Start a PDI Cluster on YARN.
Stop a PDI Cluster on YARN Big Data Stop a PDI Cluster on YARN.
Start Snowflake warehouse Service management Resumes a Snowflake warehouse.
Stop Snowflake warehouse Service management Suspends a Snowflake warehouse.
Success General Clear any error state encountered in a job and forces it to a success state.
Table exists Conditions Check if a table exists on a database connection.
Talend job execution (deprecated) Deprecated Execute an exported Talend job.
Transformation (job entry) General Run a transformation.
Truncate tables Utility Truncate one or several tables.
Unzip file File management Unzip file in a target folder.
Upload files to FTPS File transfer Upload files to a FTP secure.
Verify file signature with PGP File encryption Verify file signature with PGP (Pretty Good Privacy). This job entry needs GnuPG to work properly.
Wait for Conditions Wait for a delay.
Wait for file File management Wait for a file.
Wait for SQL Utility Scan a database and success when a specified condition on returned rows is true.
Write to file File management Write text content to file.
Write To Log Utility Write message to log.
XSD Validator XML Perform an XSD validation against data in a file or in an input field.
XSL Transformation XML Make an XSL transformation.
Zip file File management Zip files from a directory and process files.