Setup tab

Pentaho Data Integration

Version
9.3.x
Audience
anonymous
Part Number
MK-95PDIA003-15


Kafka Producer step

Fill in the following fields.

Option Description
Connection
Select a connection type:
Direct
Specify the Bootstrap servers from which you want to receive the Kafka streaming data.
Cluster
Specify the Hadoop cluster configuration from which you want to retrieve the Kafka streaming data. In a Hadoop cluster configuration, you can specify information like host names and ports for HDFS, Job Tracker, security, and other big data cluster components. Multiple servers can be specified if these are part of the same cluster. For information on Hadoop clusters, see Connecting to a Hadoop cluster with the PDI client.
Client ID The unique Client identifier, used to identify and set up a durable connection path to the server to make requests and to distinguish between different clients.
Topic The category to which records are published.
Key Field In Kafka, all messages can be keyed, allowing for messages to be distributed to partitions based on their keys in a default routing scheme. If no key is present, messages are randomly distributed to partitions.
Message Field The individual record contained in a topic.