Avro Output

Pentaho Data Integration

Version
9.3.x
Part Number
MK-95PDIA003-15

The Avro Output step serializes data from the PDI data stream into the Avro binary or JSON format, then writes it to a file. Apache Avro is a data serialization system. Avro binary data does not carry field names or types with each record, so a schema is required to decode the binary and extract the data.
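To illustrate why a schema is needed for decoding, the following minimal sketch hand-encodes one record using the Avro binary rules for the long and string types (zigzag variable-length integers, and a length-prefixed UTF-8 string). It is not how the PDI step is implemented, and it omits the container-file header that real Avro files include. The field names id and name and the helper function names are hypothetical.

```python
# Sketch of Avro binary encoding for two primitive types.
# Assumption: only "long" and "string" fields, in a fixed schema order.

def encode_long(n: int) -> bytes:
    """Zigzag-encode a 64-bit integer, then write it as a base-128 varint."""
    z = (n << 1) ^ (n >> 63)
    out = bytearray()
    while z & ~0x7F:
        out.append((z & 0x7F) | 0x80)
        z >>= 7
    out.append(z)
    return bytes(out)

def decode_long(buf: bytes, pos: int):
    """Read one varint starting at pos; return (value, next position)."""
    shift, z = 0, 0
    while True:
        b = buf[pos]
        pos += 1
        z |= (b & 0x7F) << shift
        if not b & 0x80:
            break
        shift += 7
    return (z >> 1) ^ -(z & 1), pos

def encode_record(fields, row) -> bytes:
    """Concatenate field values in schema order; no names or types are written."""
    out = b""
    for name, avro_type in fields:
        if avro_type == "long":
            out += encode_long(row[name])
        else:  # "string": length as a long, then the UTF-8 bytes
            raw = row[name].encode("utf-8")
            out += encode_long(len(raw)) + raw
    return out

def decode_record(fields, buf: bytes) -> dict:
    """Decoding is only possible because the schema gives field order and types."""
    row, pos = {}, 0
    for name, avro_type in fields:
        if avro_type == "long":
            row[name], pos = decode_long(buf, pos)
        else:
            length, pos = decode_long(buf, pos)
            row[name] = buf[pos:pos + length].decode("utf-8")
            pos += length
    return row

# Hypothetical two-field schema and row
FIELDS = [("id", "long"), ("name", "string")]
encoded = encode_record(FIELDS, {"id": 42, "name": "Ada"})
decoded = decode_record(FIELDS, encoded)
```

The encoded bytes contain only the values. Without the field list, a reader could not tell where one field ends and the next begins.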

This output step creates the following files:

  • A file containing output data in the Avro format
  • An Avro schema file defined by the fields in this step

Fields can be defined manually or extracted from incoming steps.
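
As a rough illustration of what a schema file derived from a list of fields might contain, the sketch below builds an Avro record schema as JSON using only the Python standard library. The record name, field names, and the build_schema helper are hypothetical, and the exact layout that PDI writes may differ. The record structure and the nullable union with a null default follow the general Avro schema conventions.

```python
import json

def build_schema(record_name, fields, namespace=None):
    """Build an Avro record schema (as a dict) from simple field definitions.

    fields is a list of (name, avro_type, nullable) tuples. This helper is a
    hypothetical illustration, not part of PDI.
    """
    schema = {"type": "record", "name": record_name, "fields": []}
    if namespace:
        schema["namespace"] = namespace
    for name, avro_type, nullable in fields:
        if nullable:
            # A nullable field is a union with "null" listed first, default null
            schema["fields"].append(
                {"name": name, "type": ["null", avro_type], "default": None}
            )
        else:
            schema["fields"].append({"name": name, "type": avro_type})
    return schema

# Hypothetical field definitions, as they might be entered manually
example_fields = [
    ("customer_id", "long", False),
    ("customer_name", "string", True),
]

schema = build_schema("Customer", example_fields)
schema_json = json.dumps(schema, indent=2)  # text suitable for a schema file
```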