The Avro output step serializes data from the PDI data stream into Avro binary or JSON format, then writes it to a file. Apache Avro is a data serialization system that relies on a schema to decode binary data and extract its fields.
This output step creates the following files:
- A file containing output data in the Avro format
- An Avro schema file defined by the fields in this step
Fields can be defined manually or extracted from incoming steps.
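To illustrate what an Avro schema built from a list of fields can look like, here is a minimal Python sketch. It is not the step's own implementation: the field list, the record name `Product`, the namespace `example.pdi`, and the helper `build_avro_schema` are all hypothetical, and the schema file the step actually writes may differ in detail. The sketch only relies on the general Avro convention that a record schema is a JSON object with a `fields` array, and that an optional value is a union with `null`.

```python
import json

# Hypothetical field definitions: (name, Avro primitive type, nullable).
# These stand in for fields defined manually or taken from incoming steps.
FIELDS = [
    ("id", "long", False),
    ("name", "string", True),
    ("price", "double", True),
]


def build_avro_schema(record_name, fields, namespace=None):
    """Return an Avro record schema as a dict for the given field definitions."""
    avro_fields = []
    for name, avro_type, nullable in fields:
        if nullable:
            # An optional value is a union with "null". When the default
            # is null, "null" must be the first branch of the union.
            avro_fields.append(
                {"name": name, "type": ["null", avro_type], "default": None}
            )
        else:
            avro_fields.append({"name": name, "type": avro_type})

    schema = {"type": "record", "name": record_name, "fields": avro_fields}
    if namespace:
        schema["namespace"] = namespace
    return schema


schema = build_avro_schema("Product", FIELDS, namespace="example.pdi")

# Avro schemas are plain JSON, so the standard json module can serialize them.
schema_json = json.dumps(schema, indent=2)
print(schema_json)
```

Because the schema is ordinary JSON, it can be stored next to the data file and handed to any Avro reader that needs it to decode the binary output.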