General tab

Pentaho Data Integration

Version
9.3.x
Audience
anonymous
Part Number
MK-95PDIA003-15
ElasticSearch Bulk Insert General Tab
Option Description
Index Specifies the name of the index you want to add data to. If an index with that name doesn't yet exist in ElasticSearch, it creates one.
Type Indicates the category the data should be placed in. You define the category. In general practice, the type sometimes describes the data. For example, if the index is "twitter" the type might be tweet.
Test Index Checks whether the index exists in ElasticSearch.
Batch Size Indicates the number of items in the batch. (If you set the batch size is set to one, it is not a bulk insert, but setting it to a higher number is.)
Stop on Error Stops processing if there is an error, such as a problem with adding the document or the bulk push to the index or if the JSON is not well-formed. If this option is not selected, and an error occurs, the row is not processed, but the transformation keeps running so that other rows are processed.
Batch Timeout Indicates how long batch should be processed before the batch times out, and processing ends.
ID Field Indicates the name of the ID Field in the file.
Overwrite if exists If the output file exists because this transformation was run before, allows the output to be overwritten.
Output Rows Sends the rows that are successfully processed by ElasticSearch to the to the next step (or the output). If you've checked Stop on Error, the rows that were successful up until the time the error occurs is sent to the next step (or the output). Otherwise, rows successfully processed by Elastic search rows are sent to the next step (or the output).
ID Output Field Indicates the name if the ID field that is in the output. If this is left blank, the value in the ID Field is used instead.
JSON Input Indicates whether the input is a JSON file.
JSON Field Indicates the JSON node from which processing should begin.