The Analysis Pipeline Sequence of Processors

The analysis pipeline defines the sequence of processors that run on a document before it is added to the index. There are two kinds of processors:

  • document processors, used for transforming document meta and content,
  • and semantic processors, used for extracting structured information from unstructured document content.

See Also
Configuring the Analysis Pipeline Manually
Appendix - Configure Document Processors
Appendix - Configure Semantic Processors
Use a Single Pipeline with Groups of Processors
Testing your Analysis Pipeline Behavior

The analysis pipeline processes documents one at a time, running its document and semantic processors in the order in which they appear in the pipeline. For each input document, the pipeline outputs exactly one document.
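The behavior described above can be pictured with a small conceptual model. The sketch below is not the product's API: the `Document` class, the two example processors, and `run_pipeline` are hypothetical names introduced only to illustrate the idea of ordered processors with one output document per input document.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Document:
    """Hypothetical stand-in for a document moving through the pipeline."""
    content: str
    metas: Dict[str, str] = field(default_factory=dict)
    annotations: List[str] = field(default_factory=list)


# A processor takes one document and returns one document.
Processor = Callable[[Document], Document]


def normalize_whitespace(doc: Document) -> Document:
    """Example 'document processor': transforms the document content."""
    doc.content = " ".join(doc.content.split())
    return doc


def tag_capitalized_words(doc: Document) -> Document:
    """Example 'semantic processor': extracts structured information
    (here, naively, every capitalized word) from unstructured content."""
    doc.annotations = [w for w in doc.content.split() if w[:1].isupper()]
    return doc


def run_pipeline(doc: Document, processors: List[Processor]) -> Document:
    """Apply each processor in list order; exactly one document comes out."""
    for processor in processors:
        doc = processor(doc)
    return doc


pipeline = [normalize_whitespace, tag_capitalized_words]
result = run_pipeline(Document(content="  Paris   is the capital of France  "), pipeline)
print(result.content)      # Paris is the capital of France
print(result.annotations)  # ['Paris', 'France']
```

Because each processor receives the output of the previous one, reordering the list can change the result. In this toy model, the tagging step sees the content only after whitespace has been normalized.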

You can access the analysis pipeline in the Administration Console by going to Index > Data processing > Analysis pipelines, and selecting the pipeline name.

The following figure shows the typical analysis pipeline workflow.