Debug Processor
Dumps all the DocumentChunks specified in 'Input from' to standard output. Provides a log of the 'Analysis' process.
Discard Document Processor
DEPRECATED. It does not stop the processing of the document. To do so, add a custom document processor with the following code:
    document.setProcessingFlag(Operation.DISCARD_AND_DELETE);
    ((AnalysisDocumentProcessingContext) context).stopProcessingAfterCurrentProcessor();
Document Processor Group
Contains a list of document processors, which are executed only if the group's condition matches. Avoids duplicating conditions or creating distinct pipelines when several processors share the same condition.
Format Checker Date
The Format Checker Date processor checks that the chunk matches either:
Infer File Extension
When the
Insert Current Date
Adds the current date in an output context.
Precomputed Thumbnails Document Processor
Precomputes thumbnails of the first DocumentPart.
Random DocumentChunks Generator (Uniform Distribution)
Adds a new DocumentChunk for one document out of every 'modulo' documents processed. The textual content of the DocumentChunk is picked from the list specified in Values, with a uniform distribution.
Random DocumentChunks Generator (Zipf Distribution)
Adds a new DocumentChunk for one document out of every 'modulo' documents processed. The textual content of the DocumentChunk is picked from the list specified in Values, with a nonuniform discrete Zipf distribution.
Real-Time Alerting
Matches queries defined by end users and alerts them as soon as possible when a new matching document is indexed.
Semantic Pipe
Instantiates a semantic pipe and creates chunks out of the resulting annotations. It helps instantiate classification processors and perform document-level operations on their output.
Similar String to Part Converter
Converts signatures stored in string format in a meta into a binary part.
Storage Service Document Processor
Queries the storage for any meta to attach to the document. Multivalued pairs are pushed as multivalued metas. For example:
UTF8 Checker
Checks that the text passing through is valid UTF-8. Emits a warning with the document URI and the context name if the input is malformed. Optionally deletes invalid chunks.
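The kind of check the UTF8 Checker describes can be sketched with the standard Java charset API. This is an illustration only and not the product's implementation: the class `Utf8Check`, the methods `isValidUtf8` and `keepChunk`, and the warning text are hypothetical names chosen for this example, and the chunk is assumed to be available as a raw byte array.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class; not part of any product API.
class Utf8Check {

    /** Returns true if the bytes form a well-formed UTF-8 sequence. */
    static boolean isValidUtf8(byte[] bytes) {
        // REPORT makes the decoder throw instead of silently substituting U+FFFD.
        // A new decoder is created per call because CharsetDecoder is not thread-safe.
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    /**
     * Assumed behaviour, modelled on the description above: warn with the
     * document URI and context name, then keep or drop the chunk.
     * Returns true if the chunk should be kept.
     */
    static boolean keepChunk(String uri, String contextName, byte[] bytes, boolean deleteInvalid) {
        if (isValidUtf8(bytes)) {
            return true;
        }
        System.err.println("WARN malformed UTF-8 in context '" + contextName
                + "' of document " + uri);
        return !deleteInvalid;
    }

    public static void main(String[] args) {
        byte[] good = "h\u00e9llo".getBytes(StandardCharsets.UTF_8);
        byte[] bad = {(byte) 0xC3, (byte) 0x28}; // lead byte followed by a non-continuation byte

        System.out.println(isValidUtf8(good)); // prints "true"
        System.out.println(isValidUtf8(bad));  // prints "false"
        System.out.println(keepChunk("doc://1", "text", bad, true)); // prints "false"
    }
}
```

The validity check works on bytes, not on a `String`: once bytes have been decoded into a Java `String` with the default replacement behaviour, malformed sequences have already been replaced and can no longer be detected.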