Write a Java Custom Tokenizer

A Java Custom Tokenizer is useful when you need to process text with an external analyzer or to implement a specific tokenization behavior. The JavaCustomTokenizer lets you write your own code to split the input and, optionally, add annotations to the tokens it produces.

The resulting tokens then continue through the indexing chain as usual (see Sample Tokenizer).
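
To make the idea of "splitting the input and adding annotations" concrete, here is a minimal, self-contained sketch in plain Java. It does not use the product's tokenizer API: the TokenizerSketch class, the Token type, and the annotation names ("kind", "lowercase") are all hypothetical and exist only for illustration. A real custom tokenizer would implement the interfaces shown in Sample Tokenizer instead.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Illustrative only: these types are assumptions, not the product API.
final class TokenizerSketch {

    /** A token: its text, its start offset in the input, and free-form annotations. */
    public static final class Token {
        public final String text;
        public final int offset;
        public final Map<String, String> annotations = new LinkedHashMap<>();

        Token(String text, int offset) {
            this.text = text;
            this.offset = offset;
        }
    }

    /** Splits the input into runs of letters/digits and single-character symbols; whitespace is skipped. */
    public static List<Token> tokenize(String input) {
        List<Token> tokens = new ArrayList<>();
        int i = 0;
        while (i < input.length()) {
            char c = input.charAt(i);
            if (Character.isWhitespace(c)) {
                i++;
                continue;
            }
            int start = i;
            if (Character.isLetterOrDigit(c)) {
                // Consume the whole alphanumeric run as one token.
                while (i < input.length() && Character.isLetterOrDigit(input.charAt(i))) {
                    i++;
                }
            } else {
                // Any other visible character becomes its own token.
                i++;
            }
            Token token = new Token(input.substring(start, i), start);
            annotate(token);
            tokens.add(token);
        }
        return tokens;
    }

    /** Attaches hypothetical annotations describing the token. */
    static void annotate(Token token) {
        if (token.text.chars().allMatch(Character::isDigit)) {
            token.annotations.put("kind", "number");
        } else if (Character.isLetterOrDigit(token.text.charAt(0))) {
            token.annotations.put("kind", "word");
            token.annotations.put("lowercase", token.text.toLowerCase(Locale.ROOT));
        } else {
            token.annotations.put("kind", "punct");
        }
    }

    public static void main(String[] args) {
        // Print offset, text, and annotations for each token.
        for (Token token : tokenize("Hello, World 42")) {
            System.out.println(token.offset + "\t" + token.text + "\t" + token.annotations);
        }
    }
}
```

In this sketch the annotations travel with each token, so later stages can read them without re-analyzing the text. The same split-then-annotate pattern would apply if the splitting were delegated to an external analyzer.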

See Also
About Tokens and Annotations