About Creating Additional Tokenization Configurations

A tokenization configuration specifies which tokenizers to use when Exalead CloudView analyzes incoming documents at index-time. It also specifies how to tokenize queries at search-time.

By default, Exalead CloudView uses tok0 as the tokenization configuration for converting text into tokens. If you create additional tokenization configurations, Exalead CloudView does not apply them automatically: you must specify them explicitly in Data Model > Semantic Types and Data Processing.
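The following sketch illustrates, in generic terms, why a single tokenization configuration covers both index-time and search-time: a query can only match a document if both were split into tokens by the same rules. It is not Exalead CloudView code. The `tokenize`, `build_index`, and `search` functions are hypothetical names chosen for illustration, and the toy tokenizer (lowercase, split on non-alphanumeric characters) is an assumption, not the behavior of tok0.

```python
import re


def tokenize(text):
    """Toy tokenizer (assumption): lowercase, then split on non-alphanumerics."""
    return [t for t in re.split(r"[^0-9a-z]+", text.lower()) if t]


def build_index(docs):
    """Index-time: map each token to the set of document ids containing it."""
    index = {}
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index.setdefault(token, set()).add(doc_id)
    return index


def search(index, query):
    """Search-time: tokenize the query with the SAME rules, then intersect."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    result = index.get(tokens[0], set()).copy()
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


# Hypothetical sample documents.
docs = {1: "Cloud-based search engine", 2: "Search the cloud"}
index = build_index(docs)
```

Because the query "CLOUD search" goes through the same `tokenize` function as the documents, it matches both of them despite the difference in case and punctuation. If the query were tokenized with different rules than the documents, the lookup could miss tokens that are present in the index.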

See Also
Using Native Tokenizers
Using Basis Tech Tokenizer
Customizing the Tokenization Config