Subexpr and Substring matching
|
By default, suggest matching is prefix-based: the input "first" returns the
entries "first test" and "first image". Use these options when you need
broader matching that is not limited to prefixes.
- Subexpr matching finds matches at the start of every word. For
example, the entry "first test" is returned for both "fir" and "tes".
- Substring matching finds matches starting at any character. For
example, the entry "first test" is returned for "fir", "rs", "es", and so on.
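The three matching modes can be illustrated with a minimal Python sketch. This is not how Exalead CloudView implements matching; the function names `matches` and `suggest` and the mode strings are hypothetical and used only to show the behavior described above.

```python
from typing import List

ENTRIES = ["first test", "first image"]  # sample entries from the description


def matches(entry: str, query: str, mode: str = "prefix") -> bool:
    """Return True if query matches entry under the given (hypothetical) mode."""
    entry, query = entry.lower(), query.lower()
    if mode == "prefix":
        # Match only at the very beginning of the entry.
        return entry.startswith(query)
    if mode == "subexpr":
        # Match at the start of any word in the entry.
        words = entry.split()
        return any(" ".join(words[i:]).startswith(query) for i in range(len(words)))
    if mode == "substring":
        # Match starting at any character of the entry.
        return query in entry
    raise ValueError(f"unknown mode: {mode}")


def suggest(query: str, mode: str) -> List[str]:
    """Return all sample entries matching query under the given mode."""
    return [e for e in ENTRIES if matches(e, query, mode)]


print(suggest("first", "prefix"))
print(suggest("tes", "subexpr"))
print(suggest("es", "substring"))
```

In this sketch, "tes" finds nothing in prefix mode but finds "first test" in subexpr mode, while "rs" only matches in substring mode.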
|
Sentence split and Ngram split
|
For performance reasons, use these options to avoid long entries. By
"long", we mean entries longer than 100 characters (100 bytes). The
sentence split and ngram split options break up a suggest entry into
several entries, and matching is performed independently on each chunk.
- For sentence split, if the entry contains several sentences, one entry is
created for each sentence.
- For ngram split, a sliding window of a given size, counted in words, moves
over the entry, and one entry is created for each position of the window.
For example, "a b c d e f" with a split on 4-grams gives the entries
"a b c d", "b c d e", and "c d e f".
Note: 0 means no splitting.
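The two split options can be sketched in Python as follows. This is an illustration only: the function names are hypothetical, the sentence boundary rule (split after ".", "!" or "?") is a simplifying assumption, and returning a short entry unchanged when it has fewer words than the window size is also an assumption.

```python
import re
from typing import List


def sentence_split(entry: str) -> List[str]:
    """Create one entry per sentence (naive boundary rule, assumed here)."""
    parts = re.split(r"(?<=[.!?])\s+", entry.strip())
    return [p for p in parts if p]


def ngram_split(entry: str, n: int) -> List[str]:
    """Create one entry per position of a sliding window of n words."""
    words = entry.split()
    # 0 means no splitting; entries shorter than the window are kept whole.
    if n <= 0 or len(words) <= n:
        return [entry]
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


print(ngram_split("a b c d e f", 4))  # ['a b c d', 'b c d e', 'c d e f']
print(sentence_split("First sentence. Second sentence!"))
```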
|
Compute permutations
|
Computes all word permutations of an entry and adds them as separate entries.
For example, if you start entering "Angeles", Exalead CloudView automatically suggests "Los Angeles".
Note:
- Entries longer than 8 words are not permuted, for performance reasons.
- This action is performed after the sentence split if the
Sentence split option is selected.
To apply permutation to the Static XML suggest and
Static resource suggest types, add
permutation="true" to the SuggestDictionary
tag in your XML file or in the suggest resource in the Business Console.
|
Max. entry length
|
The maximum number of characters in a suggest entry. This is a
security measure to prevent overly long entries: entries that exceed the
specified length are automatically truncated. 0 means no limit.
|
Max. suggestions
|
The maximum number of suggestions that can be shown to the user for a given
input string. You cannot change this dynamically.
|
Tokenization config
|
Specifies the Tokenization configuration to use.
|
Sanitize entry
|
Strips punctuation from the entry and encloses any UQL operators
in quotes. This is useful when you want to suggest among a list of
product references containing "-" (hyphens) or other
delimiters, and you do not want any tokenization on these characters.
|
Build after import
|
Triggers a build automatically after the index refreshes.
|
Enable security
|
Uses the security tokens of documents and users to restrict
suggestions.
|