File: uaxurlemail-tokenizer.asciidoc

package info (click to toggle)
elasticsearch 1.0.3%2Bdfsg-5
  • links: PTS, VCS
  • area: main
  • in suites: jessie-kfreebsd
  • size: 37,220 kB
  • sloc: java: 365,486; xml: 1,258; sh: 714; python: 505; ruby: 354; perl: 134; makefile: 41
file content (16 lines) | stat: -rw-r--r-- 614 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
[[analysis-uaxurlemail-tokenizer]]
=== UAX Email URL Tokenizer

A tokenizer of type `uax_url_email` which works exactly like the
`standard` tokenizer, but tokenizes emails and urls as single tokens.

The following are settings that can be set for a `uax_url_email`
tokenizer type:

[cols="<,<",options="header",]
|=======================================================================
|Setting |Description
|`max_token_length` |The maximum token length. If a token is seen that
exceeds this length then it is discarded. Defaults to `255`.
|=======================================================================