[[analysis-lowercase-tokenizer]]
=== Lowercase Tokenizer

A tokenizer of type `lowercase` that performs the function of the
<<analysis-letter-tokenizer,Letter Tokenizer>> and the
<<analysis-lowercase-tokenfilter,Lower Case Token Filter>> together. It
divides text at non-letters and converts the resulting tokens to lower
case. Although it is functionally equivalent to combining those two
components, doing both tasks in a single pass has a performance
advantage, which is why this otherwise redundant implementation exists.
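
The equivalence described above can be illustrated with a small Python
sketch. This is not the Elasticsearch or Lucene implementation. The
function names `letter_tokenize`, `lowercase_filter` and
`lowercase_tokenize` are hypothetical, and `str.isalpha` is assumed to be
a reasonable stand-in for the tokenizer's notion of a letter.

```python
from itertools import groupby


def letter_tokenize(text):
    """Step 1 (illustrative): split text into maximal runs of letters."""
    return ["".join(run) for is_letter, run in groupby(text, key=str.isalpha) if is_letter]


def lowercase_filter(tokens):
    """Step 2 (illustrative): lowercase each token, character by character."""
    return ["".join(ch.lower() for ch in token) for token in tokens]


def lowercase_tokenize(text):
    """Both steps in one pass: lowercase letters while collecting runs."""
    tokens, current = [], []
    for ch in text:
        if ch.isalpha():
            current.append(ch.lower())
        elif current:
            # A non-letter ends the current token.
            tokens.append("".join(current))
            current = []
    if current:
        tokens.append("".join(current))
    return tokens


sample = "The QUICK brown-Fox2 jumped!"
print(lowercase_tokenize(sample))
# ['the', 'quick', 'brown', 'fox', 'jumped']
print(lowercase_tokenize(sample) == lowercase_filter(letter_tokenize(sample)))
# True
```

The single-pass version visits each character once and builds no
intermediate token list, which is the kind of saving the combined
tokenizer is meant to provide.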