File: snowball-tokenfilter.asciidoc

package info (click to toggle)
elasticsearch 1.0.3%2Bdfsg-5
  • links: PTS, VCS
  • area: main
  • in suites: jessie-kfreebsd
  • size: 37,220 kB
  • sloc: java: 365,486; xml: 1,258; sh: 714; python: 505; ruby: 354; perl: 134; makefile: 41
file content (33 lines) | stat: -rw-r--r-- 1,010 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
[[analysis-snowball-tokenfilter]]
=== Snowball Token Filter

A filter that stems words using a Snowball-generated stemmer. The
`language` parameter controls the stemmer with the following available
values: `Armenian`, `Basque`, `Catalan`, `Danish`, `Dutch`, `English`,
`Finnish`, `French`, `German`, `German2`, `Hungarian`, `Italian`, `Kp`,
`Lovins`, `Norwegian`, `Porter`, `Portuguese`, `Romanian`, `Russian`,
`Spanish`, `Swedish`, `Turkish`.

For example:

[source,js]
--------------------------------------------------
{
    "index" : {
        "analysis" : {
            "analyzer" : {
                "my_analyzer" : {
                    "tokenizer" : "standard",
                    "filter" : ["standard", "lowercase", "my_snow"]
                }
            },
            "filter" : {
                "my_snow" : {
                    "type" : "snowball",
                    "language" : "Lovins"
                }
            }
        }
    }
}
--------------------------------------------------