File: example4.config

package info (click to toggle)
swish-e 2.4.3-7
  • links: PTS
  • area: main
  • in suites: etch, etch-m68k
  • size: 7,308 kB
  • ctags: 7,642
  • sloc: ansic: 47,402; sh: 8,508; perl: 5,281; makefile: 723; xml: 9
file content (95 lines) | stat: -rwxr-xr-x 2,911 bytes parent folder | download | duplicates (11)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
# ----- Example 4 - Site-wide settings -------
#
#  Please see the swish-e documentation for
#  information on configuration directives.
#  Documentation is included with the swish-e
#  distribution, and also can be found on-line
#  at http://swish-e.org
#
#
#  This example demonstrates how to define
#  settings that change the way swish indexes.
#  Since you will probably want consistent
#  settings for all your indexes, you can
#  create one file, and include it in other
#  config files.
#
#  Once you define a common configuration file you
#  can include it in other configuration files.  For
#  example, if this file was saved as "common.config"
#  you can include it in other configuration files
#  with the following directive:
#
#       ...
#       IncludeConfig  /home/swish/common.config
#       ...
#
#---------------------------------------------------

# These settings tell swish what defines a word.

# We only index words that include letters, numbers, a dash,
# or a period.  (Not very realistic)

# These are the characters that are allowed in a "word".
# i.e. words are split on any character NOT found in WordCharacters

WordCharacters abcdefghijklmnopqrstuvwxyz0123456789.-


# We allow a period and a dash within words, but strip them
# from the beginning or end of a word.  This is done after
# WordCharacters above is used to split words.

IgnoreFirstChar .-
IgnoreLastChar  .-

# Finally, resulting words must begin/end with one
# of the characters listed here

BeginCharacters abcdefghijklmnopqrstuvwxyz0123456789
EndCharacters   abcdefghijklmnopqrstuvwxyz0123456789


# Turn this on for a slight performance improvement
#FollowSymLinks yes


# This is how detailed you want reporting. You can specify numbers
# 0 to 3 - 0 is totally silent, 3 is the most verbose.
# 4 is debugging.  Can be overridden with -v on the command line

IndexReport 2


# Set the stopwords (words to ignore when searching and when indexing)
# Carefully think about this feature before using a list of stopwords
# You can list the words here:
#  IgnoreWords of or and the a to
# Or you can use the compiled in defaults:
#  IgnoreWords SwishDefault
# Or you can use a file that includes your own words:

IgnoreWords file: stopwords/my_stopwords.txt

# Another option is to use the IgnoreLimit directive, and
# swish will determine what stopwords to use.  But please
# read the documentation before using the IgnoreLimit directive.
# It can be slow, and may not work with other options.


# Since we are using such a restrictive WordCharacters settings, we
# want to map eight-bit characters to ascii.
# For example, "resum" will be indexed and searched as "resume".
# See docs for more info.

TranslateCharacters :ascii7:


# We don't want pharse searches to work across sentenses, plus
# we use the pipe "|" to force a break in phrases when indexing.

BumpPositionCounterCharacters |.

# end of example