1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95
|
# ----- Example 4 - Site-wide settings -------
#
# Please see the swish-e documentation for
# information on configuration directives.
# Documentation is included with the swish-e
# distribution, and also can be found on-line
# at http://swish-e.org
#
#
# This example demonstrates how to define
# settings that change the way swish indexes.
# Since you will probably want consistent
# settings for all your indexes, you can
# create one file, and include it in other
# config files.
#
# Once you define a common configuration file you
# can include it in other configuration files. For
# example, if this file was saved as "common.config"
# you can include it in other configuration files
# with the following directive:
#
# ...
# IncludeConfig /home/swish/common.config
# ...
#
#---------------------------------------------------
# These settings tell swish what defines a word.
# We only index words that include letters, numbers, a dash,
# or a period. (Not very realistic)
# These are the characters that are allowed in a "word".
# i.e. words are split on any character NOT found in WordCharacters
WordCharacters abcdefghijklmnopqrstuvwxyz0123456789.-
# We allow a period and a dash within words, but strip them
# from the beginning or end of a word. This is done after
# WordCharacters above is used to split words.
IgnoreFirstChar .-
IgnoreLastChar .-
# Finally, resulting words must begin/end with one
# of the characters listed here
BeginCharacters abcdefghijklmnopqrstuvwxyz0123456789
EndCharacters abcdefghijklmnopqrstuvwxyz0123456789
# Turn this on for a slight performance improvement
#FollowSymLinks yes
# This is how detailed you want reporting. You can specify numbers
# 0 to 3 - 0 is totally silent, 3 is the most verbose.
# 4 is debugging. Can be overridden with -v on the command line
IndexReport 2
# Set the stopwords (words to ignore when searching and when indexing)
# Carefully think about this feature before using a list of stopwords
# You can list the words here:
# IgnoreWords of or and the a to
# Or you can use the compiled in defaults:
# IgnoreWords SwishDefault
# Or you can use a file that includes your own words:
IgnoreWords file: stopwords/my_stopwords.txt
# Another option is to use the IgnoreLimit directive, and
# swish will determine what stopwords to use. But please
# read the documentation before using the IgnoreLimit directive.
# It can be slow, and may not work with other options.
# Since we are using such a restrictive WordCharacters settings, we
# want to map eight-bit characters to ascii.
# For example, "resum" will be indexed and searched as "resume".
# See docs for more info.
TranslateCharacters :ascii7:
# We don't want pharse searches to work across sentenses, plus
# we use the pipe "|" to force a break in phrases when indexing.
BumpPositionCounterCharacters |.
# end of example
|