1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196
|
espa~nol.dicc Release 1.7-- A Spanish (Espa~nol) dictionary for using
with ispell 3.1.13 or later
Copyright (c) 1994 1995 1996 1999 2001 Santiago Rodr'iguez and Jes'us Carretero
This program is free software; you can redistribute it and/or modify
it under the terms of the GNU General Public License as published by
the Free Software Foundation.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details. This software can be
obtained from ftp.fi.upm.es in /pub/unix/espa~nol.tar.gz and
http://www.datsi.fi.upm.es/~coes
This set of data files implements a Spanish (castellano) dictionary to
be used with the international ispell program, version 3.1.13 or
further. The dictionary contains 53,000 roots (aprox.).
If you want to run the spanish dictionary, you have to undefine the
NO8BIT macro in the local.h configuration file.
Contents:
1. Uncompressing the package.
2. Building the dictionary.
3. Installing the dictionary.
4. Supported Formatters.
5. MSDOS Dictionary.
1. The distribution is included in the espa~nol.tar.gz file.
To extract sources that are in files ending in `.tar.gz' you can use the command
gzip -d < espa~nol.tar.gz | tar xf -
where `espa~nol.tar.gz' is the name of the file.
This file is expanded to the following files:
espa~nol.aff: Affixes file.
espa~nol.words: Contains a list of words that appear in the
official espa~nol dictionary (Diccionario de la Real Academia Espa'nola
de la Lengua 21st edition).
espa~nol.comp: Contains a list of words not appearing in the
official dictionary but being used in computer related texts.
antiguas.words: Contains a list of words that appear in the
official espa~nol dictionary and they are old ones that are not
currently in use.
espa~nol.words+: Contains the expanded list of words generated
from the espa~nol.words and espa~nol.comp word files.
e~nes: Script for replacing the 'n and 'N by ~n and ~N in the
espa~nol.aff, espa~nol.words and espa~nol.words+. If you use the
second way to specify this letter you have to run this script. This
script uses the sed utility. It has been checked by using the GNU sed
version 2.05. If you want to run this script make sure that you have the
GNU sed installed and type:
make e~ne
Makefile: Makefile for building the hash file
(espa~nol.hash) from the affix file and the espa~nol.words file.
This way of building the hash file needs about 50Mb of paging space and
100 Mb of temporary disk space. Please, ensure that you have
enough disk space in the tmp partition (usually /usr/tmp).
If you do not have it, you have to set the TMPDIR environment variable to
a path where you can allocate 100 Mb of temporary disk storage.
2. If you want to create the espa~nol.hash just type:
make
Quick building: If you want to create the espa~nol.hash from
the expanded word list (espa~nol.words+), just type:
make build
It does not need so much temporary space.
The size of the spanish dictionary (espa~nol.hash) is
2.5Mbytes (Solaris 2.7 has this problem). If you get a size much bigger,
probably it is due to the sort command of the operating system. In this
case we recommend to install the textutils package of GNU and be sure that
the sort command that you use is the textutils one.
3. To install the hash file become root and type
make install
4. Spanish acute chars may be codes in different ways.
Six different formatters are supported:
tex formatter: The acute characters are coded as follows:
'a
'e
'i
'o
'u
'n
"u
'A
'E
'I
'O
'U
'N
"U
TeX formatter: The acute characters are coded as follows:
\'a
\'e
\'{\i}
\'o
\'u
\'n
\"u
\'A
\'E
\'{\I}
\'O
\'U
\'N
\"U
plainTeX formatter: The acute characters are coded as follows:
\'{a}
\'{e}
\'{\i}
\'{o}
\'{u}
\'{n}
\"{u}
\'{A}
\'{E}
\'{\I}
\'{O}
\'{U}
\'{N}
\"{U}
latin1 formatter (default): The acute characters are coded as
specified in the iso_8859_1 code.
msdos formatter: The acute characters are coded as specified
in the extended ASCII MSDOS code.
html formatter: The acute characters are coded as follows:
á
é
í
ó
ú
ñ
ü
Á
É
Í
Ó
Ú
Ñ
Ü
If you want to run ispell by using one of the previous formats
please type:
ispell -T <formatter> -d espa~nol <file>
5. espa~nol.hash file is available for MSDOS users at:
http://www.datsi.fi.upm.es/~coes
Note that the affixes list and the word list are under
development. We are currently working on them. If you find words
that does not appear in the word list or words that must not appear in
the word list, please send a message to espanol-bugs@datsi.fi.upm.es
Santiago Rodriguez & Jesus Carretero
Departamento de Arquitectura
y Tecnologia de Sistemas Informaticos (DATSI)
Universidad Politecnica de Madrid
December 1994
January 1995
February 1995
November 1996
April 1999
June 2001
Email: srodri@fi.upm.es, jcarrete@fi.upm.es
|