1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65
|
** This is version 0.7 (3rd of September 2000)
Added many flagged words to the dictionary (i.e. Used more
the affix compression capabilities). This caused a massive
increase in the number of recognized words and at the same
time decreased the size of the hash file, increased speed
and reduced memory requirements.
Statistics: 6678677 - 756588 words are recognized, of which 6600334 - 678245
are derived from 9837 words. There are 88180 words
in the dictionary. From one word can on average
about 670 - 68 word forms be derived.
Size of the produced hash file is 9.0M - 2.7M.
** The previous version was 0.6 (4th of August 2000)
Dictionary contains now more words (141k new items), and Finnish
inflection patterns are now more complete. (1.5 - 11 x more word
forms generated, depending on which one of the affix files is chosen.)
Suffixes in affix files are automatically generated by the genfisuffix
program. New features in them are complete possessive suffixes and
suffix particles. Also mostly complete verb inflection patterns are
now present. (But the dictionary file doesn't use them much.)
More information is in the genfisuffix documentation.
Statistics: 2994852 - 392302 words are recognized, of which 2902107 - 299557
are derived from 6375 words. There are 99120 words
in the dictionary. From one word can on average
about 455 - 46 word forms be derived.
Size of the produced hash file is 9.2M - 3.0M.
** The previous version was 0.5 (23th of June 2000)
Contains a much larger dictionary (90k items) compiled and contributed
by Pauli Virtanen <pauli.virtanen@saunalahti.fi>. Who also contributed
a substantially improved affix file.
Many new words were introduced in the dictionary. Adjective comparison
form inflection was made better in the affix file. Other minor fixes
were also made to the affix file.
Statistics: 252139 words are recognized, of which 168264
are derived from 5473 words. There are 89348 words
in the dictionary. From one word can on average
about 30 word forms be derived.
Size of the produced hash file is 2.3M.
** The previous version was 0.1
Contained a small dictionary and a working affix file, by
Martin Vermeer <mv@liisa.pp.fi>.
Statistics: 43972 words are recognized, of which 34379
are derived from 1235 words. There are 10828 words
in the dictionary. From one word can on average
about 27 word forms be derived.
Size of the produced hash file is 388k .
|