MORFOLOGIK
==========
FSA (automata), stemming, dictionaries and tools. Tools quickstart:

    java -jar lib/morfologik-tools-${version}-standalone.jar

MODULES
=======
This project provides:
morfologik-fsa:
- Creation of byte-based, efficient finite state automata in Java, including
custom, efficient data storage formats.
- Compatibility with FSA5, the binary format of finite state automata produced
by Jan Daciuk's "fsa" package.
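
To make "byte-based automaton" concrete, here is a minimal sketch of the idea:
a deterministic automaton whose arcs are labeled with single bytes of the
UTF-8 encoded input. The ByteTrie class and its methods are hypothetical names
used for illustration only. They are not part of the morfologik-fsa API, and
this sketch neither minimizes states nor uses the FSA5 storage format.

```java
import java.nio.charset.StandardCharsets;
import java.util.TreeMap;

/** Hypothetical illustration only; not part of the Morfologik API. */
final class ByteTrie {
    /** One automaton state: outgoing arcs keyed by byte label, plus a final flag. */
    private static final class State {
        final TreeMap<Byte, State> arcs = new TreeMap<>();
        boolean isFinal;
    }

    private final State root = new State();

    /** Adds a word by following (and creating) one arc per UTF-8 byte. */
    void add(String word) {
        State s = root;
        for (byte b : word.getBytes(StandardCharsets.UTF_8)) {
            s = s.arcs.computeIfAbsent(b, k -> new State());
        }
        s.isFinal = true;
    }

    /** Returns true if the word's byte sequence leads to a final state. */
    boolean contains(String word) {
        State s = root;
        for (byte b : word.getBytes(StandardCharsets.UTF_8)) {
            s = s.arcs.get(b);
            if (s == null) {
                return false; // no arc for this byte: the word is not accepted
            }
        }
        return s.isFinal;
    }

    public static void main(String[] args) {
        ByteTrie t = new ByteTrie();
        t.add("kot");
        t.add("koty");
        t.add("\u017caba"); // non-ASCII input spans several byte arcs per character
        System.out.println(t.contains("koty")); // true
        System.out.println(t.contains("ko"));   // false: a prefix, not a final state
    }
}
```

A real implementation would also merge equivalent suffix states, which is what
makes such automata compact for large dictionaries.
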
morfologik-stemming:
- FSA-based stemming interfaces and dictionary metadata.
morfologik-polish:
- Precompiled dictionary of inflected forms, stems and tags for the Polish
language built on top of a large dictionary.
morfologik-tools:
- Command-line tools to preprocess, build and dump automata and dictionaries.
- For an up-to-date list of all tools, run:
    java -jar lib/morfologik-tools-${version}.jar
morfologik-speller:
- Simplistic automaton-based spelling correction (suggester).
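
As a rough illustration of what a suggester does, the sketch below proposes
dictionary words within a given edit distance of a misspelled input. It is an
assumption-laden stand-in: the NaiveSuggester class is a hypothetical name, and
it scans a plain word list with Levenshtein distance rather than traversing an
automaton, so it does not reflect how morfologik-speller is implemented.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical illustration only; not the morfologik-speller API. */
final class NaiveSuggester {
    /** Levenshtein distance computed with two rolling rows. */
    static int editDistance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // distance from the empty prefix of a
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev; // swap rows so prev always holds the latest row
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    /** Returns dictionary words at most maxDistance edits away, in input order. */
    static List<String> suggest(String word, List<String> dictionary, int maxDistance) {
        List<String> out = new ArrayList<>();
        for (String candidate : dictionary) {
            if (editDistance(word, candidate) <= maxDistance) {
                out.add(candidate);
            }
        }
        return out;
    }
}
```

The linear scan is only practical for small word lists. An automaton-based
suggester can instead prune whole branches of the dictionary once the edit
budget is exhausted.
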
AUTHORS
=======
Marcin Miłkowski (http://marcinmilkowski.pl) [linguistic data lead, code]
Dawid Weiss (http://www.dawidweiss.com) [fsa lead, code]
CONTRIBUTORS
============
Grzegorz Słowikowski [initial maven configs]
QUESTIONS, COMMENTS
===================
Web site: http://www.morfologik.blogspot.com
Mailing list: morfologik-devel@lists.sourceforge.net