MORFOLOGIK
==========

Finite state automata (FSA), stemming, dictionaries and tools. Tools quickstart:

java -jar lib/morfologik-tools-${version}-standalone.jar


MODULES
=======

This project provides:

morfologik-fsa:

  - Creation of byte-based, efficient finite state automata in Java, including
    custom, efficient data storage formats.

  - Compatibility with FSA5, the binary format of finite state automata
    produced by Jan Daciuk's "fsa" package.
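
  To make "byte-based automaton" concrete, here is a minimal sketch in plain
  Java. It is an illustration only: the class ByteTrieSketch and its methods
  are hypothetical names, it does not use the morfologik-fsa API, and it does
  not read or write FSA5. It builds an unminimized trie whose transitions are
  labelled with UTF-8 bytes, whereas a real FSA builder would also merge
  equivalent states to save space.

```java
import java.nio.charset.StandardCharsets;
import java.util.TreeMap;

// Hypothetical illustration only: NOT the morfologik-fsa API or the FSA5 format.
class ByteTrieSketch {
    // One state: outgoing transitions keyed by a byte label, plus a "final" flag.
    static final class State {
        final TreeMap<Byte, State> next = new TreeMap<>();
        boolean isFinal;
    }

    private final State root = new State();

    // Adds a word as a path of UTF-8 byte transitions starting at the root.
    void add(String word) {
        State s = root;
        for (byte b : word.getBytes(StandardCharsets.UTF_8)) {
            s = s.next.computeIfAbsent(b, k -> new State());
        }
        s.isFinal = true;
    }

    // Follows the byte path for the word; accepts only if it ends in a final state.
    boolean contains(String word) {
        State s = root;
        for (byte b : word.getBytes(StandardCharsets.UTF_8)) {
            s = s.next.get(b);
            if (s == null) {
                return false;
            }
        }
        return s.isFinal;
    }

    public static void main(String[] args) {
        ByteTrieSketch fsa = new ByteTrieSketch();
        fsa.add("kot");
        fsa.add("koty");
        // A word with Polish diacritics, written with escapes: 4 chars, 7 UTF-8 bytes.
        fsa.add("\u017c\u00f3\u0142w");

        System.out.println(fsa.contains("koty"));                 // true
        System.out.println(fsa.contains("ko"));                   // false (a prefix, not a word)
        System.out.println(fsa.contains("\u017c\u00f3\u0142w"));  // true
    }
}
```

  Labelling transitions with bytes rather than characters keeps every label
  in a fixed 0-255 range, which is one reason byte-based formats can be
  stored compactly.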

morfologik-stemming:

  - FSA-based stemming interfaces and dictionary metadata.

morfologik-polish:

  - Precompiled dictionary of inflected forms, stems and tags for the Polish 
    language built on top of a large dictionary.

morfologik-tools:

  - Command-line tools to preprocess, build and dump automata and
    dictionaries.

  - To print an up-to-date list of all available tools, run:
    java -jar lib/morfologik-tools-${version}.jar

morfologik-speller:

  - Simplistic automaton-based spelling correction (suggester).
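
  The idea behind a suggester can be sketched in plain Java as follows. This
  is an illustration only: SuggesterSketch, editDistance and suggest are
  hypothetical names, not the morfologik-speller API. It also swaps in a
  simpler technique: it scans a word list linearly and compares Levenshtein
  distances, whereas an automaton-based suggester would walk the dictionary
  automaton instead of testing every word.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only: NOT the morfologik-speller API.
class SuggesterSketch {
    // Classic dynamic-programming Levenshtein distance
    // (insertions, deletions and substitutions each cost 1).
    static int editDistance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j;
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            // Swap rows so that "prev" always holds the last completed row.
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Returns the dictionary words within maxDistance edits of the input,
    // in dictionary order.
    static List<String> suggest(String word, List<String> dictionary, int maxDistance) {
        List<String> out = new ArrayList<>();
        for (String candidate : dictionary) {
            if (editDistance(word, candidate) <= maxDistance) {
                out.add(candidate);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> dictionary = Arrays.asList("kot", "koty", "pies", "dom");
        System.out.println(suggest("kotz", dictionary, 1)); // [kot, koty]
    }
}
```

  The linear scan costs one distance computation per dictionary word, which
  is fine for a sketch but is the part an automaton traversal is meant to
  avoid on large dictionaries.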


AUTHORS
=======

Marcin Miłkowski (http://marcinmilkowski.pl) [linguistic data lead, code]
Dawid Weiss (http://www.dawidweiss.com) [fsa lead, code]


CONTRIBUTORS
============

Grzegorz Słowikowski [initial maven configs]


QUESTIONS, COMMENTS
===================

Web site:     http://www.morfologik.blogspot.com
Mailing list: morfologik-devel@lists.sourceforge.net