File: frog.1

package info (click to toggle)
frog 0.12.15-3
  • links: PTS, VCS
  • area: main
  • in suites: wheezy
  • size: 2,096 kB
  • sloc: sh: 11,167; cpp: 4,661; python: 765; makefile: 32
file content (128 lines) | stat: -rw-r--r-- 2,350 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
.TH frog 1 "2012 January 31"

.SH NAME
frog - Dutch morpho-syntactic analyzer, IOB chunker and dependency parser
.SH SYNOPSYS
frog [options]

frog -t test-file

.SH DESCRIPTION
frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch.
frog's current version will tokenize, tag, lemmatize, and morphologically segment word tokens in Dutch text files, add IOB chunks and will assign a dependency graph to each sentence.

.SH OPTIONS

.BR -c " <configfile>"
.RS
set the configuration using 'file'
.RE

.BR -d " <level>"
.RS
set debug level.
.RE

.BR -e " <encoding>"
.RS
set input encoding. (default UTF8)
.RE

.BR -h
.RS
give some help
.RE

.BR --keep-parser-files =[yes|no]
.RS
keep the intermediate files from the parser. Last sentence only!
.RE

.BR -n
.RS
assume inputfile to hold one sentence per line
.RE

.BR -o " <file>"
.RS
send output to 'file' instead of stdout. Defaults to the name of the inputfile with '.out' appended.
.RE

.BR --outputdir " <dir>"
.RS
send all output to 'dir' instead of stdout. Creates filenames from the inputfilename(s) with '.out' appended.
.RE

.BR --skip =[mptc]
.RS
skip parts of the proces: Tokenizer (t), Chunker (c), Multi-Word unit (m) or Parser (p)
.RE

.BR -Q
.RS
Enable quotedetection in the tokenizer. May run havock!
.RE

.BR -S " <port>"
.RS
Run a server on 'port'
.RE

.BR -t " <file>"
.RS
process 'file'
.RE

.BR -x " <xmlfile>"
.RS
process 'xmlfile', which is supposed to be in FoLiA format! If 'xmlfile' is empty, and 
.BR --testdir =<dir> 
is provided, all files in 'dir' will be processed as FoLia XML.
.RE

.BR --testdir =<dir>
.RS
process all files in 'dir'. see also 
.B --outputdir
.RE

.BR --tmpdir =<dir>
.RS
location to store intermediate files. Default /tmp.
.RE

.BR -V " or " --version
.RS
show version info
.RE

.BR --xmldir =<dir>
.RS
generate FoLiA XML output and send it to 'dir'. Creates filenames from the inputfilename with '.xml' appended.
.RE

.BR -X " <file>"
.RS
generate FoLiA XML output and send it to 'file'. Defaults to the name of the inputfile(s) with '.xml' appended.
.RE

.BR --id "=<id>"
.RS
When 
.BR -X
for FoLia is given, use 'id' to give the doc an ID.
.RE


.SH BUGS
likely

.SH AUTHORS
Maarten van Gompel proycon@anaproy.nl

Ko van der Sloot Timbl@uvt.nl

Antal van den Bosch Timbl@uvt.nl

.SH SEE ALSO
.BR ucto (1)