.TH mbtg 1 "2014 december 2"

.SH NAME
mbtg \- Memory Based Tagger generator
.SH SYNOPSIS
mbtg \-T <filename> \-s <settings filename>

or

mbtg [options]

.SH DESCRIPTION

This program generates, from a tagged corpus, all the files needed to tag a text with
.BR mbt .

.SH OPTIONS

.BR \-h " or " \-\-help
.RS
show help
.RE


.BR \-T " <tagged training corpus file>"

or

.BR \-E " <enriched tagged training corpus file>"

All further options have reasonable defaults, so only experienced
users need to set them. See the mbt manual for more details.

.BR \-s " settingsfile"
.RS
.B mbtg
creates this file, which can then be used to run
.B mbt
with minimal effort, for example:
mbt \-s settings \-T somefile
.RE

.BR \-p " pattern"
.RS
the pattern for known words (default ddfa)
.RE

.BR \-P " pattern"
.RS
the pattern for unknown words (default dFapsss)
.RE

.BR \-% " <number>"
.RS
filter threshold for ambitag construction (default 5%)
.RE

.BR \-l " <lexiconfile>"

.BR \-L " <file with list of frequent words>"

.BR \-r " <ambitagfile>"

.BR \-k " <known words case base>"

.BR \-u " <unknown words case base>"

.BR \-K " <known words instances file>"

.BR \-U " <unknown words instances file>"

.BR \-V " or " \-\-version
.RS
show version info
.RE

.BR \-e " <sentence delimiter> (default '<utt>')"

.B \-X
.RS
keep the intermediate files
.RE

.BR \-O "timbl options"
.RS
 (Note: there is NO SPACE between O and the options)
  <options>   classifier options for both the known and unknown words instance bases
  K: <options>   classifier options for known words instance base
  U: <options>   classifier options for unknown words case base
  valid
.B timbl
options are: a d k m q v w x \-
.RE
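
.SH EXAMPLES
The commands below are illustrative sketches built only from the options described above; they are not taken from the mbt distribution. The file names
.IR train.tagged ,
.I train.settings
and
.I somefile
are hypothetical.

Generate all tagger files from a tagged corpus and write a settings file:
.RS
.nf
mbtg \-T train.tagged \-s train.settings
.fi
.RE

The same run with the default patterns spelled out explicitly:
.RS
.nf
mbtg \-T train.tagged \-s train.settings \-p ddfa \-P dFapsss
.fi
.RE

Run the tagger with the generated settings file, following the form shown under
.BR \-s :
.RS
.nf
mbt \-s train.settings \-T somefile
.fi
.RE

Pass classifier options for both the known and unknown words instance bases. This assumes that
.B timbl
accepts option values attached directly to the option letter; the values chosen here are arbitrary. Note that there is no space after
.BR \-O :
.RS
.nf
mbtg \-T train.tagged \-s train.settings \-O"\-k5 \-mM"
.fi
.RE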

.SH BUGS
possibly

.SH AUTHORS
Ko van der Sloot Timbl@uvt.nl

Antal van den Bosch Timbl@uvt.nl

.SH SEE ALSO
.BR timbl (1),
.BR mbt (1),
.BR mbtserver (1)