File: mbtg.1

package info (click to toggle)
mbt 3.2.10-4
  • links: PTS, VCS
  • area: main
  • in suites: jessie, jessie-kfreebsd
  • size: 2,808 kB
  • ctags: 464
  • sloc: sh: 11,062; cpp: 3,109; makefile: 36; ansic: 11
file content (106 lines) | stat: -rw-r--r-- 1,837 bytes parent folder | download | duplicates (2)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
.TH mbtg 1 "2011 march 21"

.SH NAME
MBTG - Memory Based Tagger generator
.SH SYNOPSYS
mbtg -T <filename> -s <setting filename>

or

mbtg [options]

.SH DESCRIPTION

This programs generates, based on a tagged corpus, all the files needed to be able to tag a text with 
.B mbt.
.

.SH OPTIONS

.BR -h " or " --help
.RS
show help
.RE


.BR -T " <tagged training corpus file>"

or

.BR -E " <enriched tagged training corpus file>"

All further options have reasonable defaults, so using them is only
needed for the experienced user. See the mbt manual for more details.

.BR -s " settingsfile"
.RS
.B mbtg
creates this file, which can be used to run 
.B mbt
with minimal effort. (like mbt -s settings -T somefile)
.RE

.BR -p " pattern"
.RS
the pattern for known words (default ddfa)
.RE

.BR -P " pattern"
.RS
the pattern for unknown words (default dFapsss)
.RE

.BR -% " <number>"
.RS
filter threshold for ambitag construction (default 5%)
.RE

.BR -l " <lexiconfile>"

.BR -L " <file with list of frequent words>"

.BR -r " <ambitagfile>"

.BR -k " <known words case base>"

.BR -u " <unknown words case base>"

.BR -K " <known words instances file>"

.BR -U " <unknown words instances file>"

.BR -V " or " --version
.RS
show version info
.RE

.BR -e " <sentence delimiter> (default '<utt>')"

.B -X
.RS
keep the intermediate files
.RE

.BR -O "timbl options"
.RS
 (Note: there is NO SPACE between O and the options)
  <options>   classifier options for both known and unknown words instances bases
  K: <options>   classifier options for known words instance base
  U: <options>   classifier options for unknown words case base
  valid 
.B timbl
options are: a d k m q v w x -
.RE

.SH BUGS
possibly

.SH AUTHORS
Ko van der Sloot Timbl@uvt.nl

Antal van den Bosch Timbl@uvt.nl

.SH SEE ALSO
.BR timbl (1)
.BR mbt (1)
.BR mbtserver (1)