File: pynlpl-sampler.1

.\" DO NOT MODIFY THIS FILE!  It was generated by help2man 1.46.4.
.TH PYNLPL-SAMPLER "1" "February 2016" "pynlpl-sampler 0.7.7" "User Commands"
.SH NAME
pynlpl\-sampler \- extract random samples from (parallel) datasets
.SH DESCRIPTION
usage: pynlpl\-sampler [\-h] [\-t TESTSETSIZE] [\-d DEVSETSITE] [\-T TRAINSETSITE]
[\-S SEED] files [files ...]
.PP
Extracts random samples from datasets. Multiple parallel datasets (such as
parallel corpora) are supported, provided that corresponding data is on the
same line in each file.
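.PP
The line alignment requirement can be illustrated with a minimal Python sketch.
It is not the pynlpl implementation: the helper name sample_parallel and the
in-memory lists are assumptions made for illustration. The point it shows is
that one set of line numbers is drawn once and applied to every dataset, so
corresponding lines stay together.
.PP
.nf
```python
import random


def sample_parallel(datasets, k, seed=0):
    """Pick the same k line positions from every dataset.

    Hypothetical helper for illustration only; not part of pynlpl.
    datasets is a list of lists of lines, all of equal length.
    """
    sizes = {len(d) for d in datasets}
    if len(sizes) != 1:
        raise ValueError("all datasets must have the same number of lines")
    n = sizes.pop()
    rng = random.Random(seed)
    # Draw the line numbers once, then reuse them for each dataset.
    picked = sorted(rng.sample(range(n), k))
    return [[d[i] for i in picked] for d in datasets]


english = ["one", "two", "three", "four"]
dutch = ["een", "twee", "drie", "vier"]
en_sample, nl_sample = sample_parallel([english, dutch], 2)
```
.fi
.PP
Because both samples are built from the same positions, each line in en_sample
still corresponds to the line at the same position in nl_sample.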
.SS "positional arguments:"
.TP
files
The datasets to sample from; all must be of equal size
(i.e., have the same number of lines)
.SS "optional arguments:"
.TP
\fB\-h\fR, \fB\-\-help\fR
show this help message and exit
.TP
\fB\-t\fR TESTSETSIZE, \fB\-\-testsetsize\fR TESTSETSIZE
Test set size (lines) (default: 0)
.TP
\fB\-d\fR DEVSETSITE, \fB\-\-devsetsite\fR DEVSETSITE
Development set size (lines) (default: 0)
.TP
\fB\-T\fR TRAINSETSITE, \fB\-\-trainsetsite\fR TRAINSETSITE
Training set size (lines); leave unassigned (0) to
automatically use all of the remaining data (default:
0)
.TP
\fB\-S\fR SEED, \fB\-\-seed\fR SEED
Seed for random number generator (default: 0)
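.PP
How the size options and the seed interact can be sketched as follows. This is
a hedged illustration, not the tool's actual code: the function split_indices,
its parameter names, and the choice to shuffle line numbers are assumptions.
It shows a training size of 0 meaning "all remaining lines" and a fixed seed
giving a repeatable split.
.PP
.nf
```python
import random


def split_indices(n, testsize=0, devsize=0, trainsize=0, seed=0):
    """Partition line numbers 0..n-1 into train, dev and test sets.

    Hypothetical helper for illustration only; not part of pynlpl.
    trainsize == 0 means: use every line not assigned to test or dev.
    """
    if testsize + devsize + trainsize > n:
        raise ValueError("requested set sizes exceed the number of lines")
    rng = random.Random(seed)
    order = list(range(n))
    rng.shuffle(order)  # same seed, same order, same split
    test = order[:testsize]
    dev = order[testsize:testsize + devsize]
    rest = order[testsize + devsize:]
    train = rest if trainsize == 0 else rest[:trainsize]
    return sorted(train), sorted(dev), sorted(test)


train, dev, test = split_indices(10, testsize=2, devsize=3, seed=42)
```
.fi
.PP
With 10 lines, a test size of 2 and a development size of 3, the unassigned
training size leaves the remaining 5 lines for training, and the three sets do
not overlap.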