1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36
|
.\" DO NOT MODIFY THIS FILE! It was generated by help2man 1.46.4.
.TH PYNLPL-SAMPLER "1" "February 2016" "pynlpl-sampler 0.7.7" "User Commands"
.SH NAME
sampler \- manual page for pynlpl-sampler 0.7.7
.SH DESCRIPTION
usage: pynlpl\-sampler [\-h] [\-t TESTSETSIZE] [\-d DEVSETSITE] [\-T TRAINSETSITE]
.TP
[\-S SEED]
files [files ...]
.PP
Extracts random samples from datasets, supports multiple parallel datasets
(such as parallel corpora), provided that corresponding data is on the same
line.
.SS "positional arguments:"
.TP
files
The data sets to sample from, must be of equal size
(i.e., same number of lines)
.SS "optional arguments:"
.TP
\fB\-h\fR, \fB\-\-help\fR
show this help message and exit
.TP
\fB\-t\fR TESTSETSIZE, \fB\-\-testsetsize\fR TESTSETSIZE
Test set size (lines) (default: 0)
.TP
\fB\-d\fR DEVSETSITE, \fB\-\-devsetsite\fR DEVSETSITE
Development set size (lines) (default: 0)
.TP
\fB\-T\fR TRAINSETSITE, \fB\-\-trainsetsite\fR TRAINSETSITE
Training set size (lines), leave unassigned (0) to
automatically use all of the remaining data (default:
0)
.TP
\fB\-S\fR SEED, \fB\-\-seed\fR SEED
Seed for random number generator (default: 0)
|