.TH svm-subset 1 "DEC 2009" Linux "User Manuals"
.SH NAME
svm-subset \- a subset selection tool for LIBSVM
.SH SYNOPSIS
.B svm-subset [ -s
.I method
.B ]
.I dataset number
.B [
.I output1
.B ] [
.I output2
.B ]
.SH DESCRIPTION
Training on large data sets is time-consuming, so it is often useful to
work on a smaller subset first. The Python script subset.py randomly
selects a specified number of samples. For classification data,
stratified selection is available so that the subset keeps the same
class distribution as the full data set.
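.PP
The stratified selection described above can be pictured with a short
Python sketch. This is an illustration only, not the code of subset.py:
the function name stratified_indices, the seed parameter, and the
rounding rule are assumptions made for the example.
.PP
.nf
```python
import random
from collections import defaultdict


def stratified_indices(labels, k, seed=None):
    """Pick about k indices while keeping each class's share of the data."""
    rng = random.Random(seed)

    # Group the sample indices by class label.
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    total = len(labels)
    chosen = []
    for indices in by_class.values():
        # Assumed rounding rule: each class contributes its proportional share.
        quota = round(k * len(indices) / total)
        chosen.extend(rng.sample(indices, min(quota, len(indices))))
    return sorted(chosen)


labels = [1] * 60 + [-1] * 40
subset = stratified_indices(labels, 10, seed=0)
positives = sum(1 for i in subset if labels[i] == 1)
print(len(subset), positives)  # prints: 10 6
```
.fi
.PP
With 60 positive and 40 negative labels, a subset of 10 keeps the 6 to 4
ratio. Because each class quota is rounded separately, the total can
differ slightly from the requested number for other inputs.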
.SH OPTIONS
.IP "-s method"
.TP
.B 0
--
stratified selection (classification only) (default)
.TP
.B 1
--
random selection
.IP "output1"
File that receives the subset. If output1 is omitted, the subset is
printed to standard output.
.IP "output2"
File that receives the rest of the data, that is, the instances not
selected for the subset.
.SH FILES
See
.BR svm-train (1)
for the format of
.IR dataset .
.SH EXAMPLES
.IP
svm-subset heart_scale 100 file1 file2
.LP
Selects 100 samples at random from heart_scale and stores them in
file1. All remaining instances are stored in file2. Because \-s is not
given, the default stratified selection is used.
.SH BUGS
Please report bugs to the Debian BTS.
.SH AUTHOR
Chih-Chung Chang, Chih-Jen Lin <cjlin@csie.ntu.edu.tw>, Chen-Tse Tsai <ctse.tsai@gmail.com> (packaging)
.SH "SEE ALSO"
.BR svm-train (1),
.BR svm-predict (1)