File: vcf_filter.1

package info (click to toggle)
python-pyvcf 0.6.8%2Bgit20170215.476169c-9
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 1,816 kB
  • sloc: python: 2,924; makefile: 124; sh: 19
file content (81 lines) | stat: -rw-r--r-- 2,537 bytes parent folder | download | duplicates (6)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
.TH VCF_FILTER "1" "October 2015" "0.6.7" "User Commands"
.SH NAME
vcf_filter \- Filter a VCF file
.SH SYNOPSIS
.B vcf_filter
[\-h] [\-\-no\-short\-circuit] [\-\-no\-filtered]
[\-\-output OUTPUT] [\-\-local\-script LOCAL_SCRIPT]
input filter [filter_args] [filter [filter_args]] ...
.SH DESCRIPTION
This script is part of PyVCF.
.SH OPTIONS
.SS "positional arguments:"
.TP
input
File to process (use \- for STDIN) (default: None)
.SS "optional arguments:"
.TP
\fB\-h\fR, \fB\-\-help\fR
Show this help message and exit. (default: False)
.TP
\fB\-\-no\-short\-circuit\fR
Do not stop filter processing on a site if any filter
is triggered (default: False)
.TP
\fB\-\-output\fR OUTPUT
Filename to output [STDOUT] (default:
<_io.TextIOWrapper name='<stdout>' mode='w'
encoding='ANSI_X3.4\-1968'>)
.TP
\fB\-\-no\-filtered\fR
Output only sites passing the filters (default: False)
.TP
\fB\-\-local\-script\fR LOCAL_SCRIPT
Python file in current working directory with the
filter classes (default: None)
.SS "mgq:"
.IP
Filters sites with only low quality variants. It is possible to have a
high site quality with many low quality calls. This filter demands at
least one call be above a threshold quality.
.TP
\fB\-\-genotype\-quality\fR GENOTYPE_QUALITY
Filter sites with no genotypes above this quality
(default: 50)
.SS "snp-only:"
.IP
Choose only SNP variants
.SS "dps:"
.IP
Threshold read depth per sample
.TP
\fB\-\-depth\-per\-sample\fR DEPTH_PER_SAMPLE
Minimum required coverage in each sample (default: 5)
.SS "avg-dps:"
.IP
Threshold average read depth per sample (read_depth / sample_count)
.TP
\fB\-\-avg\-depth\-per\-sample\fR AVG_DEPTH_PER_SAMPLE
Minimum required average coverage per sample (default:
3)
.SS "eb:"
.IP
Filter sites that look like correlated sequencing errors. Some sequencing
technologies, notably pyrosequencing, produce mutation hotspots where
there is a constant level of noise, producing some reference and some
heterozygote calls. This filter computes a Bayes Factor for each site by
comparing the binomial likelihood of the observed allelic depths under: *
A model with constant error equal to the MAF. * A model where each sample
is the ploidy reported by the caller. The test value is the log of the
bayes factor. Higher values are more likely to be errors. Note: this
filter requires rpy2
.TP
\fB\-\-eblr\fR EBLR
Filter sites above this error log odds ratio (default:
\fB\-10\fR)
.SS "sq:"
.IP
Filter low quailty sites
.TP
\fB\-\-site\-quality\fR SITE_QUALITY
Filter sites below this quality (default: 30)