1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91
|
.\" Automatically generated by Pandoc 2.19.2
.\"
.\" Define V font for inline verbatim, using C font in formats
.\" that render this, and otherwise B font.
.ie "\f[CB]x\f[]"x" \{\
. ftr V B
. ftr VI BI
. ftr VB B
. ftr VBI BI
.\}
.el \{\
. ftr V CR
. ftr VI CI
. ftr VB CB
. ftr VBI CBI
.\}
.TH "IHS" "1" "" "iHS (vcflib)" "iHS (VCF statistics)"
.hy
.SH NAME
.PP
\f[B]iHS\f[R]
.SH SYNOPSIS
.PP
\f[B]iHS\f[R] \[en]target 0,1,2,3,4,5,6,7 \[en]file my.phased.vcf
\ \[en]region chr1:1-1000 > STDOUT 2> STDERR
.SH DESCRIPTION
.PP
\f[B]iHS\f[R] calculates the integrated haplotype score which measures
the relative decay of extended haplotype homozygosity (EHH) for the
reference and alternative alleles at a site (see: voight et al.\ 2006,
Spiech & Hernandez 2014).
.SH OPTIONS
.IP
.nf
\f[C]
Our code is highly concordant with both implementations mentioned. However, we do not set an upper limit to the allele frequency. **iHS** can be run without a genetic map, in which case the change in EHH is integrated over a constant. Human genetic maps for GRCh36 and GRCh37 (hg18 & hg19) can be found at: http://bochet.gcc.biostat.washington.edu/beagle/genetic_maps/ . **iHS** by default interpolates SNV positions to genetic position (you don\[aq]t need a genetic position for every VCF entry in the map file).
**iHS** analyses requires normalization by allele frequency. It is important that **iHS** is calculated over large regions so that the normalization does not down weight real signals. For genome-wide runs it is recommended to run slightly overlapping windows and throwing out values that fail integration (columns 7 & 8 in the output) and then removing duplicates by using the \[aq]sort\[aq] and \[aq]uniq\[aq] linux commands. Normalization of the output is as simple as running \[aq]normalize-**iHS**\[aq].
**iHS** calculates the integrated ratio of haplotype decay between the reference and non-reference allele.
Output : 4 columns :
1. seqid
2. position
3. target allele frequency
4. integrated EHH (alternative)
5. integrated EHH (reference)
6. **iHS** ln(iEHHalt/iEHHref)
7. != 0 integration failure
8. != 0 integration failure
Params:
required: t,target <STRING> A zero base comma separated list of target
individuals corresponding to VCF columns
required: r,region <STRING> A tabix compliant genomic range
format: \[dq]seqid:start-end\[dq] or \[dq]seqid\[dq]
required: f,file <STRING> Proper formatted and phased VCF.
required: y,type <STRING> Genotype likelihood format: GT,PL,GL,GP
optional: a,af <DOUBLE> Alternative alleles with frquences less
than [0.05] are skipped.
optional: x,threads <INT> Number of CPUS [1].
recommended: g,gen <STRING> A PLINK formatted map file.
Type: statistics
\f[R]
.fi
.SH EXIT VALUES
.TP
\f[B]0\f[R]
Success
.TP
\f[B]not 0\f[R]
Failure
.SH SEE ALSO
.PP
\f[B]vcflib\f[R](1)
.SH OTHER
.SH LICENSE
.PP
Copyright 2011-2024 (C) Erik Garrison and vcflib contributors.
MIT licensed.
Copyright 2020-2024 (C) Pjotr Prins.
.SH AUTHORS
Erik Garrison and vcflib contributors.
|