File: readme

package info (click to toggle)
bmagic 6.3.0-1
  • links: PTS
  • area: main
  • in suites: bookworm, bullseye, sid, trixie
  • size: 49,956 kB
  • sloc: cpp: 84,298; ansic: 9,703; sh: 1,664; makefile: 742
file content (30 lines) | stat: -rwxr-xr-x 931 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
The goal here is to compress/search save bioinfromatics data on nucleotide polymorphism using BitMagic Library.
Tech.notes:
http://bitmagic.io/succinct-snp-search.html

SNP RS columnar sparse vector example:

1. Download Human SNP report from NCBI FTP
ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606/chr_rpts/

2. Take Chr1 for a large experiment
chr_1.txt.gz
ftp://ftp.ncbi.nlm.nih.gov/snp/organisms/human_9606/chr_rpts/chr_1.txt.gz

3. unzip the SNP report file
unzip chr_1.txt.gz

4. Load/parse data into compressed sparse vector
./xsample03 -isnp chr_1.txt -svout rs_chr1.sv

(wait, wait, wait, ...) 

5. Build RS compressed sparse vector
./xsample03 -svin rs_chr1.sv -rscout rs_chr1.rssv  -t

6. Print diagnostics (memory usage, statistics, etc)
./xsample03 -svin rs_chr1.sv -rscin rs_chr1.rssv  -t -d

7. Search performance benchmark
./xsample03 -svin rs_chr1.sv -rscin rs_chr1.rssv  -b -t