File: yass.seed

package info (click to toggle)
last-align 490-1
  • links: PTS, VCS
  • area: main
  • in suites: jessie, jessie-kfreebsd
  • size: 2,196 kB
  • ctags: 2,317
  • sloc: cpp: 20,908; python: 1,636; ansic: 1,466; makefile: 365; sh: 107
file content (33 lines) | stat: -rw-r--r-- 919 bytes parent folder | download | duplicates (3)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
# This subset seed was described in:
# "YASS: enhancing the sensitivity of DNA similarity search"
# by L NoƩ & G Kucherov, NAR 2005 33:W540-W543.
# According to them, it provides a good compromise in detecting
# similarities for both coding and non-coding DNA.

# In positions with "AG CT", A and G are treated as equivalent, and C
# and T are treated as equivalent (i.e. transitions are tolerated).
# In positions with "ACGT", all four bases are treated as equivalent
# (i.e. mismatches are tolerated).  In positions with "A C G T", only
# exact matches are allowed.  Letters that don't appear (e.g. N) are
# not allowed to match.

# (The final three positions of the YASS seed are commented out here,
# because they are the same as the first three positions, and LAST
# cyclically repeats the seed anyway.)

A C G T
AG CT
A C G T
ACGT
ACGT
A C G T
A C G T
ACGT
ACGT
A C G T
ACGT
A C G T

# A C G T
# AG CT
# A C G T