File: uniq.lua

package info (click to toggle)
genometools 1.6.2%2Bds-3
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 50,504 kB
  • sloc: ansic: 271,868; ruby: 30,327; python: 4,942; sh: 3,230; makefile: 1,214; perl: 219; pascal: 159; haskell: 37; sed: 5
file content (17 lines) | stat: -rw-r--r-- 825 bytes parent folder | download | duplicates (9)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
print ([[

A depth-first traversal of a feature node graph starts at the top-level feature
node (or pseudo-node) and explores as far along each branch as possible before
backtracking. Let's assume that the feature nodes are stored in a list in the
order of their traversal (called the ``feature node list'').

Two feature node graphs are considered to be repeated if their feature node list
(from the depth-first traversal) have the same length and each feature node pair
(from both lists at the same position) is ``similar''.

Two feature nodes are ``similar'', if they have the same sequence ID, feature
type, range, strand, and phase.

For such a repeated feature node graph the one with the higher score (of the
top-level feature) is kept. If only one of the feature node graphs has a defined
score, this one is kept.]])