File: README.source

package info (click to toggle)
vsearch 2.30.1-1
  • links: PTS, VCS
  • area: main
  • in suites: forky, sid
  • size: 23,200 kB
  • sloc: cpp: 30,213; ansic: 493; makefile: 260; sh: 101
file content (42 lines) | stat: -rw-r--r-- 1,569 bytes parent folder | download | duplicates (7)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
At the moment (v1.0.7) VSearch is somewhat awkward in that you need to choose the
compression mode of the input files at compile time.
I'm sure they'll sort this at some point, but for now we have to compile the
whole thing three times.

 -- Tim Booth <tbooth@ceh.ac.uk>  Fri, 09 Jan 2015 17:13:35 +0000


The file data/BioMarKs.fsa.gz has the very same content as
data/BioMarKs.fsa.bz2 but just a different compression method.
To save disk space the former is deleted from source tarball and
recreated in the test target where it is needed.

The same is true for data/PR2-18S-rRNA-V4.fsa which is just an
uncompressed copy of data/PR2-18S-rRNA-V4.fsa.bz2

Regarding the remaining data files here is the explanation requested by
ftpmaster according to
  http://lists.debian.org/debian-devel/2013/09/msg00332.html

Files: data/*
 The files and their origin are described in data/README.md
 .
 All the files aside from simm are from public repositories and as they
 are all natural DNA sequences and so they can't be copyrighted and
 therefore do not have a license associated.
 .
 In more detail -
 Rfam =  http://rfam.xfam.org/
 BioMarKs = http://www.biomarks.eu/
 PR2 = http://ssu-rrna.org/
 AF091148 = from EMBL-Bank
 http://www.ebi.ac.uk/Tools/dbfetch/emblfetch?style=html&Submit=Go&id=AF091148
 .
 simm = looks like simulated randomised data, presumably
        generated by the authors for testing.
 .
 Note that these are not needed for the software to function but are
 useful for testing.

 -- Andreas Tille <tille@debian.org>  Wed, 14 Jan 2015 10:40:22 +0000