1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42
|
At the moment (v1.0.7) VSearch is somewhat awkward in that you need to choose the
compression mode of the input files at compile time.
I'm sure they'll sort this at some point, but for now we have to compile the
whole thing three times.
-- Tim Booth <tbooth@ceh.ac.uk> Fri, 09 Jan 2015 17:13:35 +0000
The file data/BioMarKs.fsa.gz has the very same content as
data/BioMarKs.fsa.bz2 but just a different compression method.
To save disk space the former is deleted from source tarball and
recreated in the test target where it is needed.
The same is true for data/PR2-18S-rRNA-V4.fsa which is just an
uncompressed copy of data/PR2-18S-rRNA-V4.fsa.bz2
Regarding the remaining data files here is the explanation requested by
ftpmaster according to
http://lists.debian.org/debian-devel/2013/09/msg00332.html
Files: data/*
The files and their origin are described in data/README.md
.
All the files aside from simm are from public repositories and as they
are all natural DNA sequences and so they can't be copyrighted and
therefore do not have a license associated.
.
In more detail -
Rfam = http://rfam.xfam.org/
BioMarKs = http://www.biomarks.eu/
PR2 = http://ssu-rrna.org/
AF091148 = from EMBL-Bank
http://www.ebi.ac.uk/Tools/dbfetch/emblfetch?style=html&Submit=Go&id=AF091148
.
simm = looks like simulated randomised data, presumably
generated by the authors for testing.
.
Note that these are not needed for the software to function but are
useful for testing.
-- Andreas Tille <tille@debian.org> Wed, 14 Jan 2015 10:40:22 +0000
|