File: control

package info (click to toggle)
any2fasta 0.4.2-2
links: PTS, VCS
area: main
in suites: bookworm, forky, sid, trixie
size: 23,064 kB
sloc: perl: 231; sh: 23; makefile: 2
file content (74 lines) | stat: -rw-r--r-- 2,908 bytes
Source: any2fasta
Section: science
Priority: optional
Maintainer: Debian Med Packaging Team <debian-med-packaging@lists.alioth.debian.org>
Uploaders: Andreas Tille <tille@debian.org>
Build-Depends: debhelper-compat (= 13)
Standards-Version: 4.6.2
Vcs-Browser: https://salsa.debian.org/med-team/any2fasta
Vcs-Git: https://salsa.debian.org/med-team/any2fasta.git
Homepage: https://github.com/tseemann/any2fasta
Rules-Requires-Root: no

Package: any2fasta
Architecture: all
Depends: ${perl:Depends},
         ${misc:Depends}
Multi-Arch: foreign
Description: convert various sequence formats to FASTA
 Established tools like readseq and seqret from EMBOSS, both create mangled
 IDs containing | or . characters, and there is no way to fix this behaviour.
 This resultes in inconsitences between .gbk and .fna versions of files in
 pipelines.
 .
 This script uses only core Perl modules, has no other dependencies like
 Bioperl or Biopython, and runs very quickly.
 .
 It supports the following input formats:
 .
  1. Genbank flat file, typically .gb, .gbk, .gbff (starts with LOCUS)
  2. EMBL flat file, typically .embl, (starts with ID)
  3. GFF with sequence, typically .gff, .gff3 (starts with ##gff)
  4. FASTA DNA, typically .fasta, .fa, .fna, .ffn (starts with >)
  5. FASTQ DNA, typically .fastq, .fq (starts with @)
  6. CLUSTAL alignments, typically .clw, .clu (starts with CLUSTAL or MUSCLE)
  7. STOCKHOLM alignments, typically .sth (starts with # STOCKHOLM)
  8. GFA assembly graph, typically .gfa (starts with ^[A-Z]\t)
 .
 Files may be compressed with:
 .
  1. gzip, typically .gz
  2. bzip2, typically .bz2
  3. zip, typically .zip

Package: any2fasta-examples
Architecture: all
Depends: ${misc:Depends}
Multi-Arch: foreign
Description: convert various sequence formats to FASTA (example data)
 Established tools like readseq and seqret from EMBOSS, both create mangled
 IDs containing | or . characters, and there is no way to fix this behaviour.
 This resultes in inconsitences between .gbk and .fna versions of files in
 pipelines.
 .
 This script uses only core Perl modules, has no other dependencies like
 Bioperl or Biopython, and runs very quickly.
 .
 It supports the following input formats:
 .
  1. Genbank flat file, typically .gb, .gbk, .gbff (starts with LOCUS)
  2. EMBL flat file, typically .embl, (starts with ID)
  3. GFF with sequence, typically .gff, .gff3 (starts with ##gff)
  4. FASTA DNA, typically .fasta, .fa, .fna, .ffn (starts with >)
  5. FASTQ DNA, typically .fastq, .fq (starts with @)
  6. CLUSTAL alignments, typically .clw, .clu (starts with CLUSTAL or MUSCLE)
  7. STOCKHOLM alignments, typically .sth (starts with # STOCKHOLM)
  8. GFA assembly graph, typically .gfa (starts with ^[A-Z]\t)
 .
 Files may be compressed with:
 .
  1. gzip, typically .gz
  2. bzip2, typically .bz2
  3. zip, typically .zip
 .
 This package contains some example data for testing.