File: alignment_app_controllers.rst

package info (click to toggle)
python-cogent 1.4.1-1.2
  • links: PTS, VCS
  • area: non-free
  • in suites: squeeze
  • size: 13,260 kB
  • ctags: 20,087
  • sloc: python: 116,163; ansic: 732; makefile: 74; sh: 9
file content (49 lines) | stat: -rw-r--r-- 2,302 bytes parent folder | download
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
Using alignment application controllers to align unaligned sequences
====================================================================

.. sectionauthor:: Daniel McDonald

This document provides examples of how to align sequences using the alignment application controllers. Each alignment application controller module provides the support method ``align_unaligned_seqs``. This method takes as input a ``SequenceCollection`` object or a dict mapping sequence ids to sequences, the ``MolType`` of the sequences, and an option dict containing specific parameter settings. As output, the method returns an ``Alignment`` object.

First, lets import all of the ``align_unaligned_seqs`` methods:

.. doctest::
    
    >>> from cogent.app.clustalw import align_unaligned_seqs as clustalw_align_unaligned_seqs
    >>> from cogent.app.muscle import align_unaligned_seqs as muscle_align_unaligned_seqs
    >>> from cogent.app.mafft import align_unaligned_seqs as mafft_align_unaligned_seqs

Next, we'll load our test data. We will be using DNA sequences for this example:

.. doctest::

    >>> from cogent.core.moltype import DNA
    >>> from cogent import LoadSeqs
    >>> unaligned_seqs = LoadSeqs(filename='data/test2.fasta', aligned=False)

Lets align some sequences using default parameters!

.. note:: Output is truncated for document formatting

.. doctest::
    
    >>> clustalw_aln = clustalw_align_unaligned_seqs(unaligned_seqs, DNA) 
    >>> muscle_aln = muscle_align_unaligned_seqs(unaligned_seqs, DNA) 
    >>> mafft_aln = mafft_align_unaligned_seqs(unaligned_seqs, DNA) 
    >>> clustalw_aln
    5 x 60 dna alignment: NineBande[------CGCCA...], Mouse[GCAGTGAGCCA...], ...
    >>> muscle_aln
    5 x 60 dna alignment: NineBande[------CGCCA...], Mouse[GCAGTGAGCCA...], ...
    >>> mafft_aln
    5 x 60 dna alignment: NineBande[------CGCCA...], Mouse[GCAGTGAGCCA...], ...

To change specific parameters, simply specify the parameters in a dict and pass it in:

.. note:: Output is truncated for document formatting

.. doctest::
    
    >>> clustalw_params = {'-gapopen':-3, '-quicktree':True}
    >>> clustalw_aln = clustalw_align_unaligned_seqs(unaligned_seqs, DNA, params=clustalw_params)
    >>> clustalw_aln
    5 x 60 dna alignment: NineBande[------CGCCA...], Mouse[GCAGTGAGCCA...], ...