File: alignment_app_controllers.rst

package info (click to toggle)
python-cogent 1.5.3-2
  • links: PTS, VCS
  • area: main
  • in suites: jessie, jessie-kfreebsd
  • size: 16,424 kB
  • ctags: 24,343
  • sloc: python: 134,200; makefile: 100; ansic: 17; sh: 10
file content (51 lines) | stat: -rw-r--r-- 2,330 bytes parent folder | download | duplicates (4)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
.. _alignment-controllers:

Using alignment application controllers to align unaligned sequences
====================================================================

.. sectionauthor:: Daniel McDonald

This document provides examples of how to align sequences using the alignment application controllers. Each alignment application controller module provides the support method ``align_unaligned_seqs``. This method takes as input a ``SequenceCollection`` object or a dict mapping sequence ids to sequences, the ``MolType`` of the sequences, and an option dict containing specific parameter settings. As output, the method returns an ``Alignment`` object.

First, lets import all of the ``align_unaligned_seqs`` methods:

.. doctest::
    
    >>> from cogent.app.clustalw import align_unaligned_seqs as clustalw_align_unaligned_seqs
    >>> from cogent.app.muscle import align_unaligned_seqs as muscle_align_unaligned_seqs
    >>> from cogent.app.mafft import align_unaligned_seqs as mafft_align_unaligned_seqs

Next, we'll load our test data. We will be using DNA sequences for this example:

.. doctest::

    >>> from cogent.core.moltype import DNA
    >>> from cogent import LoadSeqs
    >>> unaligned_seqs = LoadSeqs(filename='data/test2.fasta', aligned=False)

Lets align some sequences using default parameters!

.. note:: Output is truncated for document formatting

.. doctest::
    
    >>> clustalw_aln = clustalw_align_unaligned_seqs(unaligned_seqs, DNA) 
    >>> muscle_aln = muscle_align_unaligned_seqs(unaligned_seqs, DNA) 
    >>> mafft_aln = mafft_align_unaligned_seqs(unaligned_seqs, DNA) 
    >>> clustalw_aln
    5 x 60 dna alignment: NineBande[------CGCCA...], Mouse[GCAGTGAGCCA...], ...
    >>> muscle_aln
    5 x 60 dna alignment: NineBande[------CGCCA...], Mouse[GCAGTGAGCCA...], ...
    >>> mafft_aln
    5 x 60 dna alignment: NineBande[------CGCCA...], Mouse[GCAGTGAGCCA...], ...

To change specific parameters, simply specify the parameters in a dict and pass it in:

.. note:: Output is truncated for document formatting

.. doctest::
    
    >>> clustalw_params = {'-gapopen':-3, '-quicktree':True}
    >>> clustalw_aln = clustalw_align_unaligned_seqs(unaligned_seqs, DNA, params=clustalw_params)
    >>> clustalw_aln
    5 x 60 dna alignment: NineBande[------CGCCA...], Mouse[GCAGTGAGCCA...], ...