File: test_seq

python-biopython 1.45-3
test_seq

Testing Seq
===========
TCAAAAGGATGCATCATG
18
T
G
AA
Reverse using -1 stride: Seq('GTACTACGTAGGAAAACT', IUPACUnambiguousDNA())
Extract every third nucleotide (slicing with stride 3):
Seq('TAGTAA', IUPACUnambiguousDNA())
Seq('CAGGTT', IUPACUnambiguousDNA())
Seq('AAACCG', IUPACUnambiguousDNA())
GATC
IUPACUnambiguousDNA()
19
expected error, and got it
IUPACAmbiguousDNA()
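
The slicing results above can be reproduced on a plain Python string. This is a minimal sketch that assumes no Biopython objects at all: `dna` is an ordinary `str` standing in for the Seq under test, so it mirrors the sequence data but not the `Seq(..., alphabet)` repr.

```python
# Plain-string stand-in for the Seq under test (assumption: no alphabet attached).
dna = "TCAAAAGGATGCATCATG"

length = len(dna)              # number of nucleotides
first, last = dna[0], dna[-1]  # first and last letters
reversed_dna = dna[::-1]       # a stride of -1 walks the sequence backwards

# One slice per codon position: offsets 0, 1 and 2, each with stride 3.
codon_positions = [dna[offset::3] for offset in range(3)]
```
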

Testing MutableSeq
==================
Testing creating MutableSeqs in multiple ways
MutableSeq('TCAAAAGGATGCATCATG', IUPACAmbiguousDNA())
TCAAAAGGATGCATCATG
18
Seq('TCAAAAGGATGCATCATG', IUPACAmbiguousDNA())
T
MutableSeq('CAAA', IUPACAmbiguousDNA())
Set slice with string: MutableSeq('TGATAAAGGATGCATCATG', IUPACAmbiguousDNA())
Set slice with MutableSeq: MutableSeq('TAATAAAGGATGCATCATG', IUPACAmbiguousDNA())
Set slice with array: MutableSeq('TGATTAAAGGATGCATCATG', IUPACAmbiguousDNA())
Set item: MutableSeq('TGAGTAAAGGATGCATCATG', IUPACAmbiguousDNA())
Delete slice: MutableSeq('TGAGAAAGGATGCATCATG', IUPACAmbiguousDNA())
Delete item: MutableSeq('TGAAAAGGATGCATCATG', IUPACAmbiguousDNA())
Append: MutableSeq('TGAAAAGGATGCATCATGC', IUPACAmbiguousDNA())
Insert: MutableSeq('TGAAGAAGGATGCATCATGC', IUPACAmbiguousDNA())
Pop off the last item: C
Removed Gs: MutableSeq('TAAGAAGGATGCATCATG', IUPACAmbiguousDNA())
Expected value error and got it
A count: 7
A index: 1
Reversed Seq: MutableSeq('GTACTACGTAGGAAGAAT', IUPACAmbiguousDNA())
Reverse using -1 stride: MutableSeq('TAAGAAGGATGCATCATG', IUPACAmbiguousDNA())
Extended Seq: MutableSeq('GTACTACGTAGGAAGAATGATTTT', IUPACAmbiguousDNA())
Delete stride slice: MutableSeq('GTACTACGTAGGAAGAATGATTTT', IUPACAmbiguousDNA())
Extract every third nucleotide (slicing with stride 3):
MutableSeq('GCCAAAGT', IUPACAmbiguousDNA())
MutableSeq('TTGGAAAT', IUPACAmbiguousDNA())
MutableSeq('AATGGTTT', IUPACAmbiguousDNA())
Setting wobble codon to N (set slice with stride 3):
MutableSeq('GTNCTNCGNAGNAANAANGANTTN', IUPACAmbiguousDNA())
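
The wobble-position masking shown in the last line can be sketched with a plain list of characters. The list is a hypothetical stand-in for MutableSeq, chosen only because it supports the same extended-slice assignment; it is not the class exercised by the test.

```python
# A list of single characters stands in for MutableSeq (assumption for illustration).
mutable = list("GTACTACGTAGGAAGAATGATTTT")

# Extended-slice assignment requires a replacement with exactly as many
# items as the slice being overwritten.
wobble = mutable[2::3]
mutable[2::3] = "N" * len(wobble)

masked = "".join(mutable)
```
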

Checking ambiguous complements
==============================

DNA Ambiguity mapping: {'A': 'A', 'C': 'C', 'B': 'CGT', 'D': 'AGT', 'G': 'G', 'H': 'ACT', 'K': 'GT', 'M': 'AC', 'N': 'GATC', 'S': 'CG', 'R': 'AG', 'T': 'T', 'W': 'AT', 'V': 'ACG', 'Y': 'CT', 'X': 'GATC'}
DNA Complement mapping: {'A': 'T', 'C': 'G', 'B': 'V', 'D': 'H', 'G': 'C', 'H': 'D', 'K': 'M', 'M': 'K', 'N': 'N', 'S': 'S', 'R': 'Y', 'T': 'A', 'W': 'W', 'V': 'B', 'Y': 'R', 'X': 'X'}
A={A} --> {T}=T
C={C} --> {G}=G
B={CGT} --> {GCA}=V
D={AGT} --> {TCA}=H
G={G} --> {C}=C
H={ACT} --> {TGA}=D
K={GT} --> {CA}=M
M={AC} --> {TG}=K
N={GATC} --> {CTAG}=N
S={CG} --> {GC}=S
R={AG} --> {TC}=Y
T={T} --> {A}=A
W={AT} --> {TA}=W
V={ACG} --> {TGC}=B
Y={CT} --> {GA}=R
X={GATC} --> {CTAG}=X
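
Each line above pairs an ambiguity code with the complement of its base set. A sketch of that consistency check follows, with both dictionaries copied from the DNA mappings printed above; the variable and function names are illustrative and are not taken from the test script.

```python
# Values copied from the "DNA Ambiguity mapping" line above.
ambiguity = {
    "A": "A", "C": "C", "B": "CGT", "D": "AGT", "G": "G", "H": "ACT",
    "K": "GT", "M": "AC", "N": "GATC", "S": "CG", "R": "AG", "T": "T",
    "W": "AT", "V": "ACG", "Y": "CT", "X": "GATC",
}
# Values copied from the "DNA Complement mapping" line above.
complement = {
    "A": "T", "C": "G", "B": "V", "D": "H", "G": "C", "H": "D",
    "K": "M", "M": "K", "N": "N", "S": "S", "R": "Y", "T": "A",
    "W": "W", "V": "B", "Y": "R", "X": "X",
}

def complement_is_consistent(code):
    """True if complementing every base of `code` yields the base set
    of the code that `complement` assigns to it."""
    complemented = {complement[base] for base in ambiguity[code]}
    expected = set(ambiguity[complement[code]])
    return complemented == expected

all_consistent = all(complement_is_consistent(code) for code in ambiguity)
```
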

RNA Ambiguity mapping: {'A': 'A', 'C': 'C', 'B': 'CGU', 'D': 'AGU', 'G': 'G', 'H': 'ACU', 'K': 'GU', 'M': 'AC', 'N': 'GAUC', 'S': 'CG', 'R': 'AG', 'U': 'U', 'W': 'AU', 'V': 'ACG', 'Y': 'CU', 'X': 'GAUC'}
RNA Complement mapping: {'A': 'U', 'C': 'G', 'B': 'V', 'D': 'H', 'G': 'C', 'H': 'D', 'K': 'M', 'M': 'K', 'N': 'N', 'S': 'S', 'R': 'Y', 'U': 'A', 'W': 'W', 'V': 'B', 'Y': 'R', 'X': 'X'}
A={A} --> {U}=U
C={C} --> {G}=G
B={CGU} --> {GCA}=V
D={AGU} --> {UCA}=H
G={G} --> {C}=C
H={ACU} --> {UGA}=D
K={GU} --> {CA}=M
M={AC} --> {UG}=K
N={GAUC} --> {CUAG}=N
S={CG} --> {GC}=S
R={AG} --> {UC}=Y
U={U} --> {A}=A
W={AU} --> {UA}=W
V={ACG} --> {UGC}=B
Y={CU} --> {GA}=R
X={GAUC} --> {CUAG}=X

Reverse complements:
Seq('ACBDGHKMNSRUWVYX', Alphabet()) -> Seq('XRBWAYSNKMDCHVGU', Alphabet())
Seq('ACBDGHKMNSRTWVYX', Alphabet()) -> Seq('XRBWAYSNKMDCHVGT', Alphabet())
Seq('ACBDGHKMNSRUWVYX', RNAAlphabet()) -> Seq('XRBWAYSNKMDCHVGU', RNAAlphabet())
Seq('ACBDGHKMNSRTWVYX', DNAAlphabet()) -> Seq('XRBWAYSNKMDCHVGT', DNAAlphabet())
Seq('ACBDGHKMNSRUWVYX', IUPACAmbiguousRNA()) -> Seq('XRBWAYSNKMDCHVGU', IUPACAmbiguousRNA())
Seq('ACBDGHKMNSRTWVYX', IUPACAmbiguousDNA()) -> Seq('XRBWAYSNKMDCHVGT', IUPACAmbiguousDNA())
Seq('AWGAARCKG', Alphabet()) -> Seq('CMGYTTCWT', Alphabet())
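
The DNA cases above can be reproduced with a translation table built from the complement mapping printed earlier. This is a sketch on plain strings, not the library's implementation; `reverse_complement_dna` is a hypothetical helper name.

```python
# Letter pairs taken from the DNA complement mapping printed earlier.
DNA_AMBIGUOUS_COMPLEMENT = str.maketrans(
    "ACBDGHKMNSRTWVYX",
    "TGVHCDMKNSYAWBRX",
)

def reverse_complement_dna(seq):
    """Complement every letter, then reverse the result."""
    return seq.translate(DNA_AMBIGUOUS_COMPLEMENT)[::-1]

all_codes = reverse_complement_dna("ACBDGHKMNSRTWVYX")
mixed = reverse_complement_dna("AWGAARCKG")
```
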


Transcribe DNA into RNA
=======================
Seq('TCAAAAGGATGCATCATG', IUPACUnambiguousDNA()) -> Seq('UCAAAAGGAUGCAUCAUG', IUPACUnambiguousRNA())
Seq('T', IUPACAmbiguousDNA()) -> Seq('U', IUPACAmbiguousRNA())
Seq('TCAAAAGGATGCATCATGT', IUPACAmbiguousDNA()) -> Seq('UCAAAAGGAUGCAUCAUGU', IUPACAmbiguousRNA())
Seq('ATGAAACTG', Alphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
Seq('AUGAAACUG', DNAAlphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
Seq('ATGAAACTG', RNAAlphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
Seq('ATGAAACTG', NucleotideAlphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
Seq('AUGAAACTG', NucleotideAlphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
MutableSeq('ATGAAACTG', RNAAlphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
Seq('ACTGTCGTCT', ProteinAlphabet()) -> Proteins cannot be transcribed!
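
On the letter level, the successful transcriptions above amount to replacing T with U. A minimal sketch on plain strings follows; it assumes no alphabet checking, so it does not reproduce the protein error in the last line.

```python
def transcribe(dna):
    """Return the RNA string for a coding-strand DNA string (T becomes U)."""
    return dna.replace("T", "U").replace("t", "u")

rna = transcribe("TCAAAAGGATGCATCATG")
mixed = transcribe("AUGAAACTG")  # existing U letters are left untouched
```
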

Back-transcribe RNA into DNA
============================
Seq('TCAAAAGGATGCATCATG', IUPACUnambiguousDNA()) -> Seq('UCAAAAGGAUGCAUCAUG', IUPACUnambiguousRNA())
Seq('T', IUPACAmbiguousDNA()) -> Seq('U', IUPACAmbiguousRNA())
Seq('TCAAAAGGATGCATCATGT', IUPACAmbiguousDNA()) -> Seq('UCAAAAGGAUGCAUCAUGU', IUPACAmbiguousRNA())
Seq('ATGAAACTG', Alphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
Seq('AUGAAACUG', DNAAlphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
Seq('ATGAAACTG', RNAAlphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
Seq('ATGAAACTG', NucleotideAlphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
Seq('AUGAAACTG', NucleotideAlphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
MutableSeq('ATGAAACTG', RNAAlphabet()) -> Seq('AUGAAACUG', RNAAlphabet())
Seq('ACTGTCGTCT', ProteinAlphabet()) -> Proteins cannot be transcribed!

Reverse Complement
==================
Seq('TCAAAAGGATGCATCATG', IUPACUnambiguousDNA())
-> Seq('CATGATGCATCCTTTTGA', IUPACUnambiguousDNA())
Seq('T', IUPACAmbiguousDNA())
-> Seq('A', IUPACAmbiguousDNA())
Seq('TCAAAAGGATGCATCATGT', IUPACAmbiguousDNA())
-> Seq('ACATGATGCATCCTTTTGA', IUPACAmbiguousDNA())
Seq('ATGAAACTG', Alphabet())
-> Seq('CAGTTTCAT', Alphabet())
Seq('AUGAAACUG', DNAAlphabet())
-> Seq('CUGTTTCUT', DNAAlphabet())
Seq('ATGAAACTG', RNAAlphabet())
-> Seq('CTGUUUCTU', RNAAlphabet())
Seq('ATGAAACTG', NucleotideAlphabet())
-> Seq('CAGTTTCAT', NucleotideAlphabet())
Seq('AUGAAACTG', NucleotideAlphabet())
-> Seq('CTGUUUCAU', NucleotideAlphabet())
MutableSeq('ATGAAACTG', RNAAlphabet())
-> Seq('CTGUUUCTU', RNAAlphabet())
Seq('ACTGTCGTCT', ProteinAlphabet())
-> Proteins do not have complements!

Translating
===========
Seq('TCAAAAGGATGCATCATG', IUPACUnambiguousDNA())
-> Seq('SKGCIM', HasStopCodon(IUPACProtein(), '*'))
Seq('T', IUPACAmbiguousDNA())
-> Seq('', HasStopCodon(ExtendedIUPACProtein(), '*'))
Seq('TCAAAAGGATGCATCATGT', IUPACAmbiguousDNA())
-> Seq('SKGCIM', HasStopCodon(ExtendedIUPACProtein(), '*'))
Seq('ATGAAACTG', Alphabet())
-> Seq('MKL', HasStopCodon(IUPACProtein(), '*'))
Seq('AUGAAACUG', DNAAlphabet())
-> Seq('MKL', HasStopCodon(IUPACProtein(), '*'))
Seq('ATGAAACTG', RNAAlphabet())
-> Seq('MKL', HasStopCodon(IUPACProtein(), '*'))
Seq('ATGAAACTG', NucleotideAlphabet())
-> Seq('MKL', HasStopCodon(IUPACProtein(), '*'))
Seq('AUGAAACTG', NucleotideAlphabet())
-> Seq('MKL', HasStopCodon(IUPACProtein(), '*'))
MutableSeq('ATGAAACTG', RNAAlphabet())
-> Seq('MKL', HasStopCodon(IUPACProtein(), '*'))
Seq('ACTGTCGTCT', ProteinAlphabet())
-> Proteins cannot be translated!
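
The translations above are consistent with the standard genetic code applied codon by codon, with any trailing partial codon dropped. The sketch below assumes the standard table and plain strings; `translate` here is a hypothetical helper that performs no alphabet checks, and it raises `KeyError` on ambiguity codes.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq):
    """Translate complete codons only; U is treated as T."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
```
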

Seq's .complement() method
==========================
Seq('TCAAAAGGATGCATCATG', IUPACUnambiguousDNA()) -> Seq('AGTTTTCCTACGTAGTAC', IUPACUnambiguousDNA())
Seq('T', IUPACAmbiguousDNA()) -> Seq('A', IUPACAmbiguousDNA())
Seq('TCAAAAGGATGCATCATGT', IUPACAmbiguousDNA()) -> Seq('AGTTTTCCTACGTAGTACA', IUPACAmbiguousDNA())
Seq('ATGAAACTG', Alphabet()) -> Seq('TACTTTGAC', Alphabet())
Seq('AUGAAACUG', DNAAlphabet()) -> Seq('TUCTTTGUC', DNAAlphabet())
Seq('ATGAAACTG', RNAAlphabet()) -> Seq('UTCUUUGTC', RNAAlphabet())
Seq('ATGAAACTG', NucleotideAlphabet()) -> Seq('TACTTTGAC', NucleotideAlphabet())
Seq('AUGAAACTG', NucleotideAlphabet()) -> Seq('UACUUUGTC', NucleotideAlphabet())
Seq('ACTGTCGTCT', ProteinAlphabet()) -> Proteins do not have complements!
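
The lines above suggest that the complement table follows the declared alphabet, and that letters missing from that table pass through unchanged (for example the U in the DNAAlphabet case). The sketch below encodes that reading; the rule for the generic alphabets (use the RNA table when a U is present) is inferred from the printed output only and is an assumption, as are the `kind` parameter and the function name.

```python
DNA_TABLE = str.maketrans("ACGT", "TGCA")
RNA_TABLE = str.maketrans("ACGU", "UGCA")

def complement(seq, kind=None):
    """Complement a plain string.

    kind is "dna", "rna", or None to guess from the letters
    (assumption: a U anywhere in the sequence means RNA).
    """
    if kind is None:
        kind = "rna" if "U" in seq else "dna"
    table = RNA_TABLE if kind == "rna" else DNA_TABLE
    # str.translate leaves letters that are absent from the table as they are.
    return seq.translate(table)
```
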

Seq's .reverse_complement() method
==================================
Seq('TCAAAAGGATGCATCATG', IUPACUnambiguousDNA()) -> Seq('CATGATGCATCCTTTTGA', IUPACUnambiguousDNA())
Seq('T', IUPACAmbiguousDNA()) -> Seq('A', IUPACAmbiguousDNA())
Seq('TCAAAAGGATGCATCATGT', IUPACAmbiguousDNA()) -> Seq('ACATGATGCATCCTTTTGA', IUPACAmbiguousDNA())
Seq('ATGAAACTG', Alphabet()) -> Seq('CAGTTTCAT', Alphabet())
Seq('AUGAAACUG', DNAAlphabet()) -> Seq('CUGTTTCUT', DNAAlphabet())
Seq('ATGAAACTG', RNAAlphabet()) -> Seq('CTGUUUCTU', RNAAlphabet())
Seq('ATGAAACTG', NucleotideAlphabet()) -> Seq('CAGTTTCAT', NucleotideAlphabet())
Seq('AUGAAACTG', NucleotideAlphabet()) -> Seq('CTGUUUCAU', NucleotideAlphabet())
Seq('ACTGTCGTCT', ProteinAlphabet()) -> Proteins do not have complements!