File: ecolitst.bls

package info (click to toggle)
bioperl 1.7.1-2
  • links: PTS, VCS
  • area: main
  • in suites: stretch
  • size: 50,136 kB
  • sloc: perl: 172,618; xml: 22,869; lisp: 2,034; sh: 1,984; makefile: 19
file content (245 lines) | stat: -rw-r--r-- 11,275 bytes parent folder | download | duplicates (11)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
BLASTP 2.1.3 [Apr-11-2001]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= gi|1786183|gb|AAC73113.1| (AE000111) aspartokinase I,
homoserine dehydrogenase I [Escherichia coli]
         (820 letters)

Database: ecoli.aa
           4289 sequences; 1,358,990 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAC73113.1| (AE000111) aspartokinase I, homoserine dehydrogen...  1567  0.0
gb|AAC76922.1| (AE000468) aspartokinase II and homoserine dehydr...   332  1e-91
gb|AAC76994.1| (AE000475) aspartokinase III, lysine sensitive [E...   184  3e-47
gb|AAC73282.1| (AE000126) uridylate kinase [Escherichia coli]          42  3e-04

>gb|AAC73113.1| (AE000111) aspartokinase I, homoserine dehydrogenase I [Escherichia
           coli]
          Length = 820

 Score = 1567 bits (4058), Expect = 0.0
 Identities = 806/820 (98%), Positives = 806/820 (98%)

Query: 1   MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA 60
           MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA
Sbjct: 1   MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA 60

Query: 61  LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA 120
           LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA
Sbjct: 61  LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA 120

Query: 121 ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP 180
           ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP
Sbjct: 121 ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP 180

Query: 181 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240
           ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV
Sbjct: 181 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240

Query: 241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300
           PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD
Sbjct: 241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300

Query: 301 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF 360
           EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF
Sbjct: 301 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF 360

Query: 361 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL 420
           CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL
Sbjct: 361 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL 420

Query: 421 ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQXXXXXXXXXXXXXXAL 480
           ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQ              AL
Sbjct: 421 ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIGVGGVGGAL 480

Query: 481 LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL 540
           LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL
Sbjct: 481 LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL 540

Query: 541 VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR 600
           VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR
Sbjct: 541 VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR 600

Query: 601 RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA 660
           RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA
Sbjct: 601 RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA 660

Query: 661 REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA 720
           REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA
Sbjct: 661 REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA 720

Query: 721 NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF 780
           NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF
Sbjct: 721 NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF 780

Query: 781 YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV 820
           YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV
Sbjct: 781 YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV 820


>gb|AAC76922.1| (AE000468) aspartokinase II and homoserine dehydrogenase II
           [Escherichia coli]
          Length = 810

 Score =  332 bits (850), Expect = 1e-91
 Identities = 243/821 (29%), Positives = 403/821 (48%), Gaps = 44/821 (5%)

Query: 5   KFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDALPNI 64
           KFGG+S+A+ + +LRVA I+   ++   +  V+SA    TN L+  ++ + + + +   +
Sbjct: 16  KFGGSSLADVKCYLRVAGIMAEYSQPDDMM-VVSAAGSTTNQLINWLKLSQTDRLSAHQV 74

Query: 65  SDAERIF-AELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINAALI 123
               R +  +L++GL  A+    L  +  FV         +  GI+      D++ A ++
Sbjct: 75  QQTLRRYQCDLISGLLPAEEADSL--ISAFVSDLERLAALLDSGIN------DAVYAEVV 126

Query: 124 CRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIPADH 183
             GE  S  +M+ VL  +G     +D  E L A      +   + E        ++   H
Sbjct: 127 GHGEVWSARLMSAVLNQQGLPAAWLDAREFLRAER---AAQPQVDEGLSYPLLQQLLVQH 183

Query: 184 ---MVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240
               +++ GF + N  GE V+LGRNGSDYSA  + A        IW+DV GVY+ DPR+V
Sbjct: 184 PGKRLVVTGFISRNNAGETVLLGRNGSDYSATQIGALAGVSRVTIWSDVAGVYSADPRKV 243

Query: 241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300
            DA LL  +   EA EL+   A VLH RT+ P++  +I   ++ +  P       G++R 
Sbjct: 244 KDACLLPLLRLDEASELARLAAPVLHARTLQPVSGSEIDLQLRCSYTPDQ-----GSTRI 298

Query: 301 EDELP----VKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEY 356
           E  L      + +++ +++ +     P  +        +   + RA++  + +   +   
Sbjct: 299 ERVLASGTGARIVTSHDDVCLIEFQVPASQDFKLAHKEIDQILKRAQVRPLAVGVHNDRQ 358

Query: 357 SISFCVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKF 416
            + FC        A + + E        GL   L + + LA++++VG G+        +F
Sbjct: 359 LLQFCYTSEVADSALKILDEA-------GLPGELRLRQGLALVAMVGAGVTRNPLHCHRF 411

Query: 417 FAALARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQXXXXXXXXXXXX 476
           +  L    +      Q     S+  V+      + ++  HQ +F  ++            
Sbjct: 412 WQQLKGQPVEFTW--QSDDGISLVAVLRTGPTESLIQGLHQSVFRAEKRIGLVLFGKGNI 469

Query: 477 XXALLEQLKRQQSWLKNKH-IDLRVCGVANSKALLTNVHGLN----LENWQEELAQAKEP 531
               LE   R+QS L  +   +  + GV +S+  L +  GL+    L  + +E  +  E 
Sbjct: 470 GSRWLELFAREQSTLSARTGFEFVLAGVVDSRRSLLSYDGLDASRALAFFNDEAVEQDEE 529

Query: 532 FNLGRLIRLVKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQ 591
                L   ++ +   + V++D T+SQ +ADQY DF   GFHV++ NK A  S  + Y Q
Sbjct: 530 ----SLFLWMRAHPYDDLVVLDVTASQQLADQYLDFASHGFHVISANKLAGASDSNKYRQ 585

Query: 592 LRYAAEKSRRKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGM 651
           +  A EK+ R +LY+  VGAGLP+   +++L+++GD ++  SGI SG+LS++F + D  +
Sbjct: 586 IHDAFEKTGRHWLYNATVGAGLPINHTVRDLIDSGDTILSISGIFSGTLSWLFLQFDGSV 645

Query: 652 SFSEATTLAREMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNA 711
            F+E    A + G TEPDPRDDLSG DV RKL+ILARE G  +E   + +E ++PA    
Sbjct: 646 PFTELVDQAWQQGLTEPDPRDDLSGKDVMRKLVILAREAGYNIEPDQVRVESLVPAHCEG 705

Query: 712 EGDVAAFMANLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKV 771
            G +  F  N  +L++    R+  AR+ G VLRYV   D +G  RV +  V  + PL  +
Sbjct: 706 -GSIDHFFENGDELNEQMVQRLEAAREMGLVLRYVARFDANGKARVGVEAVREDHPLASL 764

Query: 772 KNGENALAFYSHYYQPLPLVLRGYGAGNDVTAAGVFADLLR 812
              +N  A  S +Y+  PLV+RG GAG DVTA  + +D+ R
Sbjct: 765 LPCDNVFAIESRWYRDNPLVIRGPGAGRDVTAGAIQSDINR 805


>gb|AAC76994.1| (AE000475) aspartokinase III, lysine sensitive [Escherichia coli]
          Length = 449

 Score =  184 bits (467), Expect = 3e-47
 Identities = 142/471 (30%), Positives = 228/471 (48%), Gaps = 41/471 (8%)

Query: 3   VLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQ---- 58
           V KFGGTSVA+ +   R ADI+ S+A    V  VLSA A ITN LVA+ E    G+    
Sbjct: 6   VSKFGGTSVADFDAMNRSADIVLSDANVRLV--VLSASAGITNLLVALAEGLEPGERFEK 63

Query: 59  -DALPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCP-- 115
            DA+ NI                    F + +   + +    +I+ +L  I++L +    
Sbjct: 64  LDAIRNIQ-------------------FAILERLRYPNVIREEIERLLENITVLAEAAAL 104

Query: 116 ---DSINAALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTR 172
               ++   L+  GE MS  +   +L  R       D  + +     +  +  DIA    
Sbjct: 105 ATSPALTDELVSHGELMSTLLFVEILRERDVQAQWFDVRKVMRTNDRFGRAEPDIAALAE 164

Query: 173 RIAASRIPA--DHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVD 230
             A   +P   + +V+  GF     KG    LGR GSDY+AA+LA  L A   +IWTDV 
Sbjct: 165 LAALQLLPRLNEGLVITQGFIGSENKGRTTTLGRGGSDYTAALLAEALHASRVDIWTDVP 224

Query: 231 GVYTCDPRQVPDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQA 290
           G+YT DPR V  A+ +  +++ EA E++ FGAKVLHP T+ P  +  IP  + ++ +P+A
Sbjct: 225 GIYTTDPRVVSAAKRIDEIAFAEAAEMATFGAKVLHPATLLPAVRSDIPVFVGSSKDPRA 284

Query: 291 PGTLIGASRDEDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLIT 350
            GTL+  ++ E+    + ++   N  + ++    M    G  A VF  ++R  ISV LIT
Sbjct: 285 GGTLV-CNKTENPPLFRALALRRNQTLLTLHSLNMLHSRGFLAEVFGILARHNISVDLIT 343

Query: 351 QSSSEYSISFCVPQSDCV-RAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTL 409
             +SE S++  +  +      +  + +   +EL    L  + V E LA+++++G+ +   
Sbjct: 344 --TSEVSVALTLDTTGSTSTGDTLLTQSLLMEL--SALCRVEVEEGLALVALIGNDLSKA 399

Query: 410 RGISAKFFAALARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLF 460
            G+  + F  L   NI +  I  G+S  ++  +V  +DA   V+  H  LF
Sbjct: 400 CGVGKEVFGVLEPFNIRM--ICYGASSHNLCFLVPGEDAEQVVQKLHSNLF 448


>gb|AAC73282.1| (AE000126) uridylate kinase [Escherichia coli]
          Length = 241

 Score = 41.6 bits (96), Expect = 3e-04
 Identities = 28/97 (28%), Positives = 44/97 (44%), Gaps = 8/97 (8%)

Query: 199 LVVLGRNGSDYSAAVLAACLR-----ADCCEIWTDVDGVYTCDPRQVPDARLLKSMSYQE 253
           +++    G+ +     AACLR     AD     T VDGV+T DP + P A + + ++Y E
Sbjct: 132 VILSAGTGNPFFTTDSAACLRGIEIEADVVLKATKVDGVFTADPAKDPTATMYEQLTYSE 191

Query: 254 AMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQA 290
            +E      KV+     T     ++P  + N   P A
Sbjct: 192 VLEKE---LKVMDLAAFTLARDHKLPIRVFNMNKPGA 225


  Database: ecoli.aa
    Posted date:  Dec 6, 2001  1:58 PM
  Number of letters in database: 1,358,990
  Number of sequences in database:  4289
  
Lambda     K      H
   0.319    0.135    0.383 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2022122
Number of Sequences: 4289
Number of extensions: 82424
Number of successful extensions: 256
Number of sequences better than 1.0e-03: 4
Number of HSP's better than  0.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 243
Number of HSP's gapped (non-prelim): 4
length of query: 820
length of database: 1,358,990
effective HSP length: 47
effective length of query: 773
effective length of database: 1,157,407
effective search space: 894675611
effective search space used: 894675611
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 92 (40.0 bits)