File: ecolitst.wublastp

package info (click to toggle)
bioperl 1.7.1-2
  • links: PTS, VCS
  • area: main
  • in suites: stretch
  • size: 50,136 kB
  • sloc: perl: 172,618; xml: 22,869; lisp: 2,034; sh: 1,984; makefile: 19
file content (266 lines) | stat: -rw-r--r-- 13,047 bytes parent folder | download | duplicates (11)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
BLASTP 2.0MP-WashU [12-Feb-2001] [linux-i686 01:36:08 31-Jan-2001]

Copyright (C) 1996-2000 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2000) http://blast.wustl.edu

Query=  gi|1786183|gb|AAC73113.1| (AE000111) aspartokinase I, homoserine
    dehydrogenase I [Escherichia coli]
        (820 letters)

Database:  ecoli.aa
           4289 sequences; 1,358,990 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

gb|AAC73113.1| (AE000111) aspartokinase I, homoserine deh...  4141  0.0       1
gb|AAC76922.1| (AE000468) aspartokinase II and homoserine...   844  3.1e-86   1
gb|AAC76994.1| (AE000475) aspartokinase III, lysine sensi...   483  2.8e-47   1
gb|AAC73282.1| (AE000126) uridylate kinase [Escherichia c...    97  0.0010    1



>gb|AAC73113.1| (AE000111) aspartokinase I, homoserine dehydrogenase I
            [Escherichia coli]
        Length = 820

 Score = 4141 (1462.8 bits), Expect = 0.0, P = 0.0
 Identities = 820/820 (100%), Positives = 820/820 (100%)

Query:     1 MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA 60
             MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA
Sbjct:     1 MRVLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDA 60

Query:    61 LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA 120
             LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA
Sbjct:    61 LPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDSINA 120

Query:   121 ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP 180
             ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP
Sbjct:   121 ALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRIP 180

Query:   181 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240
             ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV
Sbjct:   181 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240

Query:   241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300
             PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD
Sbjct:   241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300

Query:   301 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF 360
             EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF
Sbjct:   301 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF 360

Query:   361 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL 420
             CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL
Sbjct:   361 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGISAKFFAAL 420

Query:   421 ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIGVGGVGGAL 480
             ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIGVGGVGGAL
Sbjct:   421 ARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIGVGGVGGAL 480

Query:   481 LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL 540
             LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL
Sbjct:   481 LEQLKRQQSWLKNKHIDLRVCGVANSKALLTNVHGLNLENWQEELAQAKEPFNLGRLIRL 540

Query:   541 VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR 600
             VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR
Sbjct:   541 VKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMDYYHQLRYAAEKSR 600

Query:   601 RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA 660
             RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA
Sbjct:   601 RKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKLDEGMSFSEATTLA 660

Query:   661 REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA 720
             REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA
Sbjct:   661 REMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPAEFNAEGDVAAFMA 720

Query:   721 NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF 780
             NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF
Sbjct:   721 NLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDPLFKVKNGENALAF 780

Query:   781 YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV 820
             YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV
Sbjct:   781 YSHYYQPLPLVLRGYGAGNDVTAAGVFADLLRTLSWKLGV 820


>gb|AAC76922.1| (AE000468) aspartokinase II and homoserine dehydrogenase II
            [Escherichia coli]
        Length = 810

 Score = 844 (302.2 bits), Expect = 3.1e-86, P = 3.1e-86
 Identities = 222/705 (31%), Positives = 356/705 (50%)

Query:   116 DSINAALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIA 175
             D++ A ++  GE  S  +M+ VL  +G     +D  E L A     +  VD   S   + 
Sbjct:   119 DAVYAEVVGHGEVWSARLMSAVLNQQGLPAAWLDAREFLRAE-RAAQPQVDEGLSYPLLQ 177

Query:   176 ASRI--PADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVY 233
                +  P   +V+  GF + N  GE V+LGRNGSDYSA  + A        IW+DV GVY
Sbjct:   178 QLLVQHPGKRLVV-TGFISRNNAGETVLLGRNGSDYSATQIGALAGVSRVTIWSDVAGVY 236

Query:   234 TCDPRQVPDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGT 293
             + DPR+V DA LL  +   EA EL+   A VLH RT+ P++  +I   ++ +  P    T
Sbjct:   237 SADPRKVKDACLLPLLRLDEASELARLAAPVLHARTLQPVSGSEIDLQLRCSYTPDQGST 296

Query:   294 LIGASRDEDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSS 353
              I           + +++ +++ +     P  +        +   + RA++  + +   +
Sbjct:   297 RIERVLASGT-GARIVTSHDDVCLIEFQVPASQDFKLAHKEIDQILKRAQVRPLAVGVHN 355

Query:   354 SEYSISFCVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGIS 413
                 + FC        A + + E        GL   L + + LA++++VG G+ T   + 
Sbjct:   356 DRQLLQFCYTSEVADSALKILDEA-------GLPGELRLRQGLALVAMVGAGV-TRNPLH 407

Query:   414 A-KFFAALARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLFNTDQVIEVFVIG 472
               +F+  L    +      Q     S+  V+      + ++  HQ +F  ++ I + + G
Sbjct:   408 CHRFWQQLKGQPVEFTW--QSDDGISLVAVLRTGPTESLIQGLHQSVFRAEKRIGLVLFG 465

Query:   473 VGGVGGALLEQLKRQQSWLKNKH-IDLRVCGVANSKALLTNVHGLN----LENWQEELAQ 527
              G +G   LE   R+QS L  +   +  + GV +S+  L +  GL+    L  + +E  +
Sbjct:   466 KGNIGSRWLELFAREQSTLSARTGFEFVLAGVVDSRRSLLSYDGLDASRALAFFNDEAVE 525

Query:   528 AKEPFNLGRLIRLVKEYHLLNPVIVDCTSSQAVADQYADFLREGFHVVTPNKKANTSSMD 587
               E      L   ++ +   + V++D T+SQ +ADQY DF   GFHV++ NK A  S  +
Sbjct:   526 QDEE----SLFLWMRAHPYDDLVVLDVTASQQLADQYLDFASHGFHVISANKLAGASDSN 581

Query:   588 YYHQLRYAAEKSRRKFLYDTNVGAGLPVIENLQNLLNAGDELMKFSGILSGSLSYIFGKL 647
              Y Q+  A EK+ R +LY+  VGAGLP+   +++L+++GD ++  SGI SG+LS++F + 
Sbjct:   582 KYRQIHDAFEKTGRHWLYNATVGAGLPINHTVRDLIDSGDTILSISGIFSGTLSWLFLQF 641

Query:   648 DEGMSFSEATTLAREMGYTEPDPRDDLSGMDVARKLLILARETGRELELADIEIEPVLPA 707
             D  + F+E    A + G TEPDPRDDLSG DV RKL+ILARE G  +E   + +E ++PA
Sbjct:   642 DGSVPFTELVDQAWQQGLTEPDPRDDLSGKDVMRKLVILAREAGYNIEPDQVRVESLVPA 701

Query:   708 EFNAEGDVAAFMANLSQLDDLFAARVAKARDEGKVLRYVGNIDEDGVCRVKIAEVDGNDP 767
                  G +  F  N  +L++    R+  AR+ G VLRYV   D +G  RV +  V  + P
Sbjct:   702 HCEG-GSIDHFFENGDELNEQMVQRLEAAREMGLVLRYVARFDANGKARVGVEAVREDHP 760

Query:   768 LFKVKNGENALAFYSHYYQPLPLVLRGYGAGNDVTAAGVFADLLR 812
             L  +   +N  A  S +Y+  PLV+RG GAG DVTA  + +D+ R
Sbjct:   761 LASLLPCDNVFAIESRWYRDNPLVIRGPGAGRDVTAGAIQSDINR 805

 Score = 321 (118.1 bits), Expect = 3.6e-27, P = 3.6e-27
 Identities = 108/406 (26%), Positives = 191/406 (47%)

Query:     5 KFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQDALPNI 64
             KFGG+S+A+ + +LRVA I+   ++   +  V+SA    TN L+  ++ + + + +   +
Sbjct:    16 KFGGSSLADVKCYLRVAGIMAEYSQPDDMM-VVSAAGSTTNQLINWLKLSQTDRLSAHQV 74

Query:    65 SDAERIF-AELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLH-GISLLGQCPDSINAAL 122
                 R +  +L++GL  A+    L  +  FV  +  ++  +L  GI+      D++ A +
Sbjct:    75 QQTLRRYQCDLISGLLPAEEADSL--ISAFVS-DLERLAALLDSGIN------DAVYAEV 125

Query:   123 ICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVGHYLESTVDIAESTRRIAASRI--P 180
             +  GE  S  +M+ VL  +G     +D  E L A     +  VD   S   +    +  P
Sbjct:   126 VGHGEVWSARLMSAVLNQQGLPAAWLDAREFLRAE-RAAQPQVDEGLSYPLLQQLLVQHP 184

Query:   181 ADHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYTCDPRQV 240
                +V+  GF + N  GE V+LGRNGSDYSA  + A        IW+DV GVY+ DPR+V
Sbjct:   185 GKRLVV-TGFISRNNAGETVLLGRNGSDYSATQIGALAGVSRVTIWSDVAGVYSADPRKV 243

Query:   241 PDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTLIGASRD 300
              DA LL  +   EA EL+   A VLH RT+ P++  +I   ++ +  P    T I     
Sbjct:   244 KDACLLPLLRLDEASELARLAAPVLHARTLQPVSGSEIDLQLRCSYTPDQGSTRIERVLA 303

Query:   301 EDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSSEYSISF 360
                   + +++ +++ +     P  +        +   + RA++  + +   +    + F
Sbjct:   304 SGT-GARIVTSHDDVCLIEFQVPASQDFKLAHKEIDQILKRAQVRPLAVGVHNDRQLLQF 362

Query:   361 CVPQSDCVRAERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGM 406
             C        A + + E        GL   L + + LA++++VG G+
Sbjct:   363 CYTSEVADSALKILDEA-------GLPGELRLRQGLALVAMVGAGV 401


>gb|AAC76994.1| (AE000475) aspartokinase III, lysine sensitive [Escherichia
            coli]
        Length = 449

 Score = 483 (175.1 bits), Expect = 2.8e-47, P = 2.8e-47
 Identities = 149/467 (31%), Positives = 233/467 (49%)

Query:     3 VLKFGGTSVANAERFLRVADILESNARQGQVATVLSAPAKITNHLVAMIEKTISGQ---- 58
             V KFGGTSVA+ +   R ADI+ S+A    V  VLSA A ITN LVA+ E    G+    
Sbjct:     6 VSKFGGTSVADFDAMNRSADIVLSDANVRLV--VLSASAGITNLLVALAEGLEPGERFEK 63

Query:    59 -DALPNISDAERIFAELLTGLAAAQPGFPLAQLKTFVDQEFAQIKHVLHGISLLGQCPDS 117
              DA+ NI      FA +L  L      +P   ++  +++    I  VL   + L   P +
Sbjct:    64 LDAIRNIQ-----FA-ILERLR-----YPNV-IREEIERLLENIT-VLAEAAALATSP-A 109

Query:   118 INAALICRGEKMSIAIMAGVLEARGHNVTVIDPVEKLLAVG-HYLESTVDIAESTRRIAA 176
             +   L+  GE MS  +   +L  R       D V K++     +  +  DIA      A 
Sbjct:   110 LTDELVSHGELMSTLLFVEILRERDVQAQWFD-VRKVMRTNDRFGRAEPDIAALAELAAL 168

Query:   177 SRIPA--DHMVLMAGFTAGNEKGELVVLGRNGSDYSAAVLAACLRADCCEIWTDVDGVYT 234
               +P   + +V+  GF     KG    LGR GSDY+AA+LA  L A   +IWTDV G+YT
Sbjct:   169 QLLPRLNEGLVITQGFIGSENKGRTTTLGRGGSDYTAALLAEALHASRVDIWTDVPGIYT 228

Query:   235 CDPRQVPDARLLKSMSYQEAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQAPGTL 294
              DPR V  A+ +  +++ EA E++ FGAKVLHP T+ P  +  IP  + ++ +P+A GTL
Sbjct:   229 TDPRVVSAAKRIDEIAFAEAAEMATFGAKVLHPATLLPAVRSDIPVFVGSSKDPRAGGTL 288

Query:   295 IGASRDEDELPVKGISNLNNMAMFSVSGPGMKGMVGMAARVFAAMSRARISVVLITQSSS 354
             +  ++ E+    + ++   N  + ++    M    G  A VF  ++R  ISV LIT  +S
Sbjct:   289 V-CNKTENPPLFRALALRRNQTLLTLHSLNMLHSRGFLAEVFGILARHNISVDLIT--TS 345

Query:   355 EYSISFCVPQSDCVRA-ERAMQEEFYLELKEGLLEPLAVTERLAIISVVGDGMRTLRGIS 413
             E S++  +  +      +  + +   +EL    L  + V E LA+++++G+ +    G+ 
Sbjct:   346 EVSVALTLDTTGSTSTGDTLLTQSLLMELSA--LCRVEVEEGLALVALIGNDLSKACGVG 403

Query:   414 AKFFAALARANINIVAIAQGSSERSISVVVNNDDATTGVRVTHQMLF 460
              + F  L   NI +  I  G+S  ++  +V  +DA   V+  H  LF
Sbjct:   404 KEVFGVLEPFNIRM--ICYGASSHNLCFLVPGEDAEQVVQKLHSNLF 448


>gb|AAC73282.1| (AE000126) uridylate kinase [Escherichia coli]
        Length = 241

 Score = 97 (39.2 bits), Expect = 0.0010, P = 0.0010
 Identities = 30/98 (30%), Positives = 45/98 (45%)

Query:   199 LVVLGRN-GSDYSAAVLAACLR-----ADCCEIWTDVDGVYTCDPRQVPDARLLKSMSYQ 252
             +V+L    G+ +     AACLR     AD     T VDGV+T DP + P A + + ++Y 
Sbjct:   131 VVILSAGTGNPFFTTDSAACLRGIEIEADVVLKATKVDGVFTADPAKDPTATMYEQLTYS 190

Query:   253 EAMELSYFGAKVLHPRTITPIAQFQIPCLIKNTGNPQA 290
             E +E      KV+     T     ++P  + N   P A
Sbjct:   191 EVLEKEL---KVMDLAAFTLARDHKLPIRVFNMNKPGA 225


Parameters:
  E=0.01

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.136   0.384    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E    S W   T  X   E2     S2
   +0      0      820       820     0.010  93 3  11 22  0.19    34
                                                    37  0.22    37


Statistics:

  Database:  /home/jes12/db/ecoli.aa
   Title:  ecoli.aa
   Posted:  2:52:35 PM EST Nov 18, 2001
   Created:  9:46:47 AM EST Nov 18, 2001
   Format:  XDF-1
   # of letters in database:  1,358,990
   # of sequences in database:  4289
   # of database sequences satisfying E:  4
  No. of states in DFA:  573 (61 KB)
  Total size of DFA:  281 KB (1149 KB)
  Time to generate neighborhood:  0.00u 0.02s 0.02t  Elapsed: 00:00:00
  No. of threads or processors used:  1
  Search cpu time:  1.58u 0.00s 1.58t  Elapsed: 00:00:01
  Total cpu time:  1.59u 0.02s 1.61t  Elapsed: 00:00:01
  Start:  Thu Dec  6 11:09:14 2001   End:  Thu Dec  6 11:09:15 2001