File: twores.ali

package info (click to toggle)
librg-blast-parser-perl 0.02-2
  • links: PTS, VCS
  • area: main
  • in suites: wheezy
  • size: 156 kB
  • sloc: perl: 37; makefile: 6
file content (310 lines) | stat: -rw-r--r-- 10,119 bytes parent folder | download | duplicates (6)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.


Reference for composition-based statistics starting in round 2:
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= ORF02707
         (51 letters)

Database: big_80 
           6,172,567 sequences; 2,144,756,490 total letters

Searching..................................................done



 ***** No hits found ******

Searching..................................................done



 ***** No hits found ******

  Database: big_80
    Posted date:  Jun 21, 2011 12:12 AM
  Number of letters in database: 999,999,540
  Number of sequences in database:  1,233,551
  
  Database: /mnt/project/rost_db/work/big/20110619-1/big_80.01
    Posted date:  Jun 21, 2011 12:15 AM
  Number of letters in database: 999,999,953
  Number of sequences in database:  3,404,212
  
  Database: /mnt/project/rost_db/work/big/20110619-1/big_80.02
    Posted date:  Jun 21, 2011 12:16 AM
  Number of letters in database: 144,756,997
  Number of sequences in database:  1,534,804
  
Lambda     K      H
   0.322    0.134    0.479 

Lambda     K      H
   0.267   0.0412    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 526,009,561
Number of Sequences: 6172567
Number of extensions: 15216483
Number of successful extensions: 47546
Number of sequences better than  1.0: 1
Number of HSP's better than  1.0 without gapping: 0
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 47546
Number of HSP's gapped (non-prelim): 2
length of query: 51
length of database: 2,144,756,490
effective HSP length: 24
effective length of query: 27
effective length of database: 1,996,614,882
effective search space: 53908601814
effective search space used: 53908601814
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 81 (35.8 bits)
BLASTP 2.2.18 [Mar-02-2008]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= sp|P47944|MT4_HUMAN Metallothionein-4 OS=Homo sapiens GN=MT4
PE=2 SV=1
         (62 letters)

Database: big_80 
           7,170,017 sequences; 2,505,616,640 total letters

Searching..................................................done



>tr|F7G3K3|F7G3K3_ORNAN Metallothionein OS=Ornithorhynchus
          anatinus GN=MT4 PE=3 SV=1
          Length = 63

 Score = 77.0 bits (188), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 54/63 (85%), Positives = 58/63 (92%), Gaps = 1/63 (1%)

Query: 1  MDPRECVCMSGGICMCGDNCKCTTCNCKTCRKSCCPCCPPGCAKCARGCICKGGSD-KCS 59
          MDP EC CMSGGIC+CGDNCKCT CNCKTCRKSCCPCCPPGCAKCA+GC+CK G+D KCS
Sbjct: 1  MDPEECTCMSGGICICGDNCKCTNCNCKTCRKSCCPCCPPGCAKCAQGCVCKRGADNKCS 60

Query: 60 CCP 62
          CCP
Sbjct: 61 CCP 63


>tr|F7IDP1|F7IDP1_CALJA Metallothionein OS=Callithrix jacchus PE=3
          SV=1
          Length = 61

 Score = 62.4 bits (150), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 38/62 (61%), Positives = 42/62 (67%), Gaps = 1/62 (1%)

Query: 1  MDPRECVCMSGGICMCGDNCKCTTCNCKTCRKSCCPCCPPGCAKCARGCICKGGSDKCSC 60
          MDP  C C +GG C C  +CKC  C C + RKSCC CCP GCA CA+GCICK  SDK SC
Sbjct: 1  MDPN-CSCAAGGSCTCTSSCKCKECTCTSYRKSCCSCCPMGCANCAQGCICKEASDKYSC 59

Query: 61 CP 62
          C 
Sbjct: 60 CA 61


>tr|G5CAW2|G5CAW2_HETGA Metallothionein-3 OS=Heterocephalus glaber
          GN=GW7_14200 PE=4 SV=1
          Length = 68

 Score = 55.1 bits (131), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 34/67 (50%), Positives = 41/67 (61%), Gaps = 6/67 (8%)

Query: 1  MDPRECVCMSGGICMCGDNCKCTTCNCKTCRKSCCPCCPPGCAKCARGCIC------KGG 54
          MDP  C C +GG C C D+CKC  C C  C++SCC CCP  C KCA+ C+C      K  
Sbjct: 1  MDPETCPCPAGGSCTCADSCKCKGCRCTDCKQSCCSCCPADCEKCAKDCVCTGVKGGKAE 60

Query: 55 SDKCSCC 61
           +KCSCC
Sbjct: 61 EEKCSCC 67


>sp|P15786|MT1_COLLI Metallothionein-1 OS=Columba livia PE=1 SV=1
          Length = 63

 Score = 53.5 bits (127), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 40/62 (64%), Positives = 47/62 (75%), Gaps = 1/62 (1%)

Query: 1  MDPRECVCMSGGICMCGDNCKCTTCNCKTCRKSCCPCCPPGCAKCARGCICKG-GSDKCS 59
          MD ++C C +GG C CGDNCKC  C C +C+K CC CCP GCAKCA+GC+CKG  S KCS
Sbjct: 1  MDSQDCPCAAGGTCTCGDNCKCKNCKCTSCKKGCCSCCPAGCAKCAQGCVCKGPPSAKCS 60

Query: 60 CC 61
          CC
Sbjct: 61 CC 62


>sp|Q6J1T3|MT_SCYTO Metallothionein OS=Scyliorhinus torazame GN=mt
          PE=3 SV=1
          Length = 68

 Score = 51.2 bits (121), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/66 (54%), Positives = 43/66 (65%), Gaps = 7/66 (10%)

Query: 2  DPRECVCMSGGICMCGDNCKCTTCNCKT-----CRKSCCPCCPPGCAKCARGCICKG-GS 55
          D + CVC+ G  C C + C+C+ C C T     C+KSCC CCP GC  CA GC+CKG  S
Sbjct: 3  DTKPCVCLDGS-CSCENTCRCSDCRCPTSKAGRCQKSCCSCCPAGCTNCANGCVCKGKAS 61

Query: 56 DKCSCC 61
          DKCSCC
Sbjct: 62 DKCSCC 67


>tr|F6PZW8|F6PZW8_MACMU Metallothionein OS=Macaca mulatta
          GN=LOC702286 PE=3 SV=1
          Length = 61

 Score = 50.8 bits (120), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 32/61 (52%), Positives = 39/61 (63%), Gaps = 1/61 (1%)

Query: 1  MDPRECVCMSGGICMCGDNCKCTTCNCKTCRKSCCPCCPPGCAKCARGCICKGGSDKCSC 60
          MDP  C C +G  C C  +C C  C C +C+KSCC     GCAKCA+G +CKG S+ CSC
Sbjct: 1  MDP-NCSCAAGDSCSCAGSCTCKECKCTSCKKSCCSRYRVGCAKCAQGYVCKGASETCSC 59

Query: 61 C 61
          C
Sbjct: 60 C 60


>tr|G1QP37|G1QP37_NOMLE Metallothionein OS=Nomascus leucogenys
          PE=3 SV=1
          Length = 63

 Score = 47.8 bits (112), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 34/64 (53%), Positives = 42/64 (65%), Gaps = 3/64 (4%)

Query: 1  MDPRECVCMSGGICMCGDNCKCTTCNCKTCRKSCCPCC--PPGCAKCARGCICKGGSDKC 58
          MDP  C C +GG C C  + KC  C C + +K+ C C   P GCAKCA+GC+CKG S+KC
Sbjct: 1  MDP-NCSCATGGSCSCASSLKCKECKCTSRKKNVCCCSCCPMGCAKCAQGCVCKGTSEKC 59

Query: 59 SCCP 62
          SCC 
Sbjct: 60 SCCA 63


>tr|G1SSN8|G1SSN8_RABIT Uncharacterized protein OS=Oryctolagus
          cuniculus GN=MT2A PE=4 SV=1
          Length = 53

 Score = 46.6 bits (109), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 29/50 (58%), Positives = 37/50 (74%), Gaps = 4/50 (8%)

Query: 14 CMCGDNCKC-TTCNCKTCRKSCCPCCPPGCAKCARGCICKGGSDKCSCCP 62
          C+  D+C C ++C C +CRKSCCP    GC +CA+GCICKG S+KCSCC 
Sbjct: 7  CITSDSCTCASSCTCMSCRKSCCP---VGCVQCAQGCICKGASNKCSCCA 53


>tr|B5XDS4|B5XDS4_SALSA Metallothionein OS=Salmo salar GN=MT PE=3
          SV=1
          Length = 59

 Score = 46.6 bits (109), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 33/54 (61%), Positives = 35/54 (64%), Gaps = 4/54 (7%)

Query: 1  MDPRECVCMSGGICMCGDNCKCTTCNCKTCRKSCCPCCPPGCAKCARGCICKGG 54
          MDP  C C   G C CG  CKCT C C T  KSCC CCP GC+KCA GC+CK G
Sbjct: 1  MDP--CDCSKTGRCSCGGLCKCTNCGCAT--KSCCSCCPTGCSKCASGCVCKEG 50


>tr|G1QP67|G1QP67_NOMLE Metallothionein OS=Nomascus leucogenys
          GN=MT1CP PE=3 SV=1
          Length = 61

 Score = 45.4 bits (106), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/61 (59%), Positives = 43/61 (70%), Gaps = 1/61 (1%)

Query: 1  MDPRECVCMSGGICMCGDNCKCTTCNCKTCRKSCCPCCPPGCAKCARGCICKGGSDKCSC 60
          MD   C C +GG C C  +CKC    C +C+K+CC CCP GCAKCA+GCICKG  +KCSC
Sbjct: 1  MD-LSCSCATGGSCTCASSCKCKEYKCTSCKKNCCSCCPVGCAKCAQGCICKGALEKCSC 59

Query: 61 C 61
          C
Sbjct: 60 C 60


  Database: big_80
    Posted date:  Feb 6, 2012  9:39 AM
  Number of letters in database: 999,999,843
  Number of sequences in database:  1,100,691
  
  Database: /var/tmp/rost_db/data/big/big_80.01
    Posted date:  Feb 6, 2012  9:42 AM
  Number of letters in database: 999,999,792
  Number of sequences in database:  2,641,400
  
  Database: /var/tmp/rost_db/data/big/big_80.02
    Posted date:  Feb 6, 2012  9:44 AM
  Number of letters in database: 505,617,005
  Number of sequences in database:  3,427,926
  
Lambda     K      H
   0.331    0.144    0.623 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 334,236,310
Number of Sequences: 7170017
Number of extensions: 10911235
Number of successful extensions: 95546
Number of sequences better than 100.0: 10
Number of HSP's better than 100.0 without gapping: 275
Number of HSP's successfully gapped in prelim test: 1456
Number of HSP's that attempted gapping in prelim test: 82115
Number of HSP's gapped (non-prelim): 10847
length of query: 62
length of database: 2,505,616,640
effective HSP length: 34
effective length of query: 28
effective length of database: 2,261,836,062
effective search space: 63331409736
effective search space used: 63331409736
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 64 (29.3 bits)