File: text_2226_blastx_002.txt

package info (click to toggle)
python-biopython 1.68%2Bdfsg-3
  • links: PTS, VCS
  • area: main
  • in suites: stretch
  • size: 46,860 kB
  • ctags: 13,237
  • sloc: python: 160,306; xml: 93,216; ansic: 9,118; sql: 1,208; makefile: 155; sh: 63
file content (273 lines) | stat: -rw-r--r-- 10,569 bytes parent folder | download | duplicates (6)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
BLASTX 2.2.26+


Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A.
Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J.
Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of
protein database search programs", Nucleic Acids Res. 25:3389-3402.



Database: NCBI Protein Reference Sequences
           11,879,989 sequences; 4,140,237,112 total letters



Query= gi|356995852:1-490 Mus musculus POU domain, class 5, transcription
factor 1 (Pou5f1), transcript variant 1, mRNA

Length=490
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

ref|NP_038661.2|  POU domain, class 5, transcription factor 1 iso...   192    4e-57
ref|NP_001009178.1|  POU class 5 homeobox 1 [Rattus norvegicus]        179    3e-52
ref|XP_003505911.1|  PREDICTED: POU domain, class 5, transcriptio...   147    7e-40
ref|XP_001490158.1|  PREDICTED: POU domain, class 5, transcriptio...   141    8e-38
ref|XP_001135162.1|  PREDICTED: POU domain, class 5, transcriptio...   138    1e-36
ref|NP_002692.2|  POU domain, class 5, transcription factor 1 iso...   138    2e-36
ref|XP_002809144.1|  PREDICTED: POU domain, class 5, transcriptio...   137    2e-36
ref|XP_528230.1|  PREDICTED: putative POU domain, class 5, transc...   133    1e-34
ref|XP_002746363.1|  PREDICTED: POU domain, class 5, transcriptio...   133    1e-34
ref|NP_001108427.1|  POU domain, class 5, transcription factor 1 ...   133    1e-34
ref|NP_001106531.1|  POU domain, class 5, transcription factor 1 ...   133    1e-34
ref|XP_002752527.1|  PREDICTED: POU domain, class 5, transcriptio...   132    2e-34
ref|NP_001166912.1|  POU domain, class 5, transcription factor 1 ...   131    6e-34
ref|NP_777005.1|  POU domain, class 5, transcription factor 1 [Bo...   131    6e-34
ref|NP_001153014.1|  putative POU domain, class 5, transcription ...   128    6e-33
ref|NP_001093427.1|  POU domain, class 5, transcription factor 1 ...   127    2e-32
ref|XP_538830.1|  PREDICTED: POU domain, class 5, transcription f...   119    1e-29
ref|XP_003422494.1|  PREDICTED: POU domain, class 5, transcriptio...   118    4e-29
ref|XP_003474038.1|  PREDICTED: POU domain, class 5, transcriptio...   117    7e-29
ref|XP_003271964.1|  PREDICTED: LOW QUALITY PROTEIN: POU domain, ...   111    7e-27


>ref|NP_038661.2| POU domain, class 5, transcription factor 1 isoform 1 [Mus musculus]
Length=352

 Score =  192 bits (487),  Expect = 4e-57
 Identities = 140/140 (100%), Positives = 140/140 (100%), Gaps = 0/140 (0%)
 Frame = +3

Query  69   MAGHLAsdfafspppgggdgsagLEPGWVDPRTWLSFQgppggpgigpgSEVLGISPCPP  248
            MAGHLASDFAFSPPPGGGDGSAGLEPGWVDPRTWLSFQGPPGGPGIGPGSEVLGISPCPP
Sbjct  1    MAGHLASDFAFSPPPGGGDGSAGLEPGWVDPRTWLSFQGPPGGPGIGPGSEVLGISPCPP  60

Query  249  AYEFCGGMAYCgpqvglglvpqvgvETLQPEGQAGARVESNSEGTSSEPCADRPNAVKLE  428
            AYEFCGGMAYCGPQVGLGLVPQVGVETLQPEGQAGARVESNSEGTSSEPCADRPNAVKLE
Sbjct  61   AYEFCGGMAYCGPQVGLGLVPQVGVETLQPEGQAGARVESNSEGTSSEPCADRPNAVKLE  120

Query  429  KVEPTPEESQDMKALQKELE  488
            KVEPTPEESQDMKALQKELE
Sbjct  121  KVEPTPEESQDMKALQKELE  140


>ref|NP_001009178.1| POU class 5 homeobox 1 [Rattus norvegicus]
Length=352

 Score =  179 bits (454),  Expect = 3e-52
 Identities = 133/140 (95%), Positives = 135/140 (96%), Gaps = 0/140 (0%)
 Frame = +3

Query  69   MAGHLAsdfafspppgggdgsagLEPGWVDPRTWLSFQgppggpgigpgSEVLGISPCPP  248
            MAGHLASDFAFSPPPGGGDGSAGLEPGWVDPRTWLSFQGPP GPGIGPGSEVLGISPCPP
Sbjct  1    MAGHLASDFAFSPPPGGGDGSAGLEPGWVDPRTWLSFQGPPSGPGIGPGSEVLGISPCPP  60

Query  249  AYEFCGGMAYCgpqvglglvpqvgvETLQPEGQAGARVESNSEGTSSEPCADRPNAVKLE  428
            AYEFCGGMAYCGPQVGLGLVPQVGVETLQPEGQAGARVESNSEG SS PC  RP+AVKLE
Sbjct  61   AYEFCGGMAYCGPQVGLGLVPQVGVETLQPEGQAGARVESNSEGASSGPCTARPSAVKLE  120

Query  429  KVEPTPEESQDMKALQKELE  488
            KVEP+PEESQDMKALQKELE
Sbjct  121  KVEPSPEESQDMKALQKELE  140


>ref|XP_003505911.1| PREDICTED: POU domain, class 5, transcription factor 1-like [Cricetulus 
griseus]
Length=358

 Score =  147 bits (370),  Expect = 7e-40
 Identities = 120/145 (83%), Positives = 128/145 (88%), Gaps = 5/145 (3%)
 Frame = +3

Query  69   MAGHLAsdfafspppgggdgsagLEPGWVDPRTWLSFQgppggpgigpg----SEVLGIS  236
            MAGHLASDFAFSPPPGGGDGS GLEPGWVDPRTWLSFQGPPGGPGIGPG    SEVLGIS
Sbjct  1    MAGHLASDFAFSPPPGGGDGSGGLEPGWVDPRTWLSFQGPPGGPGIGPGVGPGSEVLGIS  60

Query  237  PCPPAYEFCGGMAYCgpqvglglvpqvgvETLQPEGQAGARVESNSEGTSSEPCADRPNA  416
            PCPP YEFCGG+AYCGPQVGLGLVP VG+ET QPEGQ+GA VE++SE +S  PC  RP  
Sbjct  61   PCPPPYEFCGGVAYCGPQVGLGLVPPVGLETSQPEGQSGAGVENDSEESSPGPCTARPIV  120

Query  417  -VKLEKVEPTPEESQDMKALQKELE  488
             VKLEKVEP+PEESQD+KALQKELE
Sbjct  121  PVKLEKVEPSPEESQDVKALQKELE  145


>ref|XP_001490158.1| PREDICTED: POU domain, class 5, transcription factor 1-like [Equus 
caballus]
Length=360

 Score =  141 bits (356),  Expect = 8e-38
 Identities = 93/122 (76%), Positives = 101/122 (83%), Gaps = 6/122 (5%)
 Frame = +3

Query  141  EPGWVDPRTWLSFQgppggpgigpg----SEVLGISPCPPAYEFCGGMAYCgpqvglglv  308
            EPGWVDPRTWLSFQGPP G GIGPG    +EV GI PCPP YEFCGGMAYCGPQVG+GLV
Sbjct  26   EPGWVDPRTWLSFQGPPSGSGIGPGVGPGAEVWGIPPCPPPYEFCGGMAYCGPQVGVGLV  85

Query  309  pqvgvETLQPEGQAGARVESNSEGTSSEPCADRPNAVKL--EKVEPTPEESQDMKALQKE  482
            PQ  +ET QPEG+AGARVESNSEG S EPCA  P AVK+  EK+E  PEESQD+KALQK+
Sbjct  86   PQGSLETSQPEGEAGARVESNSEGASPEPCAAPPGAVKVDKEKLEQNPEESQDIKALQKD  145

Query  483  LE  488
            LE
Sbjct  146  LE  147


>ref|XP_001135162.1| PREDICTED: POU domain, class 5, transcription factor 1 isoform 
2 [Pan troglodytes]
Length=359

 Score =  138 bits (348),  Expect = 1e-36
 Identities = 95/121 (79%), Positives = 101/121 (83%), Gaps = 5/121 (4%)
 Frame = +3

Query  141  EPGWVDPRTWLSFQgppggpgigpg----SEVLGISPCPPAYEFCGGMAYCgpqvglglv  308
            EPGWVDPRTWLSFQGPPGGPGIGPG    SEV GI PCPP YEFCGGMAYCGPQVG+GLV
Sbjct  26   EPGWVDPRTWLSFQGPPGGPGIGPGVGPGSEVWGIPPCPPPYEFCGGMAYCGPQVGVGLV  85

Query  309  pqvgvETLQPEGQAGARVESNSEGTSSEPCADRPNAVKLE-KVEPTPEESQDMKALQKEL  485
            PQ G+ET QPEG+AG  VESNS+G S EPC   P AVKLE K+E  PEESQD+KALQKEL
Sbjct  86   PQGGLETSQPEGEAGVGVESNSDGASPEPCTVTPGAVKLEKKLEQNPEESQDIKALQKEL  145

Query  486  E  488
            E
Sbjct  146  E  146


>ref|NP_002692.2| POU domain, class 5, transcription factor 1 isoform 1 [Homo sapiens]
 ref|NP_001238970.1| POU domain, class 5, transcription factor 1 [Pan troglodytes]
Length=360

 Score =  138 bits (347),  Expect = 2e-36
 Identities = 95/122 (78%), Positives = 101/122 (83%), Gaps = 6/122 (5%)
 Frame = +3

Query  141  EPGWVDPRTWLSFQgppggpgigpg----SEVLGISPCPPAYEFCGGMAYCgpqvglglv  308
            EPGWVDPRTWLSFQGPPGGPGIGPG    SEV GI PCPP YEFCGGMAYCGPQVG+GLV
Sbjct  26   EPGWVDPRTWLSFQGPPGGPGIGPGVGPGSEVWGIPPCPPPYEFCGGMAYCGPQVGVGLV  85

Query  309  pqvgvETLQPEGQAGARVESNSEGTSSEPCADRPNAVKL--EKVEPTPEESQDMKALQKE  482
            PQ G+ET QPEG+AG  VESNS+G S EPC   P AVKL  EK+E  PEESQD+KALQKE
Sbjct  86   PQGGLETSQPEGEAGVGVESNSDGASPEPCTVTPGAVKLEKEKLEQNPEESQDIKALQKE  145

Query  483  LE  488
            LE
Sbjct  146  LE  147


>ref|XP_002809144.1| PREDICTED: POU domain, class 5, transcription factor 1-like [Pongo 
abelii]
Length=360

 Score =  137 bits (346),  Expect = 2e-36
 Identities = 94/122 (77%), Positives = 100/122 (82%), Gaps = 6/122 (5%)
 Frame = +3

Query  141  EPGWVDPRTWLSFQgppggpgigpg----SEVLGISPCPPAYEFCGGMAYCgpqvglglv  308
            EPGWVDPRTWLSFQGPPGGPGIGPG    SEV GI PCPP YEFCGGMAYCGPQVG+GLV
Sbjct  26   EPGWVDPRTWLSFQGPPGGPGIGPGVGPGSEVWGIPPCPPPYEFCGGMAYCGPQVGVGLV  85

Query  309  pqvgvETLQPEGQAGARVESNSEGTSSEPCADRPNAVKL--EKVEPTPEESQDMKALQKE  482
            PQ  +ET QPEG+AG  VESNS+G S EPC   P AVKL  EK+E  PEESQD+KALQKE
Sbjct  86   PQGSLETSQPEGEAGVGVESNSDGASPEPCTVPPGAVKLEKEKLEQNPEESQDIKALQKE  145

Query  483  LE  488
            LE
Sbjct  146  LE  147


>ref|XP_528230.1| PREDICTED: putative POU domain, class 5, transcription factor 
1B [Pan troglodytes]
Length=359

 Score =  133 bits (334),  Expect = 1e-34
 Identities = 93/122 (76%), Positives = 98/122 (80%), Gaps = 6/122 (5%)
 Frame = +3

Query  141  EPGWVDPRTWLSFQgppggpgigpg----SEVLGISPCPPAYEFCGGMAYCgpqvglglv  308
            EPGWVDP TWL FQGPPGGPGIGPG    SEV GI PCPP YEFCGGMAYCGPQVG GLV
Sbjct  26   EPGWVDPLTWLRFQGPPGGPGIGPGVGPGSEVWGIPPCPPPYEFCGGMAYCGPQVGAGLV  85

Query  309  pqvgvETLQPEGQAGARVESNSEGTSSEPCADRPNAVKL--EKVEPTPEESQDMKALQKE  482
            PQ G+ET QPEG+AG  VESNS+G S EPC   P AVKL  EK+E  PEESQD+KALQKE
Sbjct  86   PQGGLETSQPEGEAGVGVESNSDGASPEPCTVPPGAVKLEKEKLEQNPEESQDIKALQKE  145

Query  483  LE  488
            LE
Sbjct  146  LE  147


>ref|XP_002746363.1| PREDICTED: POU domain, class 5, transcription factor 1-like [Callithrix 
jacchus]
Length=360

 Score =  133 bits (334),  Expect = 1e-34
 Identities = 94/122 (77%), Positives = 100/122 (82%), Gaps = 6/122 (5%)
 Frame = +3

Query  141  EPGWVDPRTWLSFQgppggpgigpg----SEVLGISPCPPAYEFCGGMAYCgpqvglglv  308
            E GWVDPRTWLSFQGPPGGPGIGPG    +EV GI PCPP YEFCGGMAYCGPQVG+GLV
Sbjct  26   ETGWVDPRTWLSFQGPPGGPGIGPGVGPGAEVWGIPPCPPPYEFCGGMAYCGPQVGVGLV  85

Query  309  pqvgvETLQPEGQAGARVESNSEGTSSEPCADRPNAVKL--EKVEPTPEESQDMKALQKE  482
            PQ G+ET QPEG+AGA VESNSEG S EPC   P AVKL  EK+E   EESQD+KALQKE
Sbjct  86   PQGGLETSQPEGEAGAGVESNSEGASPEPCTIPPGAVKLEKEKLEQNTEESQDIKALQKE  145

Query  483  LE  488
            LE
Sbjct  146  LE  147


>ref|NP_001108427.1| POU domain, class 5, transcription factor 1 [Macaca mulatta]
Length=360

 Score =  133 bits (334),  Expect = 1e-34
 Identities = 94/122 (77%), Positives = 100/122 (82%), Gaps = 6/122 (5%)
 Frame = +3

Query  141  EPGWVDPRTWLSFQgppggpgigpg----SEVLGISPCPPAYEFCGGMAYCgpqvglglv  308
            E GWVDPRTWLSFQGPPGGPGIGPG    SEV GI PCPP YEFCGGMAYCGPQVG+GLV
Sbjct  26   ETGWVDPRTWLSFQGPPGGPGIGPGVGPGSEVWGIPPCPPPYEFCGGMAYCGPQVGVGLV  85

Query  309  pqvgvETLQPEGQAGARVESNSEGTSSEPCADRPNAVKL--EKVEPTPEESQDMKALQKE  482
            PQ G+ET QPEG+AGA VESNS+G S EPC     AVKL  EK+E  PEESQD+KALQKE
Sbjct  86   PQGGLETSQPEGEAGAGVESNSDGASPEPCTVPTGAVKLEKEKLEQNPEESQDIKALQKE  145

Query  483  LE  488
            LE
Sbjct  146  LE  147



Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.267   0.0410    0.140 

Effective search space used: 104017620564


  Database: NCBI Protein Reference Sequences
    Posted date:  Mar 18, 2012  8:41 PM
  Number of letters in database: 4,140,237,112
  Number of sequences in database:  11,879,989



Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Neighboring words threshold: 12
Window for multiple hits: 40