File: protpars.phy

package info (click to toggle)
bioperl 1.7.8-1
  • links: PTS, VCS
  • area: main
  • in suites: bookworm, sid, trixie
  • size: 35,788 kB
  • sloc: perl: 94,019; xml: 14,811; makefile: 20
file content (413 lines) | stat: -rw-r--r-- 21,357 bytes parent folder | download | duplicates (9)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
 3 5127
SINFRUP001   .......... ......DDQV VLQCTASVLK EQIKLCLSCE GFGNRLCFLE 
SINFRUP002   .......... ......DDQV VLQCTASVLK EQIKLCLSCE GFGNRLCFLE 
ENSP000003   .MGDAEGEDE VQFLRTDDEV VLQCSATVLK EQLKLCLAAE GFGNRLCFLE 

             TTSNAQNVPP DLAICTFILE QSLSVRALQE MLANTVEMTE AVDLDKWSSQ 
             TTSNAQNVPP DLAICTFILE QSLSVRALQE MLANTVEMTE AVDLDKWSSQ 
             PTSNAQNVPP DLAICCFVLE QSLSVRALQE MLANT..... .VEAGVESSQ 

             GGGHRTLLYG HAILLRHNHS GMYLSCLTTS RSLTDKLAFD VGLQEDSTGE 
             GGGHRTLLYG HAILLRHNHS GMYLSCLTTS RSLTDKLAFD VGLQEDSTGE 
             GGGHRTLLYG HAILLRHAHS RMYLSCLTTS RSMTDKLAFD VGLQEDATGE 

             ACWWTIHPAS KQRSEGEKVR VGDDLILVSV SSERYLHLSY ASGDLMVDAS 
             ACWWTIHPAS KQRSEGEKVR VGDDLILVSV SSERYLHLSY ASGDLMVDAS 
             ACWWTMHPAS KQRSEGEKVR VGDDIILVSV SSERYLHLST ASGELQVDAS 

             FMQTLWNMNP ISSGCELAEG FLTGGHVLRL FHGHMDECLA IATPEEGEEK 
             FMQTLWNMNP ISSGCELAEG FLTGGHVLRL FHGHMDECLA IATPEEGEEK 
             FMQTLWNMNP ICSRCE..EG FVTGGHVLRL FHGHMDECLT ISPADS.DDQ 

             RRMAHYEGGS VCSQARSLWR LEPLRISWSG SHMKWGQSFR IRHITTGRYL 
             RRMAHYEGGS VCSQARSLWR LEPLRISWSG SHMKWGQSFR IRHITTGRYL 
             RRLVYYEGGA VCTHARSLWR LEPLRISWSG SHLRWGQPLR VRHVTTGQYL 

             CLDEEKGLLV VDPERANTKL SAFCFRASKE KVDVAQKRDV EGMGIPEIKY 
             CLDEEKGLLV VDPERANTKL SAFCFRASKE KVDVAQKRDV EGMGIPEIKY 
             ALTEDQGLVV VDASKAHTKA TSFCFRISKE KLDVAPKRDV EGMGPPEIKY 

             GESMCFVQHV STGLWLTYAS LDAKAARLGM MKRKVILHQE GHMDDALTVS 
             GESMCFVQHV STGLWLTYAS LDAKAARLGM MKRKVILHQE GHMDDALTVS 
             GESLCFVQHV ASGLWLTYAA PDPKALRLGV LKKKAMLHQE GHMDDALSLT 

             RSQTEESQAA RMIYSTVGLF RQFIKGLDTL TGKNKSPGAL S...LPLEGV 
             RSQTEESQAA RMIYSTVGLF RQFIKGLDTL TGKNKSPGAL S...LPLEGV 
             RCQQEESQAA RMIHSTNGLY NQFIKSLDSF SGKPRGSGPP AGTALPIEGV 

             ILSLQDLIFY FRPPDEELEH EEKQTKLRSL RNRQNLFQEE GMITIVLECI 
             ILSLQDLIFY FRPPDEELEH EEKQTKLRSL RNRQNLFQEE GMITIVLECI 
             ILSLQDLIIY FEPPSEDLQH EEKQSKLRSL RNRQSLFQEE GMLSMVLNCI 

             DRLNVYNTAA HFSEFAGEEA AESWKEIVNL LYELLASLIR GNRSNCALFC 
             DRLNVYNTAA HFSEFAGEEA AESWKEIVNL LYELLASLIR GNRSNCALFC 
             DRLNVYTTAA HFAEFAGEEA AESWKEIVNL LYELLASLIR GNRSNCALFS 

             DNLDWLVSKL DRLEASSGIL EVLYCVLIES PEVLNIIQEN HIKSIISLLD 
             DNLDWLVSKL DRLEASSGIL EVLYCVLIES PEVLNIIQEN HIKSIISLLD 
             TNLDWLVSKL DRLEASSGIL EVLYCVLIES PEVLNIIQEN HIKSIISLLD 

             KHGRNHKVLD VLRSLCVCNG VAVRSNQNLI TENLLPGRDL LLQTNIVNYV 
             KHGRNHKVLD VLRSLCVCNG VAVRSNQNLI TENLLPGRDL LLQTNIVNYV 
             KHGRNHKVLD VLCSLCVCNG VAVRSNQDLI TENLLPGREL LLQTNLINYV 

             TSVRPNIFLG TCEGSTQYKK WYYEVMVDHV EAFVTAQATH LRVGWAMTEG 
             TSVRPNIFLG TCEGSTQYKK WYYEVMVDHV EAFVTAQATH LRVGWAMTEG 
             TSIRPNIFVG RAEGTTQYSK WYFEVMVDEV TPFLTAQATH LRVGWALTEG 

             YSPYPGGGEG WGGNGVGDDL YSYSFDGLHL WSGTVPRQVA SPNAHTLAAD 
             YSPYPGGGEG WGGNGVGDDL YSYSFDGLHL WSGTVPRQVA SPNAHTLAAD 
             YTPYPGAGEG WGGNGVGDDL YSYGFDGLHL WTGHVARPVT SPGQHLLAPE 

             DVVSCCLDLS VPSISFRING HPVQGMFENF NVDSLFFPVI SFSAGVKARF 
             DVVSCCLDLS VPSISFRING HPVQGMFENF NVDSLFFPVI SFSAGVKARF 
             DVISCCLDLS VPSISFRING CPVQGVFESF NLDGLFFPVV SFSAGVKVRF 

             LLGGRHGDFK FMPPPGYAPC YEALLPRERM RIEPIKEYKH DFNGVRNLLG 
             LLGGRHGDFK FMPPPGYAPC YEALLPRERM RIEPIKEYKH DFNGVRNLLG 
             LLGGRHGEFK FLPPPGYAPC HEAVLPRERL HLEPIKEYRR EGPRGPHLVG 

             PTLSLTHTSF TPCPVDTVQI VLPPHLERIR EKLAENIHEL WAVTRIEQGW 
             PTLSLTHTSF TPCPVDTVQI VLPPHLERIR EKLAENIHEL WAVTRIEQGW 
             PSRCLSHTDF VPCPVDTVQI VLPPHLERIR EKLAENIHEL WALTRIEQGW 

             TYGSFRDDNK KLHPCLVDFQ SLPEPERNYN LQMSAETLKC VCAV...A.. 
             TYGSFRDDNK KLHPCLVDFQ SLPEPERNYN LQMSAETLKC VCAV...A.. 
             TYGPVRDDNK RLHPCLVDFH SLPEPERNYN LQMSGETLKT LLALGCHVGM 

             ......ETLH DCVSSR.YVM SNAYKPAPLD LSHVKLTPNQ NQLVEKLAEN 
             ......ETLH DCVSSR.YVM SNAYKPAPLD LSHVKLTPNQ NQLVEKLAEN 
             ADEKAEDNLK KTKLPKTYMM SNGYKPAPLD LSHVRLTPAQ TTLVDRLAEN 

             GHNVWARDRV RQGWTYSIVQ DILNKRNPRL VPYILLDERT KKTNRDSVNN 
             GHNVWARDRV RQGWTYSIVQ DILNKRNPRL VPYILLDERT KKTNRDSVNN 
             GHNVWARDRV GQGWSYSAVQ DIPARRNPRL VPYRLLDEAT KRSNRDSLCQ 

             AVRTLIGYGY NIEPPDQEST GHGLENTRGD KVRIFRAEKS YAVTQGKWYF 
             AVRTLIGYGY NIEPPDQEST GHGLENTRGD KVRIFRAEKS YAVTQGKWYF 
             AVRTLLGYGY NIEPPDQEPS Q.VENQSRCD RVRIFRAEKS YTVQSGRWYF 

             EFEAVTTGEM RVGWARPNVH SDTELGADEL AYVFNGNKA. ........QR 
             EFEAVTTGEM RVGWARPNVH SDTELGADEL AYVFNGNKA. ........QR 
             EFEAVTTGEM RVGWARPELR PDVELGADEL AYVFNGHRG. ........QR 

             WHIGNEPFGR QWQSGDVVGC MIDLTEMNIM FTLNGEMLIS DSGSEMAFKD 
             WHIGNEPFGR QWQSGDVVGC MIDLTEMNIM FTLNGEMLIS DSGSEMAFKD 
             WHLGSEPFGR PWQPGDVVGC MIDLTENTII FTLNGEVLMS DSGSETAFRE 

             IEIGEGFIPV CTLGLSQVGR INLGQNVSSL RYFAICGLQE GFEPFAINMK 
             IEIGEGFIPV CTLGLSQVGR INLGQNVSSL RYFAICGLQE GFEPFAINMK 
             IEIGDGFLPV CSLGPGQVGH LNLGQDVSSL RFFAICGLQE GFEPFAINMQ 

             RDTTMWFSKS LPQFVPVPAD HNHIEVSRVD GTVDSAPCLK LTHKTYGSQN 
             RDTTMWFSKS LPQFVPVPAD HNHIEVSRVD GTVDSAPCLK LTHKTYGSQN 
             RPVTTWFSKG LPQFEPVPLE HPHYEVSRVD GTVDTPPCLR LTHRTWGSQN 

             ANTDMLFLRL SMPIQFHATF KVPAGTTPLT RALTIP...E DVAVVEPDSE 
             ANTDMLFLRL SMPIQFHATF KVPAGTTPLT RALTIP...E DVAVVEPDSE 
             SLVEMLFLRL SLPVQFHQHF RCTAGATPLA PPGLQPPAED EARAAEPDPD 

             FEVLKKSASR KEQEEDKKEP SVPKEI.... ........L. .AENEKDTMS 
             FEVLKKSASR KEQEEDKKEP SVPKEI.... ........L. .AENEKDTMS 
             YENLRRSAGG WSEAENGKEG TAKEGAPGGT PQAGGEAQPA RAENEKDATT 

             EKGKKRGFFS KAKKAAMTPL A.....PPPP PTVPRLVEDV VPDD.RDDPE 
             EKGKKRGFFS KAKKAAMTPL A.....PPPP PTVPRLVEDV VPDD.RDDPE 
             EKNKKRGFLF KAKKVAMMTQ P......PAT PTLPRLPHDV VPADNRDDPE 

             IILSTTTYYY SVRIFAGQEP SGVWVGWVTP DYHQYDQTFD LSKVRSVTVT 
             IILSTTTYYY SVRIFAGQEP SGVWVGWVTP DYHQYDQTFD LSKVRSVTVT 
             IILNTTTYYY SVRVFAGQEP SCVWAGWVTP DYHQHDMSFD LSKVRVVTVT 

             VGDDKGNIYN SMKRSNCYMV WGDDLVS.NH QTRFSQEDMV IGCLVDLATG 
             VGDDKGNIYN SMKRSNCYMV WGDDLVS.NH QTRFSQEDMV IGCLVDLATG 
             MGDEQGNVHS SLKCSNCYMV WGGDFVSPGQ QGRISHTDLV IGCLVDLATG 

             LMTFTANGKE INTFYQVEPN TKLFPAVFVQ PLSQNMVQLE LGKLKNIMPI 
             LMTFTANGKE INTFYQVEPN TKLFPAVFVQ PLSQNMVQLE LGKLKNIMPI 
             LMTFTANGKE SNTFFQVEPN TKLFPAVFVL PTHQNVIQFE LGKQKNIMPL 

             SAAMFRSERN NPVPQCPPRL DVQMLTPVIW SRMPNRFLNP DVGRVSERLG 
             SAAMFRSERN NPVPQCPPRL DVQMLTPVIW SRMPNRFLNP DVGRVSERLG 
             SAAMFQSERK NPAPQCPPRL EMQMLMPVSW SRMPNHFLQV ETRRAGERLG 

             WVVECTEPLI MMALHIPEEN RCIDILELSE RQDLMKFHYH TLMLYCAVCA 
             WVVECTEPLI MMALHIPEEN RCIDILELSE RQDLMKFHYH TLMLYCAVCA 
             WAVQCQEPLT MMALHIPEEN RCMDILELSE RLDLQRFHSH TLRLYRAVCA 

             LGNNRVAHAL CSHVDESQLF YATENTYLPG PLRSGYYDLL ISIHLESAKR 
             LGNNRVAHAL CSHVDESQLF YATENTYLPG PLRSGYYDLL ISIHLESAKR 
             LGNNRVAHAL CSHVDQAQLL HALEDAHLPG PLRAGYYDLL ISIHLESACR 

             ARLGTNREFI VPMTEETLSI KLYPDAV... ...KAHSLPG VGLTTCLRPK 
             ARLGTNREFI VPMTEETLSI KLYPDAV... ...KAHSLPG VGLTTCLRPK 
             SRRSMLSEYI VPLTPETRAI TLFPPGRSTE NGHPRHGLPG VGVTTSLRPP 

             LHFS...... SINFVGTDLD LYTLSPVFPL QELKNRAISM LTEAVLDGSQ 
             LHFS...... SINFVGTDLD LYTLSPVFPL QELKNRAISM LTEAVLDGSQ 
             HHFSPPCFVA ALPAAGAAEA PARLSPAIPL EALRDKALRM LGEAVRDGGQ 

             AMRDPVGGSV EFHFVPILKL ISTLLIMGIF NDDDTKHILK MIDPNVFSGK 
             AMRDPVGGSV EFHFVPILKL ISTLLIMGIF NDDDTKHILK MIDPNVFSGK 
             HARDPVGGSV EFQFVPVLKL VSTLLVMGIF GDEDVKQILK MIEPEVFTEE 

             DDEE...... ETDKPVEGGP AEGEGDKAKG EESEEAAELE D...EGVGKV 
             DDEE...... ETDKPVEGGP AEGEGDKAKG EESEEAAELE D...EGVGKV 
             EEEE...... ..DEEEEGEE EDEEE..... .........K E...EDEEET 

             DGEKMEEEKE AEVVAVDLKD EEEGLEEGLL QMKLPESVKL QMCTLLQFFC 
             DGEKMEEEKE AEVVAVDLKD EEEGLEEGLL QMKLPESVKL QMCTLLQFFC 
             AQEKEDEEKE EEEAAE..GE KEEGLEEGLL QMKLPESVKL QMCHLLEYFC 

             DCELRHRVEA IVAYSDKFVH NIQDNQRIRY NQLMRAFTMS AAETARKTRE 
             DCELRHRVEA IVAYSDKFVH NIQDNQRIRY NQLMRAFTMS AAETARKTRE 
             DQELQHRVES LAAFAERYVD KLQANQRSRY GLLIKAFSMT AAETARRTRE 

             FRSPPQDQVL LLTNFKHSLE EEECPVPDNV RETLKEFHND LLLHCGIHIE 
             FRSPPQDQVL LLTNFKHSLE EEECPVPDNV RETLKEFHND LLLHCGIHIE 
             FRSPPQEQIN MLLQFKDGTD EEDCPLPEEI RQDLLDFHQD LLAHCGIQLD 

             EEPVEEEVDT SLRGRLLSLV DKIKSIRGKK TEEKPE.VEE ETKPSTLQEL 
             EEPVEEEVDT SLRGRLLSLV DKIKSIRGKK TEEKPE.VEE ETKPSTLQEL 
             GEEEEPEEET TLGSRLMSLL EKVRLVKKKE EKPEEERSAE ESKPRSLQEL 

             ISHTMIHWAQ ESFIQNPELV RLMFSLLHRQ YDGLGELIRA LPKAYAINAV 
             ISHTMIHWAQ ESFIQNPELV RLMFSLLHRQ YDGLGELIRA LPKAYAINAV 
             VSHMVVRWAQ EDFVQSPELV RAMFSLLHRQ YDGLGELLRA LPRAYTISPS 

             SVQDTMDLLE CLGQIRSLLI VQMGPEEERL MIQSIGNIMN NKVFYQHPNL 
             SVQDTMDLLE CLGQIRSLLI VQMGPEEERL MIQSIGNIMN NKVFYQHPNL 
             SVEDTMSLLE CLGQIRSLLI VQMGPQEENL MIQSIGNIMN NKVFYQHPNL 

             MRALGMHETV MEVMVNVLGG GGDSKEIRFP QMVTNCCRFL CYFCRISRQN 
             MRALGMHETV MEVMVNVLGG GGDSKEIRFP QMVTNCCRFL CYFCRISRQN 
             MRALGMHETV MEVMVNVLGG G.ESKEIRFP KMVTSCCRFL CYFCRISRQN 

             QRSMFDHLSY LLQNSSIGLG MRGSTPLDVA AASCIDNNEL ALALQEQDLE 
             QRSMFDHLSY LLQNSSIGLG MRGSTPLDVA AASCIDNNEL ALALQEQDLE 
             QRSMFDHLSY LLENSGIGLG MQGSTPLDVA AASVIDNNEL ALALQEQDLE 

             MVVTYLAGCG LQMCPMLLSK CYPDIGWNPC GGERYLDFLR FAVFVNGESV 
             MVVTYLAGCG LQMCPMLLSK CYPDIGWNPC GGERYLDFLR FAVFVNGESV 
             KVVSYLAGCG LQSCPMLVAK GYPDIGWNPC GGERYLDFLR FAVFVNGESV 

             EENANVVVRL LIRRPECFGP ALRGEGGNGL LAAMEEAIKI SEDPARDGPT 
             EENANVVVRL LIRRPECFGP ALRGEGGNGL LAAMEEAIKI SEDPARDGPT 
             EENANVVVRL LIRKPECFGP ALRGEGGSGL LAAIEEAIRI SEDPARDGPG 

             VKKDRRF.MF GGEEQQEENR VHLGNAIMSF YSALIDLLGR CAPEMHLIQA 
             VKKDRRF.MF GGEEQQEENR VHLGNAIMSF YSALIDLLGR CAPEMHLIQA 
             IRRDRRR.EH FGEEPPEENR VHLGHAIMSF YAALIDLLGR CAPEMHLIQA 

             GKGEALRIRA ILRSLVPIED LVGVISLPVQ IPSYGKDSQI VEPKMSASFV 
             GKGEALRIRA ILRSLVPIED LVGVISLPVQ IPSYGKDSQI VEPKMSASFV 
             GKGEALRIRA ILRSLVPLED LVGIISLPLQ IPTLGKDGAL VQPKMSASFV 

             PDHKASMVLF LDRVYGIDNQ DFLLHVLEVG FLPDMRAAAS LDTVAFSTTE 
             PDHKASMVLF LDRVYGIDNQ DFLLHVLEVG FLPDMRAAAS LDTVAFSTTE 
             PDHKASMVLF LDRVYGIENQ DFLLHVLDVG FLPDMRAAAS LDTATFSTTE 

             MALALNRYLC SAVLPLLTKC APLFAGTDHR AIMIDSMLHT IYRLSRGRAL 
             MALALNRYLC SAVLPLLTKC APLFAGTDHR AIMIDSMLHT IYRLSRGRAL 
             MALALNRYLC LAVLPLITKC APLFAGTEHR AIMVDSMLHT VYRLSRGRSL 

             TKAQRDVIEE CLMSLCKYLR PSMLQHLLRR LVFDVPILNE YAKMPLKLLT 
             TKAQRDVIEE CLMSLCKYLR PSMLQHLLRR LVFDVPILNE YAKMPLKLLT 
             TKAQRDVIED CLMSLCRYIR PSMLQHLLRR LVFDVPILNE FAKMPLKLLT 

             NHYERCWKYY CLPNGWANFG VTSEEELHLS RKLFWGIFES LAHKKFDAEL 
             NHYERCWKYY CLPNGWANFG VTSEEELHLS RKLFWGIFES LAHKKFDAEL 
             NHYERCWKYY CLPTGWANFG VTSEEELHLT RKLFWGIFDS LAHKKYDPEL 

             FKIAMPCLCA IAGAIPPDYV DASYSSHTEK KASVDAEGNF DPKPVETTNT 
             FKIAMPCLCA IAGAIPPDYV DASYSSHTEK KASVDAEGNF DPKPVETTNT 
             YRMAMPCLCA IAGALPPDYV DASYSSKAEK KATVDAEGNF DPRPVETLNV 

             IIPERLDAFI NKYAEHTHDK WAFEKIQNNW TYGEVLDEDA KTHPMLRPYK 
             IIPERLDAFI NKYAEHTHDK WAFEKIQNNW TYGEVLDEDA KTHPMLRPYK 
             IIPEKLDSFI NKFAEYTHEK WAFDKIQNNW SYGENIDEEL KTHPMLRPYK 

             TFSEKDKEIY RWPIKESIKA MLAWEWTLEK ARDGEGEVEK KAATRKISQT 
             TFSEKDKEIY RWPIKESIKA MLAWEWTLEK ARDGEGEVEK KAATRKISQT 
             TFSEKDKEIY RWPIKESLKA MIAWEWTIEK AREGEEEKTE KKKTRKISQS 

             AQATYDPSHG YSPQPIDISG MTLSRELQSM AEQLAENYHN TWGRKKKVEL 
             AQATYDPSHG YSPQPIDISG MTLSRELQSM AEQLAENYHN TWGRKKKVEL 
             AQ.TYDPREG YNPQPPDLSA VTLSRELQAM AEQLAENYHN TWGRKKKQEL 

             QSKGGGTHPL LVPYDTLTAK EKARDREKAQ DLLKFLQLNG YAVTR..GMK 
             QSKGGGTHPL LVPYDTLTAK EKARDREKAQ DLLKFLQLNG YAVTR..GMK 
             EAKGGGTHPL LVPYDTLTAK EKARDREKAQ ELLKFLQMNG YAVTRHAGLK 

             DMEQDISSIE KRFAYGFLQK LLKWMDIAQE FIAHLEAVVS SGRVEKSPHE 
             DMEQDISSIE KRFAYGFLQK LLKWMDIAQE FIAHLEAVVS SGRVEKSPHE 
             DMELDSSSIE KRFAFGFLQQ LLRWMDISQE FIAHLEAVVS SGRVEKSPHE 

             QEIKFFAKIL LPLVNQYFKN HCLYFLSTPA KVLGSGGHSS NKEKEMIASI 
             QEIKFFAKIL LPLVNQYFKN HCLYFLSTPA KVLGSGGHSS NKEKEMIASI 
             QEIKFFAKIL LPLINQYFTN HCLYFLSTPA KVLGSGGHAS NKEKEMITSL 

             FCKLAALVRH RVSLFGTDAS AVVNCLHILS RSLDARTVMK SGPEIVKAGL 
             FCKLAALVRH RVSLFGTDAS AVVNCLHILS RSLDARTVMK SGPEIVKAGL 
             FCKLAALVRH RVSLFGTDAP AVVNCLHILA RSLDARTVMK SGPEIVKAGL 

             RQFFESAADD IEKMVENLKL GKVSSRNQ.V KGVSQNINYT TIALLPVLTS 
             RQFFESAADD IEKMVENLKL GKVSSRNQ.V KGVSQNINYT TIALLPVLTS 
             RSFFESASED IEKMVENLRL GKVSQARTQV KGVGQNLTYT TVALLPVLTT 

             LFDHIAQHQF GDDVILDDLQ ISCYRIMCSI YSLGTVKTPH AEKQRPALGE 
             LFDHIAQHQF GDDVILDDLQ ISCYRIMCSI YSLGTVKTPH AEKQRPALGE 
             LFQHIAQHQF GDDVILDDVQ VSCYRTLCSI YSLGTTKNTY VEKLRPALGE 

             CLAHLAAAMP VAFLEPTLNE FNTFSVYTTK TPRERSILGL PSQVEELCPD 
             CLAHLAAAMP VAFLEPTLNE FNTFSVYTTK TPRERSILGL PSQVEELCPD 
             CLARLAAAMP VAFLEPQLNE YNACSVYTTK SPRERAILGL PNSVEEMCPD 

             IPELEVLMKD IHDLAESGAR YTEMPHVIEI TLPMLCNYLP RWWERGLEN. 
             IPELEVLMKD IHDLAESGAR YTEMPHVIEI TLPMLCNYLP RWWERGLEN. 
             IPVLERLMAD IGGLAESGAR YTEMPHVIEI TLPMLCSYLP RWWERGPEAP 

             ...FPEQEGQ ICTSVTSEQL NQLLGSIMKI VVNNLGIDEA SWMKRLAVFA 
             ...FPEQEGQ ICTSVTSEQL NQLLGSIMKI VVNNLGIDEA SWMKRLAVFA 
             PSALPAGAPP PCTAVTSDHL NSLLGNILRI IVNNLGIDEA SWMKRLAVFA 

             QPIVSRAKPE MLKSHFIPTM EKLKKRCGKV VAEEDHLRME GKTEVDSENG 
             QPIVSRAKPE MLKSHFIPTM EKLKKRCGKV VAEEDHLRME GKTEVDSENG 
             QPIVSRARPE LLQSHFIPTI GRLRKRAGKV VSEEEQLRLE AKAEAQEGEL 

             TIRDEFAVLC RDLYALYPLL IRYVDNSRAR WLTNPDPDAE ELFRMVGEVF 
             TIRDEFAVLC RDLYALYPLL IRYVDNSRAR WLTNPDPDAE ELFRMVGEVF 
             LVRDEFSVLC RDLYALYPLL IRYVDNNRAQ WLTEPNPSAE ELFRMVGEIF 

             IFWSKSHNFK REEQNFVVMN EINNMSFLTA DSKSKMSKS. ........GG 
             IFWSKSHNFK REEQNFVVMN EINNMSFLTA DSKSKMSKS. ........GG 
             IYWSKSHNFK REEQNFVVQN EINNMSFLTA DNKSKMAKVG ACPVSPQSGG 

             SEQERTKKKR RGDRYSVQTS LIVAALKKLL PIGLNMCSPA DQELINLAKI 
             SEQERTKKKR RGDRYSVQTS LIVAALKKLL PIGLNMCSPA DQELINLAKI 
             SDQERTKKKR RGDRYSVQTS LIVATLKKML PIGLNMCAPT DQDLITLAKT 

             RYSLKDTDEE VREFLHNNLH LQGKVE.DPA MRWQMSLYKE MAGKAEDAED 
             RYSLKDTDEE VREFLHNNLH LQGKVE.DPA MRWQMSLYKE MAGKAEDAED 
             RYALKDTDEE VREFLHNNLH LQGKVEGSPS LRWQMALYRG VPGREEDADD 

             PEKVVKRVQE VSAVLYHIEV TEHPFKSKKM VWHKLLSKQR RRAVVACFRM 
             PEKVVKRVQE VSAVLYHIEV TEHPFKSKKM VWHKLLSKQR RRAVVACFRM 
             PEKIVRRVQE VSAVLYYLDQ TEHPYKSKKA VWHKLLSKQR RRAVVACFRM 

             TPLYNIITHR ATNMFLDAYK RNWLETEGYS FEDKMIDDLS VSLDHIRSE. 
             TPLYNIITHR ATNMFLDAYK RNWLETEGYS FEDKMIDDLS VSLDHIRSE. 
             TPLYNLPTHR ACNMFLESYK AAWILTEDHS FEDRMIDDLS KAGEQEEEEE 

             ....KKPDPL HQLILHFSRT ALTEKMKLDV DHLYMSYADI MAKGFSVSPP 
             ....KKPDPL HQLILHFSRT ALTEKMKLDV DHLYMSYADI MAKGFSVSPP 
             EVEEKKPDPL HQLVLHFSRT ALTEKSKLDE DYLYMAYADI MAKSCHLEEG 

             CSASQ..... ........EK EMEKQRLLYQ QSRLHNRGAA EMVLQMISAC 
             CSASQ..... ........EK EMEKQRLLYQ QSRLHNRGAA EMVLQMISAC 
             GENGE...AE EEVEVSFEEK QMEKQRLLYQ QARLHTRGAA EMVLQMISAC 

             KGEPGAMVSS TLKLGISILN GGNSDVQQKM LDYLKDKKDV GFFLSIQSLM 
             KGEPGAMVSS TLKLGISILN GGNSDVQQKM LDYLKDKKDV GFFLSIQSLM 
             KGETGAMVSS TLKLGISILN GGNAEVQQKM LDYLKDKKEV GFFQSIQALM 

             QTCSVLDLNA FERQNKAEGL GMVSEEGTNE KVMADDEFTC DLFRFLQLLC 
             QTCSVLDLNA FERQNKAEGL GMVSEEGTNE KVMADDEFTC DLFRFLQLLC 
             QTCSVLDLNA FERQNKAEGL GMVNEDGTGE KVMADDEFTQ DLFRFLQLLC 

             EGHNNDFQNY LRTQTGSTTT INVIICTVDY LLRLQESISD FYWYYSGKDI 
             EGHNNDFQNY LRTQTGSTTT INVIICTVDY LLRLQESISD FYWYYSGKDI 
             EGHNNDFQNY LRTQTGNTTT INIIICTVDY LLRLQESISD FYWYYSGKDV 

             IDEPGKRNFS KAMNVAKQVF NSLTEYIQGP CTGNQQSLAH SRLWDAVVGF 
             IDEPGKRNFS KAMNVAKQVF NSLTEYIQGP CTGNQQSLAH SRLWDAVVGF 
             IEEQGKRNFS KAMSVAKQVF NSLTEYIQGP CTGNQQSLAH SRLWDAVVGF 

             LHVFAHMMMK LAQ....... ..DSSQIGLL KELLDLQKDM VVMLLSLLEG 
             LHVFAHMMMK LAQ....... ..DSSQIGLL KELLDLQKDM VVMLLSLLEG 
             LHVFAHMMMK LAQ....... ..DSSQIELL KELLDLQKDM VVMLLSLLEG 

             NVVNGTIAKQ MVDMLVESSS NVEMILKFFD MFLKLKDIVA SDAFRDYVTD 
             NVVNGTIAKQ MVDMLVESSS NVEMILKFFD MFLKLKDIVA SDAFRDYVTD 
             NVVNGMIARQ MVDMLVESSS NVEMILKFFD MFLKLKDIVG SEAFQDYVTD 

             PRGLISKKDF SKAMDSQKQY TPAEIQFLLS CSEADENEMI NFEEFADRFQ 
             PRGLISKKDF SKAMDSQKQY TPAEIQFLLS CSEADENEMI NFEEFADRFQ 
             PRGLISKKDF QKAMDSQKQF SGPEIQFLLS CSEADENEMI NCEEFANRFQ 

             EPAKDIGFNI AVLLTNLSEH VPHDTRLQNF LEQAESVLNY FRPFLGRIEI 
             EPAKDIGFNI AVLLTNLSEH VPHDTRLQNF LEQAESVLNY FRPFLGRIEI 
             EPARDIGFNV AVLLTNLSEH VPHDPRLHNF LELAESILEY FRPYLGRIEI 

             MGASRKIERI YFEISEANRN QWEMPQVRES KRQFIFDVVN EGGESEKMEM 
             MGASRKIERI YFEISEANRN QWEMPQVRES KRQFIFDVVN EGGESEKMEM 
             MGASRRIERI YFEISETNRA QWEMPQVKES KRQFIFDVVN EGGEAEKMEL 

             FVNFCEDTIF EMNIA...AH A......... .......... .......... 
             FVNFCEDTIF EMNIA...AH A......... .......... .......... 
             FVSFCEDTIF EMQIAAQISE PEGEPETDED EGAGAAEAGA EGAEEGAAGL 

             .....PESTS AFADFLKSVV NFFNMFTFRN LRRRYRRFRK MTVKEMVIGL 
             .....PESTS AFADFLKSVV NFFNMFTFRN LRRRYRRFRK MTVKEMVIGL 
             EGTAATAAAG ATARVVAAAG RALRGLSYRS LRRRVRRLRR LTAREAATAV 

             ATFVYTVVMG ILMFVYSICK GFFTLIWKVL FGGGLVESAK KMTVTDILAS 
             ATFVYTVVMG ILMFVYSICK GFFTLIWKVL FGGGLVESAK KMTVTDILAS 
             AALLWAAVTR AGAAGAGAAA GALGLLWGSL FGGGLVEGAK KVTVTELLAG 

             MPDPTQDEVH GELPPEPGSR EDQD..TEGG ADLLDPVGGE EEEEDSEERE 
             MPDPTQDEVH GELPPEPGSR EDQD..TEGG ADLLDPVGGE EEEEDSEERE 
             MPDPTSDEVH GEQPAGPGGD ADGEGASEGA GDAAEG.AGD EEEAVHEAGP 

             GGRLPGFNTP .......... GGLGDFGETT PEEPPTPEGT PLLKRKLVSR 
             GGRLPGFNTP .......... GGLGDFGETT PEEPPTPEGT PLLKRKLVSR 
             GGADGAVAVT DGGPFRPEGA GGLGDMGDTT PAEPPTPEGS PILKRKLGVD 

             HNQIGGQGEE ENAEHEEPPQ ETEKADTENG EKAKKPEAEP EVKEEEPVEE 
             HNQIGGQGEE ENAEHEEPPQ ETEKADTENG EKAKKPEAEP EVKEEEPVEE 
             GVEEE..LPP EPEPEPEPEL EPEKADAENG EKEEV....P EPTPEP.... 

             EEITVKAKAK KSKKPVEEGF ELWNELEIQR VKFMNYLSRN FYNLRYLALF 
             EEITVKAKAK KSKKPVEEGF ELWNELEIQR VKFMNYLSRN FYNLRYLALF 
             PKKQAPPSPP PKKE..EAGG EFWGELEVQR VKFLNYLSRN FYTLRFLALF 

             IAFALNFILL FYKVSDSPP. GEED.....F EGSGLFEGSG LFEGSGVQED 
             IAFALNFILL FYKVSDSPP. GEED.....F EGSGLFEGSG LFEGSGVQED 
             LAFAINFILL FYKVSDSPP. GEDD.....M EGSAAGDVSG AGSG.GSSGW 

             GSGLDDGGED DDEEGPLYYF LEESTGYMEP AMAFLSIVHT IISFLCIIGY 
             GSGLDDGGED DDEEGPLYYF LEESTGYMEP AMAFLSIVHT IISFLCIIGY 
             GLGAGEEAEG DEDENMVYYF LEESTGYMEP ALRCLSLLHT LVAFLCIIGY 

             NCLKVPLVIF KREKELARKL EFDGVYVTEQ PEDDDIKGQW DRLVLNTPSF 
             NCLKVPLVIF KREKELARKL EFDGVYVTEQ PEDDDIKGQW DRLVLNTPSF 
             NCLKVPLVIF KREKELARKL EFDGLYITEQ PEDDDVKGQW DRLVLNTPSF 

             PNNYWDKFVK RKVLDKYGDI YGRERIAELL GMDLASLDVS AMTHEKKPEP 
             PNNYWDKFVK RKVLDKYGDI YGRERIAELL GMDLASLDVS AMTHEKKPEP 
             PSNYWDKFVK RKVLDKHGDI YGRERIAELL GMDLATLEIT AHNERK.PNP 

             DTSMFSWITS IDIKYQIWKF GVVFTDNTFL YLVWYFLMSI LGHYNNFFFA 
             DTSMFSWITS IDIKYQIWKF GVVFTDNTFL YLVWYFLMSI LGHYNNFFFA 
             PPGLLTWLMS IDVKYQIWKF GVIFTDNSFL YLGWYMVMSL LGHYNNFFFA 

             AHLLDIAMGV KTLRTILSSV THNGKQLMMT VGLLAVVVYL YTVVAFNFFR 
             AHLLDIAMGV KTLRTILSSV THNGKQLMMT VGLLAVVVYL YTVVAFNFFR 
             AHLLDIAMGV KTLRTILSSV THNGKQLVMT VGLLAVVVYL YTVVAFNFFR 

             KFYNKSEDED EPDMKCDDMM TCYLFHMYVG VRAGGGIGDE IEDPAGDEYE 
             KFYNKSEDED EPDMKCDDMM TCYLFHMYVG VRAGGGIGDE IEDPAGDEYE 
             KFYNKSEDED EPDMKCDDMM TCYLFHMYVG VRAGGGIGDE IEDPAGDEYE 

             LYRVVFDITF FFFVIVILLA IIQGLIIDAF GELRDQQEQV REDMETKCFI 
             LYRVVFDITF FFFVIVILLA IIQGLIIDAF GELRDQQEQV REDMETKCFI 
             LYRVVFDITF FFFVIVILLA IIQGLIIDAF GELRDQQEQV KEDMETKCFI 

             CGIGSDYFDT TPHGFETHTL EEHNLANYMF FLMYLINKDE TEHTGQESYV 
             CGIGSDYFDT TPHGFETHTL EEHNLANYMF FLMYLINKDE TEHTGQESYV 
             CGIGSDYFDT TPHGFETHTL EEHNLANYMF FLMYLINKDE TEHTGQESYV 

             WKMYQERCWD FFPAGDCFRK QYEDQL. 
             WKMYQERCWD FFPAGDCFRK QYEDQL. 
             WKMYQERCWD FFPAGDCFRK QYEDQLS