File: test.phylip

package info (click to toggle)
fasttree 2.1.11-2
  • links: PTS, VCS
  • area: main
  • in suites: bookworm, bullseye, forky, sid, trixie
  • size: 640 kB
  • sloc: ansic: 8,064; sh: 28; makefile: 18
file content (820 lines) | stat: -rw-r--r-- 53,256 bytes parent folder | download | duplicates (4)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
 204 197
N3289      --RNRSCRRD NTNGQDLQAA LAIFAAKVYV GVALQSVQVA AGIGKHPVYK
N1763      ------ISKD TTEERFLEVD KLTFAPKSYA GTLQTKILSA VSVPAGTLYK
N2100      --RGRARPKQ TTAESNLDAT MGKFASQEYD GTMHRELGAA SGVSLGTLYP
N774       --RGRRRTKT IVSEKDLSAT MGRFAEQPYD GSLERNAATA ASAPLNTMYG
N211       --KARGRTTI ETGEKVLTGE MDRFAELQYD GSLQRDDTTG AAPPLGTLYG
N747       MGKARGITTA YAYSQVLIGR LGAHAALPYN GSLERKDVAA LDAPTNKLYG
N952       MGRGRARTTV EAGEKVLLGT MIRFAELPHD GSLQRNDSTA LAAPLNTLYA
N3964      ------RTTV EDNDKVLNAT MDRFADLPYD GSLQRDDTTA QTAPLGTLYG
N3613      LGRGMARTTV EDLETVLNAT MDRFAQLPYD GSLQRDDTTA ASAPLGTLYG
N1689      MKLGRYRTVQ TANEKYLETT AGRYADQNYA GTAQRGVQKA NSVPLGTLYP
N3700      MKMGRPRTKQ STSQRYLDTA GARYDDQAYA GTLQRGLGNA KGVPLGTLYL
N1275      ----RARSRE DQEEKFLSAA AQDFAEQTYT GTPQKEIPAA VDAPLGTLNR
N4054      ----RNKTRK DRLESFLTAA LQEFAELSFD GTVQHEIGAA IGVPLGTLNK
N824       ----RMKGRK ERLETFLAAK LQKFAEMSYT GTVQHEITAA VGVPLGTLNK
N1723      ----RRKGRK NRLESFLAAA LAEFAELSYT GTVQQDISKA VGVPLGTLNK
N1427      ----RPKGRK DRLESFLTAA LQEFAELNYT GTVQHDINTA VGVPVSTLNK
N2798      ----RPKTRN HPLQSFLDAE LEDFADQANG GTLQRELHAA NGIPLGTLGR
N186       ----RPRNRT DTNESFLPPD LEEFADPAYA GKLQREVPAA VAVPLGTLGQ
N1758      ----RPRTRN EAEAKSLKAM MEGFADKAYG GIHLREEEVA TGIPTASLYH
N2116      MKRGRHRTRI TQSEKLLRAA LKKFSNAAYA GKLEPEKPAA AHSASGTLYR
N4976      ----LGKTRK RFSEKGLGAA FGSYAEQRYA GTHQKNKVTA IGSHLGNLYS
N3479      ----HPRKRE HCWQRVLDAS LGSFTEESYG GTVQKEIDGA GGVPLVILFK
N3618      ----HPRRRN KCWQRKLETS IGSFTEESYG GSLQKQKKAA AGVPLVILYR
N4560      MATSRARTRA SMGEAFLHAI MACFDEVKYA ALLQCEIKAA DGMPLSALFR
N4920      ------RAPN ATVTKFLAGN LFSCAADPYI GSAEPMILPA ADVALGSLYL
N2156      ---KRPKARE TVNGKALIAP MMAYGEKEYA GTLQPSMLSG GKLALGGLYQ
N582       ------KARN TTNTDFLKIA LFPFAMKSYS GTLMRLVHSA AAAALGTLSK
N3146      --------RF HPPENYHNAP FGSFQEKSIA GTDEKHVYSS MAVPGATLYH
N3588      ---------- --PERFLTAT MGSYEGKSIA GTEQQHFKAA GGVPLGTLYK
N3750      ----KRRRRE KKDERFLCAS LGDFAKCSYT GILEKQIPNG QGLAASTLYP
N61        --------RE EDAERFLASA LGAFAARSYS GTIEREIPAA SKLPLGTLYR
N2709      ----RRVRAE DSPDRILVRA LGTFADQDYP GTLMSSVKAD KEIPGGTLYK
N1849      ----RKRRRD ATPEKLLDAA LGSFAEDTYA GKLLRDIHAA GGVKAATLYK
N409       ----PCRRRE ATIERFERSA LGSFAEKANG GSIMIGVPTS CNIPKGSLTP
N3956      ----IASRAE AAGDRFLLMA LGRFAGRCYN GTLKRQVGTS CAVPISNLFS
N3891      LESGGIFGWV ISKERFLQGP LGESAGQGYA GTSKQGPVNA CPDPKQALYG
N1906      LKSSGIFGWI DAKEQYLEGP LEQSAEHEYP GTSKQGPFAA TGSPMDALYG
N274       LKRRGHFSWK DSQERYLEAP LAQPVEGGYP GTLKKEPPPA HGSPLEAFNT
N710       LKRHGSFSWS DSKERYLEAP LAQSVEGAYP GILKKDPPPA RGSPLEAFDK
N4840      LKRRGTFSWS DSKERYLHAP LAQSVEGAYP GTLKKDPPPA VGSPLEAFNR
N1895      ---------- -----YLIAA GGLFAIKNYT GTLPNDVELA AGVPLGTLYV
N1629      ----SAHSPT EGRAQFLLAD LVNFAQAKYA GALKDRIRAA QGVPAGTFYA
N1543      ----NHTARE SKREHYLSAE LAPFTSGKYA GSLKRKIRAA NGIPFGALYL
N2040      ----NHTARE SKREHYLSAE LAPFTSGKYA GSLKRKIRAA NGIPFGALYL
N2366      MKHGCARRKP HERESFLAAE LGFFGKAIYA GTELRAIPTA KGGPLKTLYR
N1614      --HKCSRRRQ HACSKFLGAA VEYFGRIAFL GTLVRAICTA KGVPLSNLTK
N4609      MKHKCRRRHD HAIEKFLTDA IGYYCKTGFG STLMEAIGTA RGVPISTLYK
N4738      MKHQCRSKRE HARERMLTPA VIKYGKTGFG STLMRVITTA KGVPLSTLYK
N359       -----NRSRE HAEEHVLEPA LGIYGKKAYG GTLMRRLGHA IGVPLSPLYR
N4656      -VRKRNRRRA HAIERFLEAA LGIYGQRPFG GTLMRRFGEA AGVPLGPLYP
N1195      -VRKRHRRTA HAIDRFLSLA LGLYGQRAYG GTLMRRFGEA GGLPLGPLYP
N2006      ---------- ------LEAA LGIYGVRAFG GTLMQRFGEA AGVPLGPLYV
N3986      -------RSQ HRPEQYLNAA HGGLAHKTYG GNLMADTPAA SSTPLGTLHK
N3972      ------RRKQ HRPEQYLNAA HGGLAHKLYG GNLMSEVPAA SSTPLGTLHK
N891       MINNHLKRKD EDDEKFHPGS LGIFVANSYV GTIKNNIPAA GSVAFGTLPA
N4810      ----QRRKRD ESRQKCQTVA LFLYAEPRYA GTLKEDIQPA GSVSYGALSM
N3664      ----RKKRHE QGMGKLQTAA LFVLAEKPYA GTIRSYIPAA FPVGFGSLYR
N1846      ----RMDARH QGGAKFLPAS FFLFAELSYP GTVKKEIPAA GCVSFGTLTK
N4110      --QYRKRRKE DSNEVFLEDV GNLFINHSYA GTRKKEVQGE SAVSLGTLER
N3217      MDQKKRKRQN NGKEMFLASA TGFFHDVGYS GTLKKNIATA GKVSVGTLLR
N998       ----KNRRQK QKEETFLEPA LAYFVQKSYT GTLKPNIAVA GKVSLGALYD
N773       ----RRQRRH GPREKSLDAT LALGVNSSYA GTLTKEVNTV KSINLNTAYR
N1479      -QRKRAKRRS ENVKTFLKAT LEFSVEAAYA GTIKDDYGCA PGISLGTAYR
N485       ---DKSHKRE DSREKFLRDS LGLFVDAGYK GTLSKDKYSA YAVSGSVLVR
N3916      ----RSKRRH ESEQVNLEAK LGLYVDKAYK STPKKDKATS NSVGFAALYK
N1772      ----RKRKRQ ETTTKFLSAA FGAFLDSSYK GTLKKDKGNA AHVSVGTLYR
N3426      --RGRCRRRN ESRAHFLEAQ LFEAVDKTSL GTSEKDSMVA IQLTLGTLYM
N3880      MQRDRCRRRD ESREAYLGAQ LGQFVNKSYE GTTKSDALVA EGTTLGTMYS
N2806      ---DVSRRRD ESPDKFADGT LGSFAEKSYI GSLVKGLLVA GGVVLSALYL
N1164      MKHKVSRKRD DSRDKYLEAA IGVFAGNGYV GPLVKNLMVA CTIVLSTLYK
N4802      VKNKVSRRRE KGHHYFLGPP LGAYAEMSYS GILMRNLRAA QGAALPKLYW
N2656      MKKNVGKRHE NIPERFLIVA IGSFPEPTYA GTLMADLVAA CNMVLAKLYR
N1645      ----RKRKRS TSDEAFLHAG LGSFGAMSYA GTMMKNAKFA AKFAIGTLYL
N3165      ----RWQRRE DSKEKILAAT PGAFAGGLYA GTLVKKVVGA GALPGGTLYN
N1         ----RWQRRE NSKERFLDTA LGVFAGGSFA GTLVRNVKGA AGLLGGTMYN
N3411      ----RKKRRS SNNEQLLKSH VDLGLQEGYP GKLLRQILNA TGVPNDTLYK
N976       ----RNKRRD ESKEQFLPAV VGLFADRAYH GTFIREVFAA NVVPTKTLVN
N991       ----RARKRE ANASRVLAPE -GLFAKKGYA -ALMREVGEA DEIPKATLFF
N3274      --VKRKVRRA DHRAKILITG LTFSGVTGDE GTLSRESRAA RVVPKGGIYR
N4584      -KNKRRLVAE NVRARFLQGC LQSLGVIGFA STFVNAGKAA GNGKQETVAW
N1827      -KRQRPIRRE PPREGFLKGC LVLMEVVGFS GAEVQPGKAA GEGPLPYLYW
N3850      ----RHRRKE ERKPELNNGA LTKFGKTGFP GEFRMGKHEA RGVPKPNLYR
N1869      -TSTRKVAPD DSASRSLAAG LQAIGDKLFA GTMLKKSRIP GGLPVPDLYG
N4480      ----RMRLSQ STTTQILAAN LQRINAKGFA GNLMSCIKGA PAVGHRTYFE
N3834      ---SRHRMRE DDASRFLETA LAPLGEKVFA GKLMKSIAGA GFLHRGILYP
N1329      MRPLRRPRKL QNSETYMQAA LELLSEKGFA GTLMHGADAA SVVAKGTLYN
N2496      MRRKRRRARD ASDERFLDTL LEVFAEKGFA GTLMRAAEPA SGVARGTLYR
N4638      MKKKRQRVRD QSDSRFLQAV ANVFVEPPFA GSLIRAIEPA TGVKRAMLYR
N2235      ----RRRVKD QSNARFLSKI LKIFVETGFA GTLIRAIDPA SGRARGTLYT
N3926      ------ERVI DEPERFLAAV LHTFSELGYA GTLMTDFPNA DNHPKKTLYR
N2752      ----RPRKRI DSGSAFLDAV TPTFMGRGYT GTIMKHIHNA SGVPQGTLYP
N3163      ----RPRKRN ESSEVFLAAV IPKFMGKGYE GTLIKNIPNA ALVPKGTLYS
N65        ----RPRKKL DGEAVFLPGV VATFLATVYN GTLIKHIHDA FSVPKKTLFP
N3322      ----RPRKKI EGSAVFLPAV TATFVGRGYN GTLIKHIQNA LSVPKKTLYP
N2129      ------RFRG IDDERFLDHV LEIHNDCDFT GTIMGAIREA GGVPKDTLYK
N1490      --PKRKRPQV NHKESFLYAG LDGIAEVGYT STLMVAFKQA AGIPKGTLYR
N2443      --PKRTRRQK GHKQWSLNQG LDLFAKCGYT GTEMVNFADA RGIPKGTLYK
N64        -RPKRTRRQK GHKQWSLNQG LDLFAKCGYT GTEMVNFADA RGIPKGTLYK
N2929      --RKRIRRHI DSRDRSLTAI LSLFTEAGYA GTLKNDLQSA AGVPKGSLYR
N3691      ----RRRPKL YQEERFISPV LQLFQEYGEA GSLMDDIQAA QDVSSATLFR
N3845      ----RRRPKL YQHERFISPV LQLFQEYGEA GTTMKDLTAA QAVPSATLFR
N2199      -KRRRRRPKL YQGELTLSPV MQLFAQYGEA GNLMKDLSAA QAIPKGTLFR
N3143      -KRRRRRPKL YQGELTLSPV MQLFAQYGEA GNLMKDLSAA QAIPKGTLFR
N4466      MRRKQRKKRI SEMGCLLFAI LEQFQEKGKA STLLGEIASA GSIKKGALYA
N1295      --RKRQRRKI DKGARFLGAV LTGFEEKAYG GTIVKEIQVA GAVAKSSLYR
N2472      MRLKRARRRV KDMETALSAT LELFVDRGYV GDLKNDKPGA TGVPDAATYR
N3439      ----RSRRRA ETDNTFLSTV LKVFGEKSYA STLMQDNPAA QGGPVEMLYR
N254       -RLRRGTRRT ETSEVYLHSV MELFADKAYT GTLMKDTPTA HDVEVSMMFI
N780       ---MRERKRL PPPERFFHAV LDRFHSKGFS GTLMRTLQAD AKLPNPVQYR
N3710      ---MKKRKRE DQPERHFNSV LEIANSMEYA GSLVGDRFAD PKGPTANLFK
N2222      ---TAVQRRG DEKQSLFQTI VSLFKNNDYA GTLMSRIEAD SRVESATLLI
N1033      ---LRPKKRI EDDESMFYAV LEVFNIKGYP GTLMQDINGD ITETGGTLYK
N819       ---TRTRKRS DHQEPLFHAV IEFFNDKGYA GTLMGHIHGD SADPGGTLYR
N3296      ---TRLRKRR ENKEPLFHAV IEFFNDKGYA GTLMGHIHGD SKEPGGTLYR
N2386      ---TRLRKRR ENKEPLFHAV IEFFNDKGYA GTLMGHIHGD SKEPGGTLYR
N2859      ----RGKKRN EGNQRVASVV LDYFLKNGVC GTYLTQTKEA SPLEKDTLYT
N2870      ----RGRKRN EGNQRIASVV LDYFLKNGVC GTYLTQTKEA SPVEKDTLYT
N530       LRTKRKSRRI DAVTQFLVAV LELFNDKGYA GTSKKEIHAA AHVPKETLYQ
N2077      LRTKRKSRRI DAVTQFLVAV LELFNDKGYA GTSKKEIHVA AHVPKETLYQ
N4820      -RIDSCKERI EQRERFLVSN LELYAGKGYP GTLMLDVPGA AHVKKGTLYK
N4729      IRKKRSRKRS DHSTRIFVAN LNLYVEHGNS GVLMLNPSAA IVVPRPSLYR
N2037      MRRNRKRKRV EDRERFLSAV LDLYVEMDNA GVLMLDASEA ATVPRATLYR
N2762      -RVFRQRRKE EGLERKLSAI LRLFAPIGYS GTFLVAMNGA IGVLRPALYC
N1664      MRAFRSRRKN DQIDRLLNKS LGLFTATGYS GCYMLAMQAA IGMAQDKLYW
N2718      -RFFRYHRKA KEEQKILTEI RRLFANRPFG GCFKRGVGNA VNVTREKFYW
N3808      -RVFRQHRRD RAIEELLTAV IRLFAAMGFS GCFRRGMQAA VGVAKAKLYW
N4663      -RVSPQHRRD RAIEQLLVAI FRLFDAMGFS GCLKRGMKAA VGVAKAKLYW
N3041      -RVSPQHRRD RAIEQLLVAI FRLFDAMGFS GCLKRGLKAA VGVAKAKLYW
N2124      -RGFRTVRKD DAIEDILLII LHLFEEPTYR GCFMKGLDNA VSVAKPKLKW
N677       --AFRARRKD DSIEKVLNVI LKVFAPEGYS GCYMKGMPKA LGVAKPKLYW
N4726      ----RNKRKD DSSEHILAAI LKLYVAEGYS GCYMIGMDDA TAIDKAKLYW
N1598      --DFRAKRKD DASERILGAI LGLFAAAGYF GCYMIGMQDA KSVAKAKFYW
N2067      MRIFRERKKE DGVERCLDAI LELFAGSNCS GTYMRSFGAA SVVAKSTLYR
N3053      --VFRDRKLE GGVERYLPAI LDLFADAACS GLVMRSLGAS SIVAKPTLYR
N887       MRTGKKPRKK LGIERILPVI MGRFAERGYG GEFMRQMYPT AGVARPTLYR
N304       ---FRDKRKD DGAEHFLVVI VALYESAEYA GEQRKTKQAA VSVGQGTLFK
N3025      -RTIHNRRSD DGIHRFLIVI GNLFAAGNYS GEIVCARKAA DSVIKGTLFL
N2375      QRTIRGRRDH TVRSSFLAGI LNEFASSDIS GQQMKTKHAA KEVIKPTLFR
N2461      MRTFRPIRDE DGDERFLVSI LDLFAASDYS GQEMRTKELA SAVISPTLGP
N1347      ---------V HGQERLLIAV LELFAASDYA GEFMTGRRAA AGVAKSTLYK
N240       -RMFRNRRKK EGIERYLINI KTLFAAADYG GEFMRAIHAA MGAAKPTLYQ
N1677      ----RPRAKN EGIERVLCAI MDLFAADEYS GEFMQARPAA VGVAKATLYS
N2652      ---FRPRAKA GAGSRGLCAL MGLFAAMDYS GEFLRAMHEA QGVAKPTLYI
N4407      ----RPRGKN EGLKRGLCAI MHLFGAAIYS GEFMRAMYEA MGVVHLSLYS
N214       --PFRPRAKN DGLEHGLCAV MGLFGVADYA GEFMNALYEA MGVVKPSLYS
N2998      ----RPIAKN EGVERVLCAI MDLFSAADYS GEFMRAMCAA MEVAKPTLYS
N2277      -RTFRPRAKN EGVERVLCAI MDLFSAKDYS GEFMRRMYAA MEVAKLTLYS
N3785      MRTFRPRAKN EGVERVLCAI MELFSAADYS GEFMRRMCAA MEVAKLTLYS
N393       ----RDRAKA EGAERALCLV MDLFAAADYS GEFMKAMPAA MGVIKPTLYS
N3724      MRTFRARAKA EGPEKALCLV MDYFAAADYS GAFMQAMHAA MGVEKPTLYS
N3388      -RTFRERAKA KGAERSLCLV MQLFAAADYS GEFMRAMHAA MGVEKPTLYS
N2563      MRTFREKAKA EGAERALYLV MDLFAAADYS GEVMRAMHGA VGMEKPTLYK
N3472      MKTFRPRPKN EGAEKVLCAI KCAFAAADYS GEFMQAMHAA LGVIKPTPYS
N4951      MRKFRPRPKN EGGEKLLCAI KDNFAAADYS GEFMRAMHAA MGVVKPTLYS
N4588      ---ERRRRKD QGRDSMLDAV VEMFALAGFV ATFSRGVFAA GHVQGGTLYT
N2946      ---RRRRRKD HATDKPLASG TEPLNLEDFA AIFTRAILAA NLVPGCTLYR
N526       ---RRKRRKE RGADKALKAD VEVFGLEEFD ATLTSAIIAA KGVPGGSLYR
N3781      -RTPRRKPSE DQGQHLMGGI LEEYGALGYQ GTFMGAFWTA GIVPYGLLCW
N3501      ----RGRPSE PTMEHLLDDS LNSCEKQAFT GTFKSDFWTA AIVPKGTLYH
N3701      ---KHKKASQ ASSEVSLEGA YYPYFSKTFA GTFMKEFRTA KVVPHGPLNK
N4971      ---KVKRPGQ PSNEHLLNGS LSKYKNNGYA GTFMKAFHTA AVKEYSTLYK
N3863      -------RTA EHKERKLAGR FHLFAQVGHA GTFMPAFRTA AGVPKGTVYS
N3862      ----RRRRSD VQPPSSLDGA LPLFADQSYA -TFLKVFGAA SGVPPGNLYN
N567       -RTRRWRQKE EGGDKGLLAA LETFAEMVFP GTIVRLFDVA EAVPGETLYD
N1037      ----INRRKT ESLANVLTTS VQFFAAKGYT GNVMIVFHAA GGVPDGTLYT
N2248      --AGRRRRKD TGVSWTLGAG LEFLGGKAFA -TLLRIIPAA VLFPLGNSYR
N2411      ---------- -GLSWPVHAF LPYLADMKYA G-LLKIFCSA VVFPKGDSYR
N1131      ---------- -DICWAVRAG LPYLSNKAYN -TLLRIFAKA KAFPKGNSYR
N1672      --------RS AAGAMYLESE IDYQNDDGHE GYYTASLASA ITGPKATLYC
N289       --ESRHRRRD VSEAKVLESD IDYINDDGYE GYYSADLASA ITGPKAALYC
N2323      ----RHRSSL TSLDTMLGTS LAMVTQM--P GYIVREKPSA YSLPKTTLYH
N4274      ---------D TTNAKILGPV LELETEK--E GYYLMDVAAA FGLPKVPLFK
N1813      ------RKRA KTRQRELPAT ATTEYNAGYT GYLLKAITPA NGVPKFLVYM
N92        ---------- --PEVILPIA QPMITSRGFY GFSLNDAQPF KSVPHFTLFV
N2130      ---------- --TETILCAS LEVISERGFA GFFLPEGEPA IGVPKYTFFV
N893       ---------- --TDTALCSS LEAITERGFA GFFLPEGEPA IGVPKFTFFV
N1626      ------RKRT SAPKNLLKAG LQLFSIKAYA GTLLTPYSPA GPAPKGTVSK
N4805      ---RRGVVSD KGMEQILTAG LNMFSDAAYS GSFLKAYPCA CGIPSGTLYW
N1128      LRPRRVWCRE ESREPSLGAI LELTSGADNA GTFLREIPDA VGVPTNFQYK
N3764      ------QCFE ESVDRSLDAH VGLAGAKGHD GKHMSDTAAA MGVPKGVLNR
N4791      ------RCYE ESIQSLKEAY VVLTTATGYP GKFMKDIGGR PAIPKGVLNH
N3090      --PRQKRCRD ESIEPMLSTT LPITSSKAYP GTFMRAIPAA SGISTGCLYR
N3903      --PKQRKCRD ANIEPKLNTT LTLTSERAYS GTFKREIRAA SGISTACLYR
N4899      --PKQRRCRD ANIEPKLNTT LTLTSEKAYA GTFKREIPAA SGISTACLYR
N3048      --PKRNKCRD DTKGPSLAAV LYLSAAPKYK GTFMPEVPAA SGVPKGTLYK
N198       --PKSKKCRD QNQSRALEAA LQLTASNVYA GKFKAEVAAA GGSPKPTLYR
N2616      --------KE HMKESLLNEG LEMFAEKGFE GKMVWDPAAQ GGITKGSLYS
N1271      LRDTRSRRRS HPKERMLESG LGAFVTKAYA GKLVWPNLAA ASVAKDSIYK
N1668      ------RQKD HMRQRLLSAK LDHFSKKFYS GKLVLGVQCA GGVAKGSLPQ
N744       ------RQKA HMTEWLLEGG LDHLSDRAYG GKLVFDVISA DGSAKGGLKP
N21        ------RQKQ HMKEWLLDAG LDNFSKRLYA GNLVANVRSA GVVEAGRLKP
N2988      ------KRKG HIKERLLRTN LCQFAEKGNR GKVMWDNAVA RGVTNGSTYA
N1369      ---------- QKKERLLENT LNLFHERDFA GKMLWSVESA GGGSKGSLYS
N70        ---------- -LLNILLDST LEDFAQKDYA GKLMWDVTAA VGESKDALFS
N1932      ---SRHRRKN QADQRPLDTS VEAFARQGYS GKLMWIVLTA FKESKGPLYN
N4981      ---------- QLNQRPLETS VEAFARQDYA GKLMWNVLTA NGESKGPLYN
N548       ---------- -VKERLLEAS LKLTADAACP GRLMWKVLEA GGDNKNCLFT
N209       --------KQ HVKERLLEAE LNVFAEKDYV SKLMWDVVAA GGDSTGSLWT
N681       ---PRIKNKK ELSSNKLEAS LAHFSNRGYS GQLMWDVNAA YGLKKSSLYA
N2572      ---ARYRRKK NKGSKKLSAA LNTYGNRGYP DKLMWDVQAA YTCRKWTLYD
N1358      ---------- --KAKSLKTS LSPYSERGFT GKLMYDVDAA DAVSQGHLYN
N1167      ---------- QIKEKNLNTP LGLFVARGYL GKRMWDAEQA RTVAQSSLYL
N360       ---------- QMKEKPLDTP LGLFAARGYL GKFMWDAAAA RAVSHGSLYL
N157       ---NRRAKKD QGAERGLTGE LKTFGEAGYA GKLMWEITGA GAVIKGSHFL

           HIPSKKYTGL IIQELYLERL MAELADGLAD AAPDVLLDIR GLMLALDAPA
           DFPTTELALL VTLEVYQATD TSGAQDGLAA NARDILHVLV ELFLALAGFA
           DYPTWEMLIL VTLESYLEPV VSALYAGLAT DAPDILQR-L QLFLALLGFA
           EFPTQDMFLL MCLESYLIPT VLEADAE-AT EARDVLRRRL QLFLALLGFA
           KLPTQDMFLL FALESYLDPG TPELGQGLAT KAPDGLRKRL HLFLGLLSFS
           QFPDGDSWLL GALEAYIHTC PPELPQSLAT QAPETIFTRL QPYLGLADFG
           KFPTQDMFLL FALESYLHPS SPELGMGLAT PAPDILRKRL ALFLGLLSFS
           KFPTADMFLL NALESYLDPK RPELGQGLAT KAPDALRKKL QLFLGLLAFA
           KSPTADMFLQ FALESYLDPK RPELGQGLAT KAPDALRKRL QLYLGLLSFA
           DLPTRDMLLL VSLESYLESI TAGL-AGLAT KAVTLFKVVL VLFLSVTGFA
           DFPIRDMLLL VTLESYLESI VAGLYA-GAT KAPNLLQAVL ILFLNVVGFA
           DFPTREVLL- -TLEMFLERV VASVQSELAE NAPDVLSLQL DLFLALTGFA
           DFPTKELLLL E--EMYLDRI IAAVNRILAE NAADKLNGLV EIFLALPAFE
           DFPTTELLLF E--EMFLDRL LAEVGKILAE NAADKISGLI PIFLGLDAFE
           DFPTANLLL- -TLEMFLARI ISTRSRILAE NATDKLNGLV KIFLALSAFQ
           DFPTAELLL- -TLEMFLDRV ISAVSRILAE NAADKLSGLV EIFLALSAFQ
           DFPTKDLLL- -TLEIYLEKK VLAVDTGLAP NAPEELHGIV ELLLALAGFA
           DFPTLELLLL L--EMFLERI VSKVKSGLAD KAPDMLQGLT ELFLALVGFA
           DYPSNELIIG TLVEMYLEML GSDVNALLAG DSCDNLQNLV ELFMALGGSQ
           NFPSIALLII AVGEVYLERT LAELESTLAD HA-DQINTIT KLFLALSGFA
           AFPSKEMLKN SVLEMYLERF NAAFVSPLPE AAHEYLHRLV DLFLSLCGFG
           AAPQKELLRV GIIEMYLERF LAAAIGTLTD EASAFLQSVI ELCLALNGFG
           AGPSKEVLLL AIIDMYMERV FGNSIAGLTD SAQNFLQSVV EQCLALNGFG
           DSPSNELLEL PVREKYLDRI LAALDVG--- DTPDSLFPLV DLLLALIGFG
           DHPSSDLVLF TVFELYLEPL ENALIGDLID GAPPSTEWLA DLLNELIRFG
           DYPPLELACL HIFELYLEKS KTLLGSGLID LASTSLAWQA AALFALQTLS
           DYPSEEQLLL TIFELFLDKL HASLIVGLIN VAYDILHWLV SLLIALVGAG
           GYPSMELLAV PVLELYGERL ILAEEALLAS RAAEVL--PV TYYLALIGFS
           DFPSKHMLLV SVLELYLERL KAALDAGLAQ SGAKMLTY-I KLNLALQGFC
           DYPSATGAAM KVLEMYLQRM RATLRESLSD EALIGLQVLA DLLMCLAAFA
           DFPSENSLLH TPLELYLDRL IAAQ-PGLAQ NAPDVLQIDV TLVLELFAFG
           DFPRKEPLFC ----LYLDGL VSAYEAALAG CVTDPIGALV EYLLALGIFS
           DFPSEDALLS SVHELYLDRA SAAHESTLAS TQPEI-ESLV ELVLALAHFA
           DYPSKEALLS PVLELYVEKG TEAFDNKLAG NAPEVL--LV SMLLAHAEFS
           DFPIKEDLLL LVVELYLEKS ITALDTALAG DAAIM-HALV DLLLALSAYT
           DYPSKEPLIH TATDLFVERP AAPLVSELAG DSFKALSEIR TLIIALIAFA
           DYPSKESLIL GVVDLYLERP VAALVSQLAG DASAVLAGLR SLLLALEAFL
           DFPAKEALIL DVVEVYLQRL VAALLANMAG HPSDVLVAQR VLLLALEEFL
           DFPAKEALLL GVVELYTQRL VAALVADMAA NPNDGLPALR VLLLALDEFL
           DFPAKESLIL GVVELYTQRL VAALVANMAG NPNDGLVALR ALILALDEFL
           NIPSLEELLA VVVELYLERP AVLMCLELAK HALDVLEALS ALLLALVKFE
           NFPSNEALGV NVVERHGEEA IAATVSALAE ATPDCAENLR ILLFALSGFV
           DAPSTLVSGL AVLEVYLQRI IATLDIAMAD TVPPTLVGLR ALLVALYSFN
           DAPSTLVSGL AVLEVYLQRI IATLDIAMAD TVPSTLVGLR ALLVALYSFN
           DYPSKESMLL EIYELYLERI LAKYTDSLAD DAVD-LAVLR AAVLALTGFA
           DFPSQERAVA TVNELYQERD TAKLMEKLAE NST--LEVRR VLMLALVGFA
           DFPSQEQLLI GVFELYVERV VAKLKQILAD DAD--LDKLS ALMLAMSPFA
           DYPSKKALLA DVYELYVERR FAKLTDILAD EASNS--ILS ALVLAMTPFA
           NLPSRDALLA FILELYLQRG LAKLVEAL-- IAPEGLKALA ALLLALTAFA
           NFPSRDTMLL FVLELYMQRA IAMLSEALAD DA--GLIGLR PLLLALTGFD
           NFPSRDTLLL FVLELYMQRA VAMLDEPLAK DA--GLIGLR PLLLALTGFD
           NFPSRDTLLL FVLELYMQRA IAML--ALAK DAPNGLIGLR SLLLALTGFD
           DGPSPEILMA PYLEMYLERL IAVIQSSLAD EAP--LAKLT QLLLALGAFT
           DSPSPEILLA PYLEMYLERL IASIQSSLAD EAC--LAALT QLLLALGAFT
           HEPKKD---- TLFELFLERD LPSIHRSLAD HAGKGLVLIK SLLTSLMHFL
           HHPSKDTKLF PLLELYLEQT LASVGKALAD DAGELLTPYR ALLTASVSFA
           HFPSKDGELL SVLDVKLEPS LAALLAQLAG EAGELLATLR ALITALLDFA
           HSPSTDADLL PLLQVYLEKV SSSQKAGLAD DACDQLKTFR KLLMALMSFA
           HLPSKLILMP QLMYLYLEVT IFALETSLAD SAPSLLEKLH PLLAALQAFS
           HGAAKSQIKG TLLYLYLDKN IATLVTKLAD ASRKMLDGLE TKMSALLPAS
           HFPSKKSLLG KLLYLYVERF VAAFSTNLAE GSKPTLDGLR AIDTALAPSQ
           QFPSRTFLKL TLVELYLQKQ AAVADVALAD GAPDRLETLK LILTALTSFD
           DYPSKGFLLL ALLELYVERP YMTLQPVLAS HSPDLPISLK VLLVALLGFA
           HYPSQAEVMF SLLELFLEQG KAPLASMLAD EAP--LEGHR VLITALVTFG
           NFPSKFSLVL C-VLKLLDKA RDSLDNGLAD ISPNRLHSLT TLIKAMVGFS
           HFPSKAALMF TLLELYTDRG GAGLCSLLAD KAK--LAALR ILLSFLIAFS
           HIASKANILL CLLELFLERT LAVLQTALAE QAI--LATIE SLITVLGLFD
           HFARRAAILL CLTELFLERD LDVLQKALAE KAK--LFTVE SLASALSMFE
           HFPSLEVIFL TLLELYLESF IAAYMGILAA GAPDCLGAPA GLLEALILFA
           HFPSKEVLYL LLNALYLARL IAAYPTILAE NAPDQLGRLR QLLTALLALA
           HYPSAEVGIK DLLELYLEEQ IAVMTEALAA GTPHC--ALS ALFTALLAFA
           HYPHAEVLQV SLLELYLAKL LAAADA--AG GAAGRLGYVP PLLTDLLAFD
           HFPSAEAVVL ALLEVYLQRP TAALKHGLAA TAPDCLEILK AFLVALMPFR
           HHPTKEALLL SLLELYLEHL IAGYVHGLAD LAA--LDILR ALLTELLNLA
           HHVTKEALLL GLLELYLERL AAAYIHGLAD LTA--LKIMR ALLTELLSFP
           HNPSKDPFFL AALELILERL LSAL--ALAS AARESLDHLR VLLLGLRTFA
           HPHSSDSLLG YILELFTAKY ISKLNDNLAK DAAKLLAGVR ILILGRRSFG
           NFHSRDSMLL KVVELY--RL IAALFNALAD AAHDKLEDLK PLLLGRKVFS
           HYPTKESRVL SVLERYLERH ILSIKRILAQ SAPGLEVLNR SLTLAPIDFQ
           HLPAKDALRL SVLEKHVQRV LLKLSDVLAD EAPASKVDKQ AFILPPTKFA
           DTPAKNSFGQ VVLERYLQRH LLKILAELAD EAPAVKEPRR PFILAPLSFA
           HYASVSAIML KVFEKFMDAY TLDVGAGLAI ITCSLLQWVK AFVLSLIDFL
           NDPGPETLTL SVFEHHLERL VLETNELLAD SECEFLAKLR IYFLALFNFS
           HLPSREELPR LLTERCLQRL ILDVEYALAS ALGDNLRAFR SFLVALLDFA
           HLPSAEALVA KVLEHYVEKL IAKTAFILAD TLTESQKTLR GFLLALVGYK
           HFSGMDSDFL EVLEKYLERI FLELQHALAK VAPGPLGGDS GFLLAFFNFK
           HFSSHEAAVE IVIERYLEKL HLDLQAGLAD NAPDVLKALK KFLLALLDFA
           HFSSKELAVL AVVERFGEKE ELELDTGVAH TASAILEDLR DHNLALLGFT
           HFTSKELAVR AVIERFEETV PLEVGTGVAK SASDMLEKLR GHFLALLDFG
           HLPSAHGLLL LVLEFYLERL IATFIGALAL DAIDG-DELP ELLLSLLNFS
           NQPSADSLLL LVLELYLQWD LKTISAVLAL SAPECLERTP ELLLALLNYG
           HKPRVDSLAL AVMEPYMDWD LKTFSAVLAL SASDCLERSC VLLLALLNYG
           HPSSANSLFF IVLELYLEWY LRSFDDTLAL SAFDC-QNEP ELYLALINYG
           HMPSADSLFY LALELYLEWY LKTFTAILAL SARDC-QHGP ELLLALVNYG
           HFPTKEEVLL PGTDFTLERL PASFEAELAV DAKNSLQTLR AMFLVLLDFD
           HEPGIEHLVP LVLETNLEWL CAHFQGKLAL LAVEEADGQR ELLLALLEFS
           HGPNKENLSQ LHLELALEWL CAHFHGFLAL DSANALTSLR EGLLALSDFS
           HGPNKENLSQ LHLELALEWL CAHFHGFLAL DSANALTSLR EGLLALSDFS
           HYPTKESLFL VVLEFYLESL IVDVAAALDL DAPDPLNSLR ALLLALLDYE
           NFPSQDENEL HVIAFYLQPS IAANAAALAL QAPEPLIFLR ALLAALMAFN
           NFPSQDENEM HVIAFYLQPS VAASAAALAL KAPEPLIFLR DLLAALMAFN
           NFPSQEAQQL HVLEFYLEPD ISPPAAALAP EAPDPLNTLR ALILALMEFN
           NFPSQEAQQL HVLEFYLEPD ISPPAAALAP EAPDPLNTLR ALILALMEFN
           YFPSKQVLFK ANLEFNLEHL TTVFERFLAV SATKTL--MR GLQLSLLNFS
           HFPSQEVLLL IVLEAYLERK NAAFTEA-PV DAPVPLAPLA ELVLDLLGFS
           TFPCNEYLGD GLMALFLEKT FAELGGDLAL HAKDSLDSIV AKKLA-IPAA
           HIPSAEEPLL AVLEKKLERI VAIIAAALAL RAPDLLGTFV PLLEALVEFA
           HRPSQETMLL TVLEKKLERL LAAIDAALAI RAPDM-DALI AALEALLDFA
           HCPANKELKS KVLERYLEVT IALFGS-LSQ EVLPGLAPII SLLLALLDIA
           HYPSKDALNF VLLEQYLEVK PLAIVAALAQ DALD-LEVVR ALLLALMDFP
           DAPSPEELFF GILERYLEIQ KLALKQALAD GAKANLESIM ALMLVLLQLP
           HYPSEEAMMF QVLERYLENK LLALGAGLAQ AAVYQLIHVR AILLAVFPFA
           HYPSEMAMMF VVLERYLEQQ LLALQSGLTQ SADGLLIPVR PILLAVMDFA
           HYPSEQAMMF VVLERYLEQQ LLALQSGLTQ SADGLLIPVR PILLAVMDYA
           HYPSEQAMMF VVLERYLEQK LLALQSGLTQ SADGLLIPVR PILLAVMDYA
           HFPWSQSLLP EVLKRYPERP LESLSVGLAH SALPGLASIK ALLLALLDIL
           HFPWSQSLLP EVLKRYPERP LESLSVGLAH SALEGLDSIK ALLLALLDIL
           HWPWKEDLLP DVLELYLERQ IQGLAA-LAH EASESSAPLH VLLLAVFGFF
           HWPWKEDLLP DVLELYLERQ IQGLAA-LAH EASESSAPLH VLLLAVFGFF
           HDPPGEVLTR EVLERYLEKY LGAGTFAYAH AATNVNEAAR TLTFALIELA
           HKPSKESTLL SVHDQYLEKY VGSLLASLAH AEDILIQAVN PIMYALLEHA
           HLPSKDPLLH GVLERYLEKL VSTLLAALAH TAPLVIQCLR ALMFGLLEFQ
           HAPKNEQHIL TVLDAFLERH AATETGGLGD NAPDSLEKLT ALVLALEKFK
           HRPKDQQYFA TALDAYLERE MATFAGGLGA YAPDALENLS ELVVALLSVK
           HGPKLHRYVV VVLDSFLERG VGTFHGGLGD HATHPLGSLR AFTLAMLEFK
           HPPKSERFTM LALDAFLERT VTNFKGGLGD HAGDELETLR ALVLAMLEFK
           HSPKSERYSM EALDAFLERT VTNFTGGLGD PAPKSLETLS ALVLAMLEFK
           HSPKSERYSM VALDAFLERT VTNFTGGLGD HAPDSLETLS ALVLAMLEFK
           HGPKSDTYFA VVLDALLHRT VATFRASLGN NADDSLATLR LLLLA--EFK
           HKPKAAAYFV DVLDAFLERT VATFNGGLGD NAKNVLQKLQ ALILAMAEFK
           HQPKSEAYIL KVLAAFLPRT QATFGGGLGK NANCFLEYLR TLLYAMLEFK
           HEPKSEAYYV AILDAFLERT QATFGGGLGD YAKDNLETKR TLVIAMLEFK
           HSPKKDGTFL IVIDAFTESE LAKVQAGLDD PATDTLEAFR HLALALLEFN
           HSPKKEGHYL LVIDAFLERE LTTLHAALGT AAAQGLKTFR PLALALLDFK
           KTPKREGYFV QILDTFPEST LASISESLGN EGAGRVQTLR NLVLELLEFR
           HRPKLEAYSF DVMDPFLKVY RATTLSGLGE SPHSDLQGLG ALILGLLEFK
           HSPKRDGFFK AVLDTFLERM LATTVSGLG- SSHSDLHSLG SIVLALLQFK
           HAPRLDAYFI TVLDHFLEAG MATKGGGLGD ATRANL--LG PLSLALLSHK
           HGPLKEVYFL DVLDAFLEAG LATKLGVLGD TSNG--AKLG TYCLALMEYK
           HTPKFAGFFL QVADAFLERT AASFIGGLGD SSSNGLQSAG VLVLALLQFS
           HTPKKEGYFL GILDLSLERS LATEEGGLGS KSKTN--VGG ALVLALLEFK
           HTPKKESYSL SVLDAFLERT LASSTGGLGE KSRTS--TLA ALVLALKQFN
           HAPKSDGYSL SILDAFLELR LASGSGGLGD AEKSP--PLA ALVLRLQQFC
           HTLKVETYSM SNLDAFLERL VASGAGGLGE AYRAA--SLT EKVLALKQFK
           HTPKIEGYSL NVLDAFLERL LASSAGGLGE ASRAY--ALG DLVLALKQFK
           HAPKKEGYGL GVLDAFLERS PASSTGGLGE ASRSG--TLA ALVLALKQFK
           HTPKKEGYSL SVLDVFLERG PASSTGGLGE ASRSG--TLA SLVLALKQFK
           HTPKKEGYSL SVLDAFLERS PASSTGGLGE ASRSG--TLA SLVLALKQFK
           HTPKKEGYMQ SVLDAFLERK LASATGGLGE ASRSN--TLA GLVLALKEFK
           HTPKKEGFSQ AVLDAFLEKK TASVNPGLGE ASRSN--TLA ALVLALKQFK
           HKPKKEGFCQ SVLDAFLEKK LASVNPGLGE --RSNLVKLA ALVLALKQFK
           HKPKKGGFSQ SVLDAFLEKK LEAVSPGLGE A--SNLVTLA ALVLALKQFK
           HSPKKEGYTQ KVLDEFLERS LAATSGGLGE AGRSY--TLA ALVLALKQFL
           HTPKKEGYSQ AVLDEFLERS LESSSGGLGE AGRSA--TLA ALVLALKQFK
           NFTTGNAILQ MVLEAFLEPP TADSQATLAT NGQDILQPMR SLLLAMLEFQ
           LYPATSSIMT AVLERFLEPG GAMTQA--AG GGPEILDPLS TMLLAAMDFA
           LFPSLTSIAS LVLERFLEPN DNTSQTALLD D--DMLDPLR DLLLAMMEFS
           HLPSKEQKEA DLPERYAERA TS-DNAALAD AANDTLSEFK TAPLGLFEFR
           HLPSKEKDLA PALDKYLERI VSRAKNPLSP ASV--IATFR AVLLALLDFE
           HYPSSDTKLA LFLERYLDRD TSSTTP--AG HAGYKKATVV NVDLALLELD
           HYPGKELRLS ALVEKFLDRP KSSR---LAD EACYTLADYR GVKLALLAFE
           HFPSKDVSLL LPLDRYLERE LSGPQPILAE TGVHTIQIVR TLPLALVEFA
           HFPNKDTKRG RLLERLLERA EAGLNPPLAN DAPQLLSDMK KLLLGLLDFA
           HVPGPEAIHS IVLENYLAKS LGSLPPELER CKTNKLEAVR SLLLSLLEVD
           HFPSKESLLA GVLEFYLERL VASLGPALSE DKPGTLDPMQ TLFSSLLDFG
           HYVSNEEALE KALERYLEKV LATVDPTLAS DNADPLEELR DLLLSLLGYA
           HYHSVEKTLN DALERYLERA LITIAPSLAG DNPTTMADLR ALVMSLLEYA
           HYHSAEQTLN KALERYLERG LNTIPPNLAG SKDGTLEELR ALLMSLLNYA
           HHPSLDALLL FVLEKYFHRH VAGFAPELTA KAGDLLQGLS LQMLALLHHF
           HQPSVDSLLL FVLEKYFHRK VLGFTPDLTA KAGEDLQGLS LQMLAPLQHF
           HSCALDSLTV TVLDKYLHRQ -ANLPADLAD DAADNLVILR LQMMALLQFA
           HAPGEDRLLV GVFEKYMKRP LGAIGPDLAE D-IDALNEVR SQMLALLDFA
           GLPTKEALLF GALEKYMERR LALLDSRLAE HAPEI-VGEP AQLLALLEFS
           NVPSGEALHL DTLDRYMERC MSNLGAGIAD AAGNLLQGTR TPLVALMEFN
           LLPSDESIIF EVLERYMERH LAALEAPIAD GGGSLLHGAR TQLLALDEYD
           HLPSDEGIIF EVLERYMERQ LAALEAVIAD GGGSLLHGAR TPLLALVEFD
           HYPTNQILML GVLEHKLERA KG--LGPLAD KAPDLLLDLR SLLLALNEFD
           HYPAREALLL AVLERYLERG LATLAATLAD DAADVLVGVR VLLLNLLIYL
           RIPTNEGMLA IALERYLDRE LASLASALAE EAN------- VLLLALLGFT
           RFPCPESTLL AKLEAALETT LAGLMAE-AG SATDRLNGIR PLLLALIGFS
           RFPATEALLM PTLEAFLERG QASLTV-LAA EASDKLEGLR AVSQALLGFP
           RFPNKEAFLL AALERYLERN LGGLTAE-AG NEPDILEGLR ELLPALLGFY
           KFPNKEAFLL KALERYLERR LGGL-AELAS DEPDILQGLR ELLLALLGFK
           RFPNKEAFLL KALERYLERR LGGL-AELAS DEPDILQGLR ELLLALLGFK
           RIPSKEGLSL PVLEKYFERD LPALMKTLAK DYPNAVEGLK HLLLSLTGFG
           RLPSKEGLVL RALERDLDRI VAGILEALEG EAQNKLSAVS LLRLALMGFA
           QLPSSDQLLL SVLED--EKL AAVLITALGD QGPDSLPFLE ILLLHFVEFA
           QTPSKEQLLL IVLEE----- ----RAGLGD AAPLSLARLR HLNLAFPKFN
           QLPSNDHNLL AMLEEYLHRD LAELKAQMG- ---------- --TLASYPFA
           QMPTGEHRLL EMLENYMERI IEELLSAQGG N--------R VQLLAYSEIH
           SLPATQHLLL NALEKYLERL LASLDAELGG P--------- MLLLAFGSLN
           LAPSKDPLLL AVLDGYLERL VPA------D TAPGKGGLLR VLLLALADF-
           QFPSSDPLLL VVL------- --ALTAVIDE SAPGSGAVLK ELLVDRSQFA
           QFPNKDPLVL LVLE------ PNALTAPLGD SAPTYGGVLQ VLILALTEFS
           QIPRNEPLVD KVLERYLEQ- -----AGLGE DARGFGIGLE VQALALSNFA
           QIPRNDPLVA KVMERYLEQ- -----AGLGD DARGFGAGLE VQILALDNFP
           QFPSSDRLIL PSFEQYLERP ITALL----- ----AGPIMR VLDLALIQFG
           QFPSNDPVLL LSLER----- ----VSALGD LAPGAGSMLR SLPLALTHFS
           HYSGKAPPSL SVLENYLDRL VAKAKASLDG TAVGLAE--- ----SLIQFT
           HFSGKSLPGL EVLENYADRL VTDVGAGLGG TAVGCAPVAR PLRLSLLQFT
           HFPSRALLAL TTLEHYLERQ LA--VACLGD DSTATLSHPR SLISALYDFH
           HAPAKDPLSL AALQKALERF FAKLNACLGG NAPGA-ALPR ALLSALADFY
           HLPSSDPLQL AALQKYLER- -AGLKACLGN NAPGAEAVPR ALLSALHDFY
           HCPPDEPLLH AVLENYLERQ LAVELAGLGD AAAGLLLFTK MFLLA-----

           REKPIIL-LH LAASAGDALR DKGQALRREL LPRLSGLGYA GLASGALTGD
           AQDPLHLLLP MAAALTSSLR GRLRELRREL LAKGAAKVYT GLGAADATGD
           MNHPGALLKS LAATLESELC GKLKALTREV LEKLGASVFE GLPEPTLTGD
           LNHPTQLLKM LATTLHKALR GKIKDLQREV FARLTASAPA GLAAQFLTGD
           LDHPVHLLKS LATT-HKAVR GKVKDLQRDR FARLNASAPS GIAHPALTGD
           LAHPGQLLKI EATKLQRAVR GKFKELQKDA PAQLTANGIT VVGQPNLTGD
           LEHPIQLLKS LATT-HKAVR GPFKDLQKDV PAHLTATAPS GIAHPALTGD
           LSHPNRLLKS LATT-HKLVR GKLKDQEREI FARLTASAPP AIAHPALTGD
           LEHPTPLLQS LATTLHK-VR GKLKNLQREV FARLTASAAS GIAHPALTGD
           LSHPGELFLS MAAVLQTEIR GKLKNLTREL LQKLSASLTA GLAVPELTGD
           LLHPGALLLT MAAVLHNELI GKLKEFSREL LERLAASVIT GLAVPELTGD
           LENPQVMELS EPATLAPDKS GMLEGLSREL LERLNGTMLI GLLASELTGD
           NVSAETIQLS MPATLTGHLP GKLHGLSRKL LERLNASRLL ELDSPGLVGD
           YLSPDPVQLS MPATLTGHLP GKLHGLSRKL LYSLMATRLL ELPSPGLVGD
           YESPGQVQLA MPAKLSGHLP ASLHGLRRKI LERLLVTHLH ELASPGINGD
           YESPGEVQLA EPATLTGHLP GTLHGMSRKL LERLLATKRY ELSSPGLYGD
           FENPDAKQLA VSAALAAQTA GKLEELTRKI LDRLAATILE SLAASELGSD
           LSNPNTMALT LPATFAAALS GKLESLNRIL LERLSATLLI KLAASEMSGD
           LDNACDLSLG MAPTLGAGLQ DNVDELGREL LEQLPARRMA PLPAPEAEQD
           RYNPSGLMLK LAPVLSSGLN EKLEDVARDI LERLSATMVG GLNAGAMSGD
           LNHTLTLLLV ADATHAAKLR SSLQHLAREV LEKLDTKSGA GLAASALSGT
           LAKTLTLLLT MAATCTSELT GLIQELARAK LERLAATF-- GLEAGELTGD
           LENTFTLLLS MAATCTSELR GLQQDLARAA LENLSATVDP GLNAGTLTGE
           LHNTATVLIT LAAALAPEAS GRVRNLSRDL LKKLGAGTLN GLGCGEMRGD
           LTSPVSLMLT MADTVLPELR GKLRSLTREL LEQRLAYSAA GNGAGGVNGD
           LADPKELLLI PETAASSELR SDERDFLRDP LHVLCAESPH VLGAGEMDGD
           LTNPKHPLLT VASSLVPELR GKLRDLGREL LKRLHAEQPA GLGAGDMHGD
           GENPAGLVAV RAAGLGTDLR AKFRELTRDL LAQAAQYGTA GLRDDNEAGD
           LNNGKALFDV LAATRGADLR GKFRELSRDF IEIVTGAKLS GLTPGELTGD
           LGNAFALVLK LSADFTGSLR GKLRELRRAL SELLSAGMLG GVSSNQIAGN
           LANPSRLGLA FTADFTAELR GKLKELPRAV LERLKAEPLA FLSSGQLTGD
           GENQQRLLLV LAAELHAKLR GKRRALRRQL LKRLSSEEMG GLGAGAFTGK
           KDNPNYLLLT AAATLPGDLR GKIRELRVSL LSDLHGEMLG LLGAAELNGD
           IGNAAALLLT LAAAISQQLK GQLRELRREM LQHLAAESQN ALSALESLGG
           RGNPADKPLT LAATLSATIR GKLRELRREL LQGIAPDALT GLASGDLSGE
           RASCRLLPLA FAIIIVAKTH ARLSDLAREL LDKLIVLQAA DIKSKGLDGD
           RETPKLLLLA FAGIIVNQSH ARLSDLAREL LDILTVLLTA EVRSKGLGGD
           RKAAKSLLLA LAGEIVKLLH ERLRDLSCQL LNKLTGLLSS EVRAKGLGGD
           KKAPKSLLLS LAGEIVKQFH ERLRQLSRQL LDKLTGLLPS EVQARGLGGD
           KKAPKSLLKA LAGEIVKQLH ERLRQLSRQL LNKLTGLLSS EVQARGLSGD
           SQNQSELSLS PGGALTNNLR IKYRELKRDL LDRLKALLIS GLTNKELVGD
           KKHPNQSVLV VGGAEAENLR TTLTSLDRDL LDSAKKLFCT GLAEAGANGD
           QAHPRSLLLV LGGSLGWHLR SRLRGLRQDL LDEAKVLLLA GLVAGLLTGV
           QAHPRSLLLV LGGSLGWHLR SRLRGLRQDL LDEAKVLLIA GLVAGLLTGV
           KIIDRSLLLK VASHSRQQLR EKIRNLANDA LEFLSLLLSG GLDAAELNGE
           RIGLRAFLQS HASTESDELR DLKRDLRRET LNRLDAQKVS KLGTGELTFE
           RTNAKSYLLI EASVEVDELR DKKRDLQKEC LDKLDALLLE GLARAHLKGD
           RNNARRLLLT QASYETEELR ELKRELRKEA LNTLDALQLS ELASADYSGN
           RSNVSSLTLQ LEAAVNEGTR EKKKEFRREL LDKLNALQLS GLAAGEFTGE
           RISVIALMLM LAASISDELR EKKKQLRREL LEQLNALLLG GLAVGEYSGE
           RLSIIALTLM LATSVSDDLR EKKKELKRDL LEQLNALLLG GLSVGEYSGE
           RLSVMALTLM LAASVSDELR ESKKELRREL LPQLNALLLG GLSVGEYSGE
           RASLDKVLLT LAAALEDELR EKFRDLKRGL LDRLTAPCLS GLAASNLVGD
           RASLNKVLLT LAAPLEDELR EKFRDLKRGL LDKLTAPCLS GLTSSNLTGD
           VETCLTSFLG NAASYGAGLK EMVTHLRRSL MEKLAAMCTK DLGTDELTGD
           VYTPFNLLLT IAAP--DSLR EKNRTMRREL LSRLAALSNP ALAAGELS--
           TSTPISVLAT LPAALTVGSR SGDREQRREL IENLNALLLR ALDAQQLTGQ
           TETPKSLFGA VPVTQTSVVK AKTRDLRREK LEKPDALIIP GLAAGELNGD
           AGTPFKLLFG LTDAL-ESLK EKVCDSRKEL LDRMSALVLP ALTVKEERAD
           AQSLDTLALS LAPNQKELGR TEERSLHRDV LEMLLGVSIE GLTGGNKPGD
           RLSPQALALT LAPALKASVR QKVRQLDR-L SDSLLALKII ALAGVDLPGE
           WGTPKPLFLT LAPGLKESVK EPNRHVKREM LFQLEAALLP IVGAGDIKGD
           RGALAPLLMA VAPILKDALR EKLREVRREA LEHLDADLLC GLSLAELAGD
           RTTPGGLLLP LAYSMKEALR EKHRGLKREV LEKLQAILDN ALAADDLTGD
           EGTSTELKLS FAATLKADLE EKIRDLRQEA LEKLEGLLTP --AAGEEAGD
           RNTPRALVLV ISASLKATLR EKLGDLPRKT LEMLEAVSLP G-------GD
           RGKQNSLLLT LAAHVRQHLR EKNQDLKREL LHQLETLLVK GVAASDLSSD
           RGTSATLLLI LPGHLKEDLG ETHQTLRREL LHHLESLPIM GVAAAEMPSD
           KIAPKTIPLT YAASYSIGLR ERARQLQDNL VERLGGLLDP AFSSGELSGD
           EKAQTSLPLE QASTLDSGLR NRAIDLRREL VERLGALLLQ EFTAGDLTGD
           KTTPSLLQLC YAATVAQDLR VKLNDVRREN LEVLGALFFG KLATGELAGD
           KNTAKSLLLT LAANLNEELR EKVRKLRREL LRNLGALVMP VLFTSDLTGD
           TTTPKCLLFT MAAKLAGNLR EKAHEVRREL VARLGALLAS GIAASDLEAD
           RCTPLVLLLS LALVLTAQVI QKKRQLRRQG LNCLGPQMQP SLAAGENGGD
           RSTSESLPLS LALVLAAQVR EKTRELRRNQ LTRLGALMMP SLAAGEDGGD
           HENTATLPLV VAAPLASEME ERDKALRRNI LEQLVAVLHA ELGVGSKPGD
           RGHPRPLHLR LAFCLAHQLR ENVTTIHRAA LKKMSPQIAA GLTGAEVKGD
           HNNPQDLLLL LAAALGEELR EKLKALVREV LDHIG----- ----------
           AAQPYSVLLV LAAATGESLK GKIVGLCRAL LEKLRALLLP FLSSDALIGD
           PFQPKVLLLD TVAGLDESRS ERMGGERRDF LDMLDAHVCT DLGDGAVKGA
           IFQPQGVLLT LVAGLYNPVK DKSKTLKRDY LDVLNLNVYS GLSDGGFEGD
           SEDPMTVTMS LIAEIVDALK EKLRELAREA FDKEAILPYG ALAGGGLEGN
           KATPSSVLHV LAASLFGELK DKAKALEREL LDKKLSFLFP GLADGVLFGN
           RAKSQIAVLP LAAYLVEAIR EKMKALRREL QENPGSLYPP GLAASGQGGD
           KSKPAFAILV LAADSPEDLR ETLPASRREL EAPPSALVKP GLRAGGLFGD
           ENNPRTLKLI LAAALAEELI VKLRRLYHEG VEKMELLYHN DLTAAGGMGD
           RQNPHVLILV LALANANELS DKLQLLRHSS -EKLTELDLA GISAGGVVGD
           RENPVPFVLM YAAPASQELT DKVRFLRHSE IQKLSAMVVG GLTAMNVSGD
           KANPKAFLLE AAPPTAKDMT DKVQFVTQSE IQKLSSINYT GLKAVGVSGD
           TKDPKPFFFI DAEGIGGALR NKLLSLKREI LEKLGGLCIA GPRAEAPTGD
           TADERGFLFI NAEGIGGHLK QPLVAMSRET LENLVGLLVP EKRAVAMTGD
           TDDEKGFLFI TADGIEGHLC QPLVDMKREA LEGLVGLLTA NTRATAMAGE
           VEGDKVFHFN AAEGISSHLK QCGMTMDHES LEHLVGLLNS GQRAVGMTGD
           TEGDKGFLFI NAEGIGGHLR QPLMTMNRES LENLIGLLYS AQRAVGKTGD
           SENPHVLLVV LAASLADPNR EKFTEVKIEL VYY------- ----------
           SEKPFSLLPI SDESFAHGLP NKLRTASRDM LHFRDQLFDV GLRAGNLSGE
           TEKPFSLLLI LAEGISEELR EKERAVQA-L LYFLALLKEA GLHAGNLSGE
           TEKPFSLLLI LAEGISEELR EKDRAVQA-L LYFLALLKEA GLHAGNLSGE
           SENPEQLLLL MVHALGDHLR EKLQKVVREA LEQLEGILPD NLKLGTLNGE
           IANPHQFVLV LPKAPGHQLR EKLRGIKRDM FDALAPLAST TLRAAELQGE
           IANPHQLVLV VPEAPGHQLR AKLRGIKRDM FDALAGLADT TLRAAELQGE
           IANPKTLFLI LPEALGQELK DKVRGVKRDL FDALAGLGEY SLQAADLQGE
           IANPRTLFLI LPEALGQELK DKVRGVKRDL FDALAGLGEY SLQAADLQGE
           GENPAHLLMV QSASKAGELR EKMAHVDREL ISTLEGLPES GAHASNLVGS
           TQSPSSLFLI LANGLGGPL- -QLRSVRRSL LDELVALTTG GPRNDDLEGD
           TWAGRPLILL PDDNLGEHLK NKARIMRSEL VQQRAGTIDA EVDGGAVEGD
           EQSPLSLLLV LQEGQWEQFK DKVRRMARET LEDLSGIMNA ALDGGEKGGD
           TSNPMKLLLV MEEKLGEELN EKLSQMARDM LEHLEGLSPA GLNGGEKSGD
           TPKVRTLILG FADTIKSTLS EKGLALLREL PENLTGLVKA VLSSGDLTGA
           AEDARVLLHI RSEPIDDNLP KKLLAMKRVG LDKLAGLIGG GLAAGDLSGE
           AENPGKLKLV LAEASHDRLA NNIELMKTEN LENLANLSTA GLAAGDLTGK
           TDNPRAFLLV LAEELENILM NKLLLMRKEI LDNLAGYFRA GLAAGEVAGD
           SENPCALIVV LAEAIEKILA NKSLLMRKSL LDDLAGISPA GLAAGELVGD
           SDNPCALVLV LAEAIEKVLA NISVLMKKEL LDDLAGISSA GLAAGELVGD
           SDNPCKLVLV LAEAIEKVLA NISVLMRKEL LDDLAGISSA GLAAGELVGD
           TQNSLALMLI LAECCSAELE EKCPANKKEL LEDLTHVGGN ALTTKELAGG
           TQNSLALMLI LAECCSAELD EKCPANKKEL LEDLTHVGGN ALTTKELAGG
           THNKVALLLI HAESTGNALK ERLFAMRRDL LQHLNGLFVD ALAAGDLCGD
           THNKVTLLLI HAESTGNALK ERLFAMRRDL LEHLNGLFVD ALAAGDLCGD
           KDAPESLLLM LASSLGENLV QKLRVIDWEE LENLGGLHAA ALAGCDLCTD
           TNAVVLLAVV LTGTLGHDLR EKPPA-RREV LGSLGGLLME VLGAGTLRGE
           TDVPAALLLT LAGTLGDELR EMTQSMKSQA LDNLIGLMMN SLAAGDLDGD
           RQNPLVLVLQ INVAEGDKIG ESLRALKAEL LERLAVLEEG NLAGQEVPGN
           RHNPGTAILQ INVTDNTKID EQLHTMKANI LEKLMAFLDA KLGGAAKTGN
           RLTPESVSTT TNVADYSKIS EKGSTLTTDL LNRMDVLSGG GVDAEAKVGN
           HQNPDGVILT TNVTDYEKIA EKLHSLRADL LEKLDVLSEG GVGAAEKTGN
           HQNPEGIILT TNVANYERIS EKLHSLRADL LEKLDVLSEG TVGAAESTGN
           NQNPEGIILT TNVTDYERIS EKLHSLRADL LEKLDVLSEG TVGAAESTGN
           QKHPQSVILT INVKDGDKIV DHVHTAIAEL LEYLSVLGQN GFGAAEKNGN
           RHHHTAVTLM KNV-DGDKIN DTERILPADL LERLAVLAHS GLGAAEKGGN
           KNHPGAVTLE -NVTDSDKIS KKMHSIWADL LEKLTVLEDG GLEAVENAGN
           RQDPPSVILH VTVSDGDKIT NKQHSVRADF LANLAVLDSG AGAATELTGN
           KNDALVLLLT VNVTESEKIT QDVHALPAQL LERLAVLKNG ALAAGEKSGN
           RQDSEALILN VNVTETHKIT DDPHKLAAQE LERLAVLLQA VLPYASSTGN
           PEDPCDLQLT ANVSEVDRVT EPVPSLRSEL LPALAVLANT GLELGEKKGN
           RGNAGPVVLS YNNAEVRKVG EPVVVLKEGD LQKLMVPVHA GLPSGQKSGS
           KENPAKLIFT FNVTGVAEVT EPLHSVRAGK LQRLEVEVDG GLQNGQKPGS
           GEDSHKLLLS GNVDQVDKID EAIRALNGGD IQPLKVLGEG TLQASQKTGA
           SENPNKLVLT KNVEEVDGIP EPMRTLDPNE LESLEVLGEG GLQAGQNAGA
           REDPSILGLS LQVNNVEKAS QLAEDLKAGD FVRLAVAPAG SLEVGDKRGT
           RDNPGNLVLY FNVKEADKVT EPLFELRAGE LEKMGVLENG GLRAGQKAGV
           RDDPPGLVVT FGVAEVDKIA EPLHAMKGNR LHRLAVLDDG DLQAAEKPGN
           RDNPRALVLS FDVADGNKVA EPLYSLRPGR LHKLGLLGQG DLQASERGGN
           RDNPSSLVLT FNVSQVDKVG EPLYTLRAGR LRRLAVLEEE ELHAPVGGGN
           KDNPKALVLI FNVAQVDKVG EPLYTLRAGR LHRLAVLDDG DLQAAEGGGN
           HPNPSALMLT FNVAEVDKGT EPFHALRAGR LHGLAVLDDG DLQAAEKPGN
           HPNPSALMLT FNVAEVDKGT EPFHALKAGR LHGLAVLDDG DLQAAEKPGN
           HPNPSALLLT FNVAEVDKGT EPFHALRAGK LHGLAVLDDG DLQAAEKPGN
           RGNPSALVLT FNVAEVDKVA APLHALRTGR LHRLNVVSDG DLQAAEKPGN
           RGNPSALVLT FNVVETDTVA EPANALRAGR LHRLNVLDEG DLQATEKPGN
           RGNPGALVLT FNVAEVDSVA DPAHALRAGR LHRLGVLDNG DLQASEKPGH
           RGNPSALVLT FNVAEVDSVA DPAFVLRAGR LHRLNVLEDG DLQASEKPGH
           RENPAALVLN LNVAQVDAVA EPLHALRAGR LHRLSVLGNG DLQAAEKPGN
           KENPAALVLN QNVAEVDRVA EPLHGLRAGP LHRLSVLGNG DLQAAEKPGN
           KSASWGLIQV FTVAFGYKLK EKIHSLRGGL LEKLSVLGDN GLSSAALPGQ
           RAAPYLLLLV MSSVLASDLI EKFHALWPGL LEELLVLNPG GLTSNGLRAD
           RPAPWLLLLV MSSVLTPTLR ERKHALASGL LQELPALALG GLTTPSLGAN
           QFSSEVLVLI LSTLLDAALN EQVKAMVSQE LESKCALLAQ EIIVTSLRGD
           RCTSKKLLLL PATKLSDDLS EALRAIKTEH LEVL---ATT GLIVAALAGT
           RADGRLILVI LPSALEAELA EEARALASNK LDTVAFLAER KLRLSSLTGD
           RMSSLLLASV LSKALSDDLS EDAHALTSDC LETLQALAPQ ELEMSELTGD
           RTPACLLLLL LTAALAAQVS EHMRALSSTV LECTGAVLVG SLVVGSDNGD
           KQSTRLLILI LAASLQQELQ EKMRALTKEL FSFLSTTLEA GLAAGELTGD
           ILNPDVLLTI LAATLMPDLV QPLVGLTSEV --PLEALSPA GLVVEKLKNE
           RDNHRMSFIL LAGSLLQALY EWLKALTADS VQ--RALLFA DLLVTKLTGD
           KENPKNLLAI LAAVLKEQMG HPFRALTSPV VEILPVLLPP GVVVAEVGGE
           KKNPANMLAL LASSLIEEMS HQYRAPVSEI VEDLPDLLLT PLVVADFSGQ
           KENPKKLLVI LAASLVEEAG HPFRSLISEI VECLSDLVPS ELAVADMSGE
           RVSQKELLVS VAGTLVEDLR ERLKALRQEF LARLGVFTAS ALTAGELVGD
           RISPKALLVS LAGTLIEELR ERLRELCQEF LARLRVFASS ALTAGELMGD
           RLSPRFLILT LAESLLEELR EQLRVTRREV LEPLAGIVGV TLSAVALLGD
           RLSPRLLGLG RAAALQEELR GKLKALHRGL LDRTVVLDEH SLSGGELMGE
           SKNPGLIGLV LVAPLAKKLK ERLVHLT--- LARLSAIFNA NLKTADLPGS
           RPDPTLFLLI LAAPLMEQLR EKLKQLACEY LEPLGAVFAG GLTEASLEEN
           RGNPNLFTLV LAGPLIEDLR KKLKRLTDEF LEVLTAKVGA GLTDASVQNN
           RSNPDLFTLV LAAPLIEELR EKLKRLTDEF LEVLTAKVAA GLTDASLQNN
           RGEPKQLLLI MAVAAPKDLW NRA--LRREL LHQLANLLAF GAVSGELEGD
           KKEPQVLMLI GANRGKYVLR EKLQALRRAL LDTLSALVKS GLTKGKLKGD
           RKNPTVLELI LATELTSPLR EKLDQIGREL LDRLAAVLGG ALTAGSLEGD
           KCALLLLVVA EVAERD---- -RLPLLQRTL LSRVFTLQGL GLTSSELGGD
           KCSLRALVLA LVMSLYTQLE EK--VLEKNL LHKIGTLVPH GIKPNELPGD
           KKNLQLLFLK LVAALT--KK EPLRVLPLEL FERIATLLEK GLTPGAVAGD
           IKNLELVFLI LVAALTSGM- -SVRVLPLEL LEQITTLFEK GLTPGALSGD
           IKNLELLFLI LVAALTSGM- -PVRVLPLEL LEQITTLFEK GLAPGALSGD
           KENPRVLCLV SAIGLGNEMN EKPRGMRREL LEDLGTLHEV SLKAGRFPGD
           KVNEKLLIFI YATFMNEDMK EKNRSMRREL LERLATFVQT GLKADELPGG
           REDDELLLLL LATAYVATLM QKFNKFDHDI ---------- ----------
           GRNEILPLLL STAGFAFKLA QPLNKFGQDI LFPLGASYTL SLVTGPNPGD
           CQNEGLLLLI LTTIFLYSLM QKSGRGDRDI LNAICALRSY ALTGGETPGV
           FGNTGLLALI FVTAFVYATP HKLAQYGRSA LNKPCALVDL ALQEADEPGD
           KQNAELLYLL VTTDLIYGLL QKLSQFGRDI LNEQCALDEL SLQEGEVLSD
           ---------A LSPDYVYALM QKIGSFGRDV LNRLSALLGL GMHTGNQPGD
           RATDNLLLLF LSAAYPYALM QSDNTFDRDI LNQLSGLKSI CLA---NPGD
           RANEKLLLLI LTADFPFTN- -----FGRDE LTRAAALFPL YLFSSEDVGS
           RDDERMLGLI LTGFYSFTL- -HLNSFSKDE LNRLNAIFDL Y--SDPKTGS
           RDDERMLTLI LTATYSFTL- -KLNSFSKDE LNRLNALFDL Y--SDPDTGS
           QDNQTLLLLF HTPTYPYDLK QKLHNFGRNI ---LYRHLGS YLAASSENGD
           QNNEDLLLLF VTADYPYALL SKL---KRDI LNRLKRTYEI YLPAGNDPGE
           AERETILLLV KTAKYVSQLT DKSKGFRRAI MNMGKAAAQA CLGGGLDYGD
           QAREELLLLV KTPRYVAQLS QAKIGFRRDL MNRLRAVVML CLGGGLGPGD
           NEKEDLVLLI QTATTLYGLA AKHEKFLRNF LTQLKAALGM ELAGGENPGE
           GTPQKLLYLF DTTGTIFDLK QHDNDFKKDL LDRLKLLLGR NLAGGEDPGD
           GTRQDLLYLF DTTGTVFDLK QKMNHFRRDL LDRLKALLSK DLAGGEEPGD
           -EKPEFLLLN LSDLYDYELK EKFSALRRDM LDTMNALIAL GLTAGDAPGS

           NATLMSARLI GLLVSATLLA L--------- ---------- -------
           GVQLGAASLA MQLLGALLPC LRLDALLGSL ASGLPEEKLA SLAIFL-
           EATPMSAALL MPLVQALLLC LLLQPLLAKH SDDLPQIILA IYGIF--
           NATLMEAVLL MPFLAALLSC LILEPLDRKF ADDFPAVILA IYAIF--
           MATLMEAGLL MPLLAALLPI LILAPLDKKY AHDNHNDILA IYAIFLT
           LGTLSEAVVL LQLVPSLLAA IIFKPIDKKY GESAPVGILL PFSVW--
           MATLMEAVLL MPLLAALLPV LVLKPLDKKF ADDSPGDILA VYAIF--
           MATLMEAVLL MPLLAALLTV LPLEPLDKAY EDDSPGDILA VYAVF--
           MATLMDAVLL MPLLAKLLTI IILEPLDKKY SDNSPDDILA AYAAFLS
           EASLGAGKIL VPLLAALLVA LLLSPLLGGF SDDLPNMVLA IYAVTL-
           EGTLAAGVIL MALLAALLLY LLLDPLLSGF SGDLPDSGLA VHA----
           EADLKAKSLL APL------- ---------- ---------- -------
           EADLHAASLN MIYLNVLMLH ELLDAILEEF AFALPAVC-- -------
           PADLKVAGLS MVYLHVLMLN VHLGALLEEF SLGLPAA--- -------
           PADLKAASLA MVYLNVLMVS IYLNALLEVF AVTLPAVCL- -------
           EADLQAASLS MVYLNVLMLA VLLPALLEEF AVAIPAV--- -------
           DAGLPTPEFL MPILCALKLG PSLGALLKEF AAPLPAPCLA VL-----
           NADLQAANLL MPLLSALKLA LILGALLNEF ALPVPGLCLA SE-----
           AAQLLGACLL MPLLDTLLLG LAIA------ ---------- -------
           NAQLLVARLV V--------- ---------- ---------- -------
           QANLLATNLL MPLLDALLLA YSLEALLND- ---------- -------
           SADLLAGTLL MTLLDALLHN LELAALLAEF GKVMPALVLA GLAVFLA
           AADLYAASLL MPELAALLQE VELAALLSNF AKSLKSLVLA GLAAFVG
           TAELAAARKV MPLMGALLQN LLLGALLANF ARPLPAHNL- -------
           EAGRGSAHLL MPVCVVLLLA LLYGALLTQM VASLSSPTLP GDALLL-
           EAELASAAAL MALKVALLHD LMLGAGLTNF VSGLPALTLA EIC----
           EAELTAQALL MALKMALLHE LVLGALM--- ---------- -------
           QDELVAGPIA KPLLSHLLVS TRFAALSRSF SQQLPQL--- -------
           VAEVVAAVLV MPKISALLVC LLFGAMLAEF EELLPQNPLN QIAQ---
           DDELKAATAR DSLLSALLLK LEIGAAGSAF QRPMVQLVL- -------
           EADLLAATIL AALMTALLLS LILRAQLTE- ---------- -------
           QAQFRGADLV PPLLSALFLG LGSSAQLTDF A--------- -------
           DTELPAASHA IVLLSAFLQS LILGSLM--- ---------- -------
           DAMLLATALL AHLLCALLLP VVSGALLTQI EEPAQAIYLG GLAAFLR
           KAKLRAVPLL APLLTALELS LVLGALLD-- ---------- -------
           PAGLVVANGL SPTLNGLFLS LCQKALIADL EAIFEGAGLT GLVNFLD
           EAGLLAANGL DPNLSGLFLG LVQSALTTEL DGTAPDLSLT GLVSFLA
           EAGLLAATLL HQNLSALLLK LYLAALLSDL ESDASPVVLI ALA----
           EAGLLAATLG DQNLTAQLLK LYLTALLSDL QSGAAPVDLI TLAAFLT
           AAGNLAATLL DQNLTALLLK LYLTALLSDL EAGAAPVDLI SLAAFLT
           AAELFAGTLL APLFPDLLLF LFLRAQLGEL NLQLPDVVLA ADDTFLT
           EAKAVAAVLL APLLSVVLVP TLLGAALKEL KGALVAVM-- -------
           AAKSLVASGM GPLGNAILLA LLLASSVIIL TGRLCPVPLG GLIT---
           AAKSLVATGM APLGNAILLA LLLASSVIIL KGRLCPVPLG GLVT---
           GAGLLAPDLL APLVCALLLV LNLQALQDPL LHPLPRILLP E------
           DAGLGAPFLA DPFLCALLLL PHLEALLDLL SH-------- -------
           EATLSAPKLL AALVCSRVIS VEMGALV--- ---------- -------
           EAGNIGPTLM APLVCALLLT LSLNA----- ---------- -------
           TAKLLGPDLD APLLCLILLI K--------- ---------- -------
           TARLLAPELD SPLLCALLLI NIIEAALAEL SGKFSKLVLP EGAT---
           TAKLLAVELP APMLCALLIL NIIAAALDEL SKRFSKIVLP EGA----
           TAKLLSPDLD APLLCALLLI N--------- ---------- -------
           GAELTAAQLR APVLTTLLLV LALAALLA-- ---------- -------
           GADLAAEQLR APVLTTILLV LLLAALLA-- ---------- -------
           VAQLLGSPLL VPVLSTVHLC YYVGAFLRAF AFCPPLLVVS ALATY--
           ---------- ---------- ---------- ---------- -------
           AAALLVARLL APLLDKFLIG IPFMALLGDL TKLRPSNQLT GFDAPLA
           SAALLGAGLL APLLAKFLLD IAVDAFLRGV DEWPRKLSLA EFAQPLT
           EAEAYCASLR SPLLKALLLL VVHTALLTGK APPSTNQVLV RLDKLLE
           SDKLHVALLT AALLPTLVLG VDVSAVLTKL PIPLDSV--- -------
           KIALFTALLM APLRPTLLLG TGLKALFALL TSPLGDLNLA CIAKIL-
           GPSLNSSELD PNLDPTLLLA ELVTAMLVEL PAEMFSHVLA ALAKALP
           VAGLSAAALV ASLETTILVP LLLGALLMGI PASLGNTVLK G------
           SAEILSSKIE SPLLATLLLN ELLGALISDL GGH------- -------
           TAELGSAKGL APLLHYLLFG VKIGALLGWI APPTPKIL-- -------
           GSKLLAAPLL ATLLQTLLLK VFLGAKLGDL TDHLPGLVLA AV-----
           KGELAAAQVV APDLSALLKM AVLTAVLHKL PGALLDLSLG SLAKLLV
           SGELNAAPVV APDLRAVLFD LQLKALLHHL PEPLPSLSLG SLINALI
           KAELLAASLL APLLVKLLLR YPLGTLLPKL GGGFPNLMLA RLTNVLG
           KAELLAANLK APLLTKLLLK YLVGTLLPML GNGFPQLM-- -------
           HSQLLEAALL APLLSTLLLG YLLGALLKIL DERLPDLVLE GSSELLA
           ADELLAAGLL AHLLSTYLGV FQLAALLARL GHSLAEGDLA GIPTLLR
           SAGLIAASRV APLLSGTGL- ---------- ---------- -------
           NAELLNAAVL AKLAGSALVG Y--------- ---------- -------
           PVELLNATVL APLLGTL--- ---------- ---------- -------
           EAALPAAAFL APHFTALLLR VYRGAILGKM KKVMPGLVLE GMA----
           VAAVIAASLL ASLLTALLRT LLRSTLLSNH EHALPGLGLG ELADLLD
           ---------- ---------- ---------- ---------- -------
           NYQLLTASLL SPLLQSLLLF VVLVAPLTIL RRALPNALLT FLHN---
           EDVLAPMGDA ALLTSPLLLS FFVAAVLSDA AAPGGDALLA FLSA---
           AAALIAMHLM SPLLTPLVLF TLVAAAAADL NGALGELVLA LLESLLG
           KSEVLPVTQL APFLGALILR NNLVALLQHL EPALPDLLVA GLMPLLP
           DAELLSANLL AQLLRALLES LFLGTLLHEL KSVVPDLLLV GM-----
           EADEGEARLL ASLMAQMLLF NTVYANLNDL KVGVPEVLLK TVQVLLS
           KAELFMATLL DSLLAKLVLF EELNAFLGEM AQVLPQLLLA GP-----
           APELLAAILK ANLLAPLLTS LPRAALLPAL NLVLAKVIVS VLQG---
           DAELVAAGLL GPLLRALMLF L--------- ---------- -------
           EAERVAADLL CSLLSALVLF LLLGALLKLL ---------- -------
           DANLVAANLL TPLLSALVIY LAHPALLSTL LVALENALLA E------
           TAELVATKLL APLLNALLLS DKLGAMLV-- ---------- -------
           TAEFMISQVI APLLGTLLLS TTLNAVLSDL QAPLADEMLA GLDPYLG
           LAELTISTVI EPLLSSLLLS SPLAALLTDL QSALPGGVLA GLEPLLA
           AAEFLITRVA APLLGSLLLT CGLGALLYDL LPQLPDDILI SLGELLA
           SAEFLISQVI APLLGSLFLS CGLGAKLNDL LPHLP----- -------
           ---------- ---------- ---------- ---------- -------
           KGHLMKSDVL APSLGTLLLK VSLSALLSSL PTGSPEVLLA LL-----
           GGDLTRPTVL CPSLGALLLS LVLSALLTKL ATAFPELLLA GT-----
           GGDLTRPTVL CPSLGALLLS LVLSALLTKV ADAFPELLLA GT-----
           SAEALASIVL PTLLSGLLFP KRLGAMLADF ADAMTALVLA SVDPYLG
           DAELLASNLT SDTLAPILVL LILTSLLSAL SEGLPKLVLA GIYALLD
           DAELLASNLT SDTLAPILVL LILTSLLSAL SNGLPKIVLA GNYALLD
           DAELLASKLL SDTLGSIMYL LMGSALIDAL CAGLSQIALD GAEALLD
           DAELLASTLL SDTLGSIMYL LMGSALIDAL CAGLSQIALD GAEALLD
           DAELLASKVL GPLLGTLLFT LLLVADLTEL MEGLTALALD LLEPLLV
           HAQLLKSGVT SQLLGNLLLA LLLGALLD-- ---------- -------
           QAGLAEAVVL APLLGRLTTA LPVGALLNSF A--------- -------
           ASNLYPARVL TPMLAAMLLE LLLEALLAPS ATGLPAPLLA DL-----
           QAHLLLARVL APYLSALLVT LILGALLAPL ETGLPE---- -------
           NAELKSGDAL APLLSGLFLS LMVGADLYGV GQAAAGALLA TLVA---
           SAELSAGQIL APVLFGLFHG PLLGAHCLAL ---------- -------
           AAPHAAGGLF TPLLSGLFLS LAFQAFTCTE TETPRDVLMS SVVALLG
           EAELGENGVR AALLSGLVLA LLLAAPSYAL TESLASISLA SLTKALS
           EAKLGAGGVL FALLSGQTLE VLLAAQSYPY TETLSRTLLA KLTK---
           EAKLGAGGVL FALLSGGTLE VLLAAQSYPY TETLARTLLA KLTK---
           EAKLGAGGVL FALLSGQTLE VLLAAQSYPY TETLARTLLA KLTK---
           QADILDGEVA GRLLSGLFLG LPMSARLYIL R--------- -------
           QADILDGEVT GRLLSGLFLG LLMSARL--- ---------- -------
           GGELIAGYVL APLLGGLFLN LLTAAHAWDL DAGMAELLLA AEAGLL-
           GGELIAGYVL APLLGGLFLN LLTAAHAWDL DAGMAELLLA AEAGLL-
           QTELDAGNMK APLSSGLFLA FYLDASLKDL SCGIADLPLT M------
           EAGLGAGAVL VHFLSGLFKN LLLSAKLSEF AQGKADPLLL GIATILA
           DAGLGANATL LPLLSGLFRA LLLAESLRDD APALAAEMLG GLEV---
           EAELVAANIY PGMISALIKP KHVGALLEDL QFN------- -------
           DSELLADPLF SPLLSTIIKP HDLLAVVKDI QIDTPTKLLG GLP----
           ESDIPAEPMY TPMLKTLKKA HRETAIR--- ---------- -------
           ESKIPFAIIY IPMLSTL--- ---------- ---------- -------
           ETKIPFAIIF IPMLSTLKKA HSLGAQVQDL SFDHPDKLLG GPA----
           ESKIPFAIIF IPMLSTL--- ---------- ---------- -------
           ESSLLAAVIY PPMLSALIKA DLIEALLKDV DPD------- -------
           ESELLAAAIF QHMPSALIKP HHLFANLQDL DVDLPSDLLG GCAAM--
           DSSLIGAVIF PPLVAGLIKH DHLGNLLHDL EVSLPENLLG N------
           ESSLLGAVRF PPML--LAKH AYLGAMLHDL ELDLPKKLLG SYAAALS
           EAELAAAALF PHLLSALIKH KHCVTLLNEL EIESPIKLLA GLRALLS
           AAQQLAAALF PHLLSALIKL KKMAALLNDL QVRLPLKLLP VLRDLLS
           EAGLFSAALF PPGLSALQKS KHLGAVLTDL QTSLS----- -------
           DLDLVALDIF PPMKSTPVKP GSNAALIADL QA-------- -------
           AADVEPAQLY PPMLASLINP DRMAALLGIL PDKLN----- -------
           EAELLPAIAY PDKLAAVIKV PCLAALLEIM PGNFDG---- -------
           QAELVDAEMY QPMLSALLKP QSLGALLNNL QGHTPEQLLV G------
           ISDLLAAMTF PPMIGGLIKQ EHMERALVDI VPKVPR---- -------
           EAELLAAVMF PPMLSALINV EHMGALLQDM TAKITGNLLV SLTAVLN
           EAELLSAFLF PSKLSALIKP KELGALLHEL QGKVPDGLLA RFATL--
           EAAELSATLL PPMLSALIKP KHLGALLHDL HPKAPEGLLA RLNLLLQ
           EAKLLSADLF PPMLSTLIKP KSLDAMLHEL QEKIPDGLRA RLATLLN
           EANLLSATLF PPMLSTLIKP KNLGAQLHEL QDKIPDGLLS RLA----
           EAELLSAALF PRMLSALIKP KHLGALLHEL QHKLPEGLLA RLSAL--
           EAELLSAALF PRMLGALIKP KHLGALLHEL LHKLPEGLLA RVSAL--
           DAELLSAALF PRMLSALIKP KHLGALLHEL QHKLPEGLLA RLSAL--
           EAELLAALLF PPMLSALIKP KHLGALLHEL QAELPDALLA RIAALLN
           EAKLFAAVLY PPMLSALIKP KNLGALLHEL QAKNPDAMLA RSAAL--
           DAELYTALLY PPMLSALIKL KHLGALLHEL QAKLPDAVLA RLAALLH
           EAELFAALLY PPMLSALIKP KHLGALLHEL QAKLPDALLA RLAALLN
           AAELLAAALF PPMLGALIKP KALGALLHQL QAKLPDGLLA RLATLLN
           AAELLAAALF PPMLGALIQP KALGALLH-- ---------- -------
           EADLLASLLL APLLAALLID ISLGAVLEEV KGDLLATLL- -------
           QANLTAALLV APLLKALLLV SVLHAKLSDL VQNLGEVLLS AIAKLLP
           EADYLPALLL APLLKAALLI IVLCAMLNEL LHALPQAILA ALGVLLG
           IAKLQAAAVG ATMLSALLQE LSLGAKLA-- ---------- -------
           GAGMAAVSVV EVLLESLPSI ILLAAKLNQL AREYPACLLA SL-----
           QADLPPAPLR ARTLSALLLT LGQGALKHEL DS-------- -------
           EAKLAVAGLQ APMLDPLSLD LCEG------ ---------- -------
           EAKFLTGILL APMLQALFLN ILFGAILQAS GAEVSSLLLG GMAGL--
           EAELIATRLL AG-------- ---------- ---------- -------
           QADLLAAGVL ALKLSALLLS PNLGALLRNL A--------- -------
           GADLLAAKLL APILGPLLIR ALLSALLRAH SPNLPELLLA SLA----
           QSETDFSGGL ATMSGGMLLG LTLGALMKDL ASTIPTRVLA TLAALLP
           TSEVNASGGL ATMQGAMLLN LAQGALLHEL TADLPGLMLA AAV----
           TSETSASGGV ATMEGAMYLA LSLGALLVEL TQDLPGVELA SLAE---
           LAELLPDCLA APLLNRLNLG LLLGAITKQL GHDLRKTVLA D------
           LAELLADSLA APLRFSLKLG ILLGAITKEL GHNLRDTVLA G------
           DADLLAIALL AA-------- ---------- ---------- -------
           AADLFADCHL ---------- ---------- ---------- -------
           NAQLLAAPVV ASLQSALLVQ LYLCAVKSEP TNEVAALML- -------
           DGVIYSVGLL GDLLSPHLLA SFKPAVMSKL ACPAPPMAVG SPNLMLI
           GAFIFSVQLL ADMMSA---- ---------- ---------- -------
           GAFIFSVQLL ADMMSA---- ---------- ---------- -------
           HGAFIAAILI LPLPTALLAI LALGANLEGF AEDL------ -------
           QPSLFAAVLD KPLLCALLLD LALGAM--DL EDDALNLLLA SLV----
           ESELVADDLI APLLKSPLLT LLQDGVIQEL GPDMPQIFL- -------
           DADLSAASRA NPLLSSMLLA LFLSAITAKL QIEKPQLFLA AL-----
           SADLPAAPRL APLLSSLLIH LSLGAILTEL SADMPQLYL- -------
           EANLVDAAKL AQLLSALLLA LFLGAVLKEL ---------- -------
           EADLLDAQKL AQLMSALLLA LFLGAALDEL GLD------- -------
           EADLLDAQKL AQLLSALLLV LFLGAALDEL GLD------- -------
           IADLAAAILF PPLLPDLP-- ---------- ---------- -------
           EAHLGSARLK PPLLQD---- ---------- ---------- -------
           ---------- ---------- ---------- ---------- -------
           GAENLAVKLL ANKLSSLLFS LLL------- ---------- -------
           KAEHCEVSML ANLLPRLLLG LEPRALLQCL AMAKTGP--- -------
           KADDMSVMDH ADVLPAMLLA FDTCALL--- ---------- -------
           GADDLAVMEL ADALPALLLA FPCSAL-NCL TLAKTLLTLS NTSPLLC
           KADRNGVRHP CDLLASLLLK LLLEADLG-- ---------- -------
           AADHFAVKLL CELLSGLYLL L--AALLDDL NSALVNLELA SPATLLG
           KATHFAIKLK CNLLSALLLG QLLN--LENL ALAQVDVELE EVAPVLC
           KADHLGVKLK CNLLCALIIS Q--------- ---------- -------
           KADHLGVKLK CNLLCALIIS Q--------- ---------- -------
           KAEHMAVKLS CDLPRPLVKK HVLNAILL-- ---------- -------
           KAEHLAVGMK CELLSPLLLD QVLDSLLDTL A--------- -------
           EAEHKEVNQG CDVLTALLKT VGLGAVL--L TLGGVQILLA EPQPLL-
           KAEQIAVHQL CDVLAALLKI LGL------- ---------- -------
           EEDYLAMNLL CQL------- ---------- ---------- -------
           KADFLAVKVL CAL------- ---------- ---------- -------
           KENFLAVKVL CQL------- ---------- ---------- -------
           RADVACVALK CVLLDALVLA EGISALLGNL A--------- -------