File: pfam3.seed.txt

package info (click to toggle)
python-biopython 1.80%2Bdfsg-4
  • links: PTS, VCS
  • area: main
  • in suites: bookworm
  • size: 76,328 kB
  • sloc: python: 316,117; xml: 178,845; ansic: 14,577; sql: 1,208; makefile: 131; sh: 70
file content (42 lines) | stat: -rw-r--r-- 5,542 bytes parent folder | download | duplicates (3)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
# STOCKHOLM 1.0
#=GF ID   12TM_1
#=GF AC   PF09847.11
#=GF DE   Membrane protein of 12 TMs
#=GF PI   DUF2074;
#=GF AU   COGs;
#=GF AU   Finn RD;0000-0001-8626-2148
#=GF AU   Sammut SJ;0000-0003-4472-904X
#=GF SE   COGs (COG3368)
#=GF GA   33.20 33.20;
#=GF TC   33.60 33.20;
#=GF NC   33.10 32.90;
#=GF BM   hmmbuild HMM.ann SEED.ann
#=GF SM   hmmsearch -Z 57096847 -E 1000 --cpu 4 HMM pfamseq
#=GF TP   Family
#=GF CL   CL0181
#=GF DR   INTERPRO; IPR018646;
#=GF DR   SO; 0100021; polypeptide_conserved_region;
#=GF CC   This family carries twelve transmembrane regions. It does not
#=GF CC   have any characteristic nucleotide-binding-domains of the
#=GF CC   GxSGSGKST type. so it may not be an ATP-binding cassette
#=GF CC   transporter. However, it may well be a transporter of some
#=GF CC   description.  ABC transporters always have two nucleotide
#=GF CC   binding domains; this has two unusual conserved sequence-motifs:
#=GF CC   'KDhKxhhR' and 'LxxLP'.
#=GF SQ   7
#=GS O29855_ARCFU/39-477  AC O29855.1
#=GS O29125_ARCFU/30-435  AC O29125.1
#=GS Q8U2D3_PYRFU/39-485  AC Q8U2D3.1
#=GS Q5JDA6_THEKO/35-482  AC Q5JDA6.1
#=GS Q97VM1_SACS2/39-451  AC Q97VM1.1
#=GS Q9HM06_THEAC/17-497  AC Q9HM06.1
#=GS Q6L2L5_PICTO/38-510  AC Q6L2L5.1
O29855_ARCFU/39-477             WIRYNALLLKIMFTFAALFSVGPAFFDDKVS....YASSLLSLFFFFLMFGTAYAHGYFQVDL...SYMHTFYSRSDISKVRFYGFFRLFDWPAVIALLS.....LLVLVGMRNPAGLLPALLGFLAVIMGALSIVILLGKRLGSVQTGR.SLRAAFFRIFGLIAWLVSIYGLYLINQLAI......YLMTFKNYEAYDSLFP.......ISYGLWISQPFSAKYAALSLFYF.ALITLLFFYAVRELSKE....EIAKHYGSLK.GWKIKRRGKMTAMVIKDFKQLFRNPQLFVIALLPIYGALM..................QLVFYIKLSEVASVLYLQIFLAITVSSFMSLERSSYITALPLTDLEMKFSKILEGLLIYF.VSMGIVAAVVIYKG.GNLINSLSLFPTGFAVVLVAVQFSRRL.......TSEPVNVE...AVIATLISFFIVLVPAAVGGVAVLILKAPFS...SYAFPVSLAETLAVLAVFALLNRRK
O29125_ARCFU/30-435             SLRVQVAKSFFIMTFLGSFLCWVAFISSGLG.....LSLIFTLSLVFSQIYPAQNIAISASS....RVFEPLRYLPVRFSERMLVVF.FIDSINILAFAT...PTIAVLMVKNLYFGLYSLLWIIAAILLG.YSMVFLYYALFGVKV..RSGFSKSVL..AGILFFAVLVFAL...............RRFQEIPDLTPYLTP......................HLLLLSY..AASSATIKLSTGRVWRSILNPEIVEVKGSSR....LSSGSPLRAMLIKDFRLILRKN.ALFPLIVPLVIVMPNVVSIANMPN........LSIFIITTISTLSTIDLRIIGNLENVDF........LRMLPLSKRGFVMSKACLIFVISFAASLPAGSIAFIVS..QNPFYLFMAFAIPAIVSMLSSLIIFWQ.......KGEEIYFPEV.GFLKWIGLLLVNFGAVYAVLSPRFILSQPVA......DIISSVLTL....LAMTALFEK
Q8U2D3_PYRFU/39-485             NIIWGVFLQSVMYLGLGVMVAVSILYSENEVQKAIFFSSYLIIPFILTLYSTSLATAYLLSS....KAVEPLKPLPLGNLNFIVSLTLLIENLPAFVFLI.....PASLALGNSIASLLGLLWICSTILMG.HSLALFLQIKFSGIHVGKGSVVKTLVKVAGFLI....IAGIYFIVQALMRILEDNIEVIAPIFRKYFIAFP........FAASTIYEPYKS..LVLLALYT.LPFLALYFYDLKRLGEVL...EGIKTYGKVATKYKLTVANPVTAMFRKDYRIIFRKNPYLGTFLSPLLMSIYFIYNLAKEGFPVM.....MTLFSIMGISVLGLVMLDPAFAMDREVF......PFLSSLPIKRREYLLGKMLTVSLSPLTFSAILVLLSCAFNG.TEALLLIPFLASPFLTSSIGILYVKHKM......GNERIELPVL.KFYDGIVMLILSMIPFIIVAIPLFLLSVPKG......YLVSGAIIL....VGALILSKL
Q5JDA6_THEKO/35-482             DLKKTLLFQTAMYAVFGLML.FPSLKGERDA.VLVMASTYAILPFIIAFYATVTNSSYIASL....DLFKPLLPLPIKLGGRYMSVLLLLESLPVMAFMV..PGAVRIGMVVSATSGLLVLLWSAVGLMLG.HVFGLLVYYSFGKTSSGRFADLKSLAKALGVIL....IFGLFYGFSYFQDYVLQNYTSIKESLGGYEFIYP........LSVLSVDRPSFS..APLAGIYI.AILGVAYYVLISRLWVRI..SEGSYTSGRRRRAGGLGVYPPELALMVKDFKTALRNTPVLTGLLVPIVIPIINVAGIFSNPDIGAFGGRLATITFVAALGWVSAVSVETLTKIEVKSF......ELLLSLPLERGRFLRGKLLTMAAIPSAVGV.LALLGLSLKGFSSPIYLPMAVLVPLATCGIALHVYYH........GTEGLALPQG.GILKSLAVWILNAVVVGIIAG.SWYLSYPIA......LLLTAA.......IDALLLWSL
Q97VM1_SACS2/39-451             NAVTIKISNIIAYTIATIVSASISLINKDAP....FSFIFLDLIILANIFTTGLNVIFFVTNY...DLKTFLLSLPLSERDVNRAVFRGIFEFFYYGFLA..SIVIAPISTYMITSSVLQALMAELEIIFF.FSLSFALVMLLGKRI..RLGITSALFRIGTSLIWIVFIMLPYGL...........TFKYVTIPTYILPIFP........FGFLNIEG......LLISLLYTGLSVFFAYKQSLKFLSFRL........NSQYSTKYSIKLRSPLITYLYKDIRGLLRVPQASFLLTIPVFALIFSFFAPV............YAIFYTIFMITTSSIMLIL...LEASGM......QLLLSLPAGLRSSYISKLLIILIIYL.......IDVLIFSFFNRASLSLIMLPSTITSVELSLFISYNNVI.....KGKGMRLA...DPLSFIIREIEINSIIGIASILTFFANIYYS......LLFSVLSLI....MINIVVYKK
Q9HM06_THEAC/17-497             YVNISYGATSFSFIVFSLILVAPSLMEHRIY....TLSSVVLLLFVYSLFINISNSLLFFVSVNINHILDPLRILPVDFPDHVIAVSWFIYTGSSSLFAVLPAIFLAAFLLGDPYILVIGLIWSIFSVLLG.YIIGSSIFVAFGSRISGKRTRSTNILRNVGRIVFLVFVFAIFEIILYNANIV...NGIIPRLPYPYSYFIPIFNIQSTVFFFHGIYMQATG..FIISMVYT.ALASFAFIYVNRKAFYRLLEP.TARNQSRVKTQMKAEVRSRPFSFFSKDLKISSRKSQNLVLLIMPLFFVFPTIMSEVLYAPTSKADPIILYNAMVAFVIVTSSFYSILFLVIEGNGI......SFIKALPLDGNSIIRWKISAPTFIFAVISISTLAAISVKAL.MGAAFYIIIIVDMMLYFVSSTVYNMNRLYRKIPDTADTVNFYSFGGQIAFITTFAFTGLIVGSADIFSLFLQDLLRLNAYFFFLINTVIGI....IVLLFMVFR
Q6L2L5_PICTO/38-510             TILLYYISNSLSFLFFSIVLNGIYYVKGNTN....DISSFGIILFMYIFVIGIYSSLTYINGISINNLLSPVRSLPIKVNTDVPFLSWFIYTGSSYIFIIIPSLLFYYFLVHNLNTIILGLIYAFAMLLFG.FIITAIAFI.....YSSRKPRAHTSLNNFLRILLIFVFLGFFYLIIYDPNILRAYSIYISSLPVYIKYIAFPLNIDYAVYFHPDIIATFFE..YLSSFIIL.LIFFFIYKKIRSRLFYSL..EYSEEVKSTEVTRTKIKRDSISVSFIKKDIKITARKSQNLTYILMPIIFVLPFLFTIISSRQPFLS....LMFSILSLSILISSFYPIFTLIIENNGI......LIINALPINRKDIAKYKAYFSMIFYSIIITVVSIIIMAYKN.IFNLYYVFIIPDLILIFYTAMIINLNRLIKKIPKGASTINYYSF.GVFPTIVLFIVSGIIFGLLISPGIIISEFLYHSIKMSFIFDIIPDL....IIFLIMIKK
#=GC seq_cons                   slhhthhhpphhahhhulhlsssuhhscphs....hhSohhhL.Flhshahsshsshhhhss....clhcPLhsLPlp.tschhulhhhI.shsshsFhs....hltshhlhs.hsulLsLLauhhsllhG.aslshhlhhhhGthhsuRhuhspslh+.hGhllhhh.lhulahl..h.........hhh.pl.thh.hlhP........hhh.sI.t...t..hlluhlYh.hhhhhhahhshp+Lhhpl....h.cspuphppthplphtu..huhhhKDh+hhhRps.sLshllhPlhhsl..lhs.h............hhlhhlthh.shSslhl.hhhhlEssuh.......hlpuLPlscpphhhuKhhhhhlI.hhhuh.hshhshhhph.tpshhhlhhlssshhsshluhhhshpp.......su-slph..h.uhlshIshhllshlhhulssh.shhLs..hu......hlloss.hl....lhhLlhhc+
//