GenomeNet

Database: UniProt
Entry: B9PYE4_TOXGV
LinkDB: B9PYE4_TOXGV
Original site: B9PYE4_TOXGV 
ID   B9PYE4_TOXGV            Unreviewed;     10021 AA.
AC   B9PYE4; A0A0F7UU18; B6KQX8; B9QL60; S7UI30; S8GL48;
DT   24-MAR-2009, integrated into UniProtKB/TrEMBL.
DT   24-MAR-2009, sequence version 1.
DT   27-MAR-2024, entry version 110.
DE   SubName: Full=Putative type I fatty acid synthase {ECO:0000313|EMBL:ESS29205.1};
DE            EC=1.1.1.100 {ECO:0000313|EMBL:ESS29205.1};
DE            EC=2.3.1.111 {ECO:0000313|EMBL:ESS29205.1};
DE            EC=2.3.1.161 {ECO:0000313|EMBL:ESS29205.1};
DE            EC=2.3.1.39 {ECO:0000313|EMBL:ESS29205.1};
DE            EC=2.3.1.94 {ECO:0000313|EMBL:ESS29205.1};
DE            EC=6.2.1.20 {ECO:0000313|EMBL:ESS29205.1};
DE   SubName: Full=Type I fatty acid synthase, putative {ECO:0000313|EMBL:CEL71532.1};
GN   ORFNames=BN1205_013030 {ECO:0000313|EMBL:CEL71532.1}, TGVEG_294820
GN   {ECO:0000313|EMBL:ESS29205.1};
OS   Toxoplasma gondii (strain ATCC 50861 / VEG).
OC   Eukaryota; Sar; Alveolata; Apicomplexa; Conoidasida; Coccidia;
OC   Eucoccidiorida; Eimeriorina; Sarcocystidae; Toxoplasma.
OX   NCBI_TaxID=432359 {ECO:0000313|EMBL:ESS29205.1, ECO:0000313|Proteomes:UP000002226};
RN   [1] {ECO:0000313|EMBL:ESS29205.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=VEG {ECO:0000313|EMBL:ESS29205.1};
RA   Paulsen I.;
RL   Submitted (MAR-2007) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000002226}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 50861 / VEG {ECO:0000313|Proteomes:UP000002226};
RA   Lorenzi H., Inman J., Amedeo P., Brunk B., Roos D., Caler E.;
RT   "Annotation of Toxoplasma gondii VEG.";
RL   Submitted (MAR-2008) to the EMBL/GenBank/DDBJ databases.
RN   [3] {ECO:0000313|EMBL:ESS29205.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=VEG {ECO:0000313|EMBL:ESS29205.1};
RA   Sibley D., Venepally P., Karamycheva S., Hadjithomas M., Khan A., Brunk B.,
RA   Roos D., Caler E., Lorenzi H.;
RL   Submitted (AUG-2013) to the EMBL/GenBank/DDBJ databases.
RN   [4] {ECO:0000313|EMBL:CEL71532.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=VEG {ECO:0000313|EMBL:CEL71532.1};
RX   PubMed=25875305; DOI=10.1371/journal.pone.0124473;
RA   Ramaprasad A., Mourier T., Naeem R., Malas T.B., Moussa E., Panigrahi A.,
RA   Vermont S.J., Otto T.D., Wastling J., Pain A.;
RT   "Comprehensive Evaluation of Toxoplasma gondii VEG and Neospora caninum LIV
RT   Genomes with Tachyzoite Stage Transcriptome and Proteome Defines Novel
RT   Transcript Features.";
RL   PLoS ONE 10:e0124473-e0124473(2015).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LN714489; CEL71532.1; -; Genomic_DNA.
DR   EMBL; AAYL02000310; ESS29205.1; -; Genomic_DNA.
DR   STRING; 432359.B9PYE4; -.
DR   PaxDb; 5811-TGME49_094820; -.
DR   EnsemblProtists; ESS29205; ESS29205; TGVEG_294820.
DR   VEuPathDB; ToxoDB:TGVEG_294820; -.
DR   eggNOG; KOG1178; Eukaryota.
DR   eggNOG; KOG1202; Eukaryota.
DR   eggNOG; KOG3628; Eukaryota.
DR   OMA; MYPDPAA; -.
DR   Proteomes; UP000002226; Partially assembled WGS sequence.
DR   GO; GO:0004316; F:3-oxoacyl-[acyl-carrier-protein] reductase (NADPH) activity; IEA:UniProtKB-EC.
DR   GO; GO:0004315; F:3-oxoacyl-[acyl-carrier-protein] synthase activity; IEA:InterPro.
DR   GO; GO:0004314; F:[acyl-carrier-protein] S-malonyltransferase activity; IEA:UniProtKB-EC.
DR   GO; GO:0047879; F:erythronolide synthase activity; IEA:UniProtKB-EC.
DR   GO; GO:0008922; F:long-chain fatty acid [acyl-carrier-protein] ligase activity; IEA:UniProtKB-EC.
DR   GO; GO:0050637; F:lovastatin nonaketide synthase activity; IEA:UniProtKB-EC.
DR   GO; GO:0050111; F:mycocerosate synthase activity; IEA:UniProtKB-EC.
DR   GO; GO:0031177; F:phosphopantetheine binding; IEA:InterPro.
DR   GO; GO:0006633; P:fatty acid biosynthetic process; IEA:InterPro.
DR   GO; GO:1901566; P:organonitrogen compound biosynthetic process; IEA:UniProt.
DR   CDD; cd05195; enoyl_red; 4.
DR   CDD; cd05931; FAAL; 1.
DR   CDD; cd08955; KR_2_FAS_SDR_x; 2.
DR   CDD; cd00833; PKS; 4.
DR   CDD; cd05235; SDR_e1; 1.
DR   Gene3D; 3.30.300.30; -; 1.
DR   Gene3D; 3.40.47.10; -; 4.
DR   Gene3D; 3.40.50.11460; -; 1.
DR   Gene3D; 1.10.1200.10; ACP-like; 5.
DR   Gene3D; 3.10.129.10; Hotdog Thioesterase; 1.
DR   Gene3D; 3.30.70.250; Malonyl-CoA ACP transacylase, ACP-binding; 1.
DR   Gene3D; 3.40.366.10; Malonyl-Coenzyme A Acyl Carrier Protein, domain 2; 3.
DR   Gene3D; 3.90.180.10; Medium-chain alcohol dehydrogenases, catalytic domain; 4.
DR   Gene3D; 3.40.50.12780; N-terminal domain of ligase-like; 1.
DR   Gene3D; 3.40.50.720; NAD(P)-binding Rossmann-like Domain; 12.
DR   Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR   Gene3D; 3.10.129.110; Polyketide synthase dehydratase; 4.
DR   InterPro; IPR001227; Ac_transferase_dom_sf.
DR   InterPro; IPR036736; ACP-like_sf.
DR   InterPro; IPR014043; Acyl_transferase.
DR   InterPro; IPR016035; Acyl_Trfase/lysoPLipase.
DR   InterPro; IPR013149; ADH-like_C.
DR   InterPro; IPR013154; ADH-like_N.
DR   InterPro; IPR045851; AMP-bd_C_sf.
DR   InterPro; IPR020845; AMP-binding_CS.
DR   InterPro; IPR000873; AMP-dep_Synth/Lig_com.
DR   InterPro; IPR042099; ANL_N_sf.
DR   InterPro; IPR040097; FAAL/FAAC.
DR   InterPro; IPR013120; Far_NAD-bd.
DR   InterPro; IPR011032; GroES-like_sf.
DR   InterPro; IPR018201; Ketoacyl_synth_AS.
DR   InterPro; IPR014031; Ketoacyl_synth_C.
DR   InterPro; IPR014030; Ketoacyl_synth_N.
DR   InterPro; IPR016036; Malonyl_transacylase_ACP-bd.
DR   InterPro; IPR036291; NAD(P)-bd_dom_sf.
DR   InterPro; IPR027417; P-loop_NTPase.
DR   InterPro; IPR032821; PKS_assoc.
DR   InterPro; IPR020841; PKS_Beta-ketoAc_synthase_dom.
DR   InterPro; IPR042104; PKS_dehydratase_sf.
DR   InterPro; IPR020807; PKS_DH.
DR   InterPro; IPR049551; PKS_DH_C.
DR   InterPro; IPR049552; PKS_DH_N.
DR   InterPro; IPR020843; PKS_ER.
DR   InterPro; IPR013968; PKS_KR.
DR   InterPro; IPR020806; PKS_PP-bd.
DR   InterPro; IPR009081; PP-bd_ACP.
DR   InterPro; IPR006162; Ppantetheine_attach_site.
DR   InterPro; IPR010080; Thioester_reductase-like_dom.
DR   InterPro; IPR016039; Thiolase-like.
DR   NCBIfam; TIGR01746; Thioester-redct; 1.
DR   PANTHER; PTHR43775; FATTY ACID SYNTHASE; 1.
DR   PANTHER; PTHR43775:SF37; FATTY ACID SYNTHASE; 1.
DR   Pfam; PF00698; Acyl_transf_1; 3.
DR   Pfam; PF08240; ADH_N; 2.
DR   Pfam; PF00107; ADH_zinc_N; 4.
DR   Pfam; PF00501; AMP-binding; 1.
DR   Pfam; PF16197; KAsynt_C_assoc; 2.
DR   Pfam; PF00109; ketoacyl-synt; 4.
DR   Pfam; PF02801; Ketoacyl-synt_C; 4.
DR   Pfam; PF08659; KR; 4.
DR   Pfam; PF07993; NAD_binding_4; 1.
DR   Pfam; PF21089; PKS_DH_N; 4.
DR   Pfam; PF00550; PP-binding; 5.
DR   Pfam; PF14765; PS-DH; 4.
DR   Pfam; PF13469; Sulfotransfer_3; 1.
DR   SMART; SM00827; PKS_AT; 3.
DR   SMART; SM00826; PKS_DH; 4.
DR   SMART; SM00829; PKS_ER; 4.
DR   SMART; SM00822; PKS_KR; 4.
DR   SMART; SM00825; PKS_KS; 4.
DR   SMART; SM00823; PKS_PP; 5.
DR   SMART; SM01294; PKS_PP_betabranch; 1.
DR   SUPFAM; SSF56801; Acetyl-CoA synthetase-like; 1.
DR   SUPFAM; SSF47336; ACP-like; 5.
DR   SUPFAM; SSF52151; FabD/lysophospholipase-like; 3.
DR   SUPFAM; SSF50129; GroES-like; 4.
DR   SUPFAM; SSF51735; NAD(P)-binding Rossmann-fold domains; 13.
DR   SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR   SUPFAM; SSF55048; Probable ACP-binding domain of malonyl-CoA ACP transacylase; 1.
DR   SUPFAM; SSF53901; Thiolase-like; 4.
DR   PROSITE; PS00455; AMP_BINDING; 1.
DR   PROSITE; PS50075; CARRIER; 5.
DR   PROSITE; PS00606; KS3_1; 2.
DR   PROSITE; PS52004; KS3_2; 4.
DR   PROSITE; PS00012; PHOSPHOPANTETHEINE; 2.
DR   PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE   4: Predicted;
KW   Acyltransferase {ECO:0000313|EMBL:ESS29205.1};
KW   Ligase {ECO:0000313|EMBL:ESS29205.1};
KW   Multifunctional enzyme {ECO:0000256|ARBA:ARBA00023268};
KW   NADP {ECO:0000256|ARBA:ARBA00022857};
KW   Oxidoreductase {ECO:0000313|EMBL:ESS29205.1};
KW   Phosphopantetheine {ECO:0000256|ARBA:ARBA00022450};
KW   Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002226};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000313|EMBL:ESS29205.1}.
FT   DOMAIN          813..889
FT                   /note="Carrier"
FT                   /evidence="ECO:0000259|PROSITE:PS50075"
FT   DOMAIN          908..1335
FT                   /note="Ketosynthase family 3 (KS3)"
FT                   /evidence="ECO:0000259|PROSITE:PS52004"
FT   DOMAIN          2979..3055
FT                   /note="Carrier"
FT                   /evidence="ECO:0000259|PROSITE:PS50075"
FT   DOMAIN          3078..3504
FT                   /note="Ketosynthase family 3 (KS3)"
FT                   /evidence="ECO:0000259|PROSITE:PS52004"
FT   DOMAIN          5148..5224
FT                   /note="Carrier"
FT                   /evidence="ECO:0000259|PROSITE:PS50075"
FT   DOMAIN          5252..5673
FT                   /note="Ketosynthase family 3 (KS3)"
FT                   /evidence="ECO:0000259|PROSITE:PS52004"
FT   DOMAIN          6899..6975
FT                   /note="Carrier"
FT                   /evidence="ECO:0000259|PROSITE:PS50075"
FT   DOMAIN          6999..7427
FT                   /note="Ketosynthase family 3 (KS3)"
FT                   /evidence="ECO:0000259|PROSITE:PS52004"
FT   DOMAIN          9004..9081
FT                   /note="Carrier"
FT                   /evidence="ECO:0000259|PROSITE:PS50075"
FT   REGION          694..809
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1845..1866
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          4166..4230
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          8694..8723
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        704..755
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        774..790
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        4166..4186
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        4192..4230
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   10021 AA;  1093469 MW;  5C5AA4748CA6364A CRC64;
     MMTRLPLCQQ QAPPSLAGCH MLAAFQDIPE RPLNRDLKEL WAAYGENPFG NVLECWNRSI
     PDEDAVVWLN SDGEISETTT FSELFSRITR LSSFLRAAGL QVGDRIILCY PPGTDFVTAF
     FACITSGFVA VPVYPPDPSK GLSDVPRFCD INEIAGCKTA LTNSLYKRVV QVITTVAKDS
     RWRSVHWVCT DDVIKRHAEE AKNSVGPDFP NLSPHHPAFL QFTSGSTGNP KGVIVTHGSL
     LHNCHLCWSA YQFPSHYEND GDDENITNHD FALVNQREFW RRRQEISIKS RGHRLRAFSW
     LPVYHDMGLI GFVCSPILCG ATLYQMSPID FIRRPWLWLQ GMSKYDCICC AAPNFAFEVV
     TRKMPDNVYE HLDLSRVCGF LSGAEPIRAS TIDRFCEKFG PKGVRRCAVC PAYGLAESTL
     IVTARKTFQT VPKLLIVDSF QLQHEKTVVA LKPEATVTMD SEEYQVLVGC GVPLEGVQVR
     IVDPETKKEL PPGKVGEIVV YSSSVARGYF GRPDTTKETF CYSFTDLQGK QTPPLGMRTG
     DGGFMYEGEL FVAARLKDLI IIRGRNFYPQ DIEEAVDKVA MVRPGSSAAF AIEVDGHEAL
     AVAAEVRLEE GIKGLWLRVK RQFDRSHYEQ IVRDISKSVA ANTGLTIHRI WLLRPRTIPK
     TSSGKVRRSL TKEKLVSGKL EGILLDHAHH QHQAFYHDGP GGSRSRTRTG SGATPSGGAA
     TRDGTASSNP SGDAHPKNSN TSNSITDSSH SVQRGSLRGD ACVAGGRLGP SASHGGVHHE
     KDHKHRGDGS GMSSPDQVPG AGAAASESSA HVALTEKVKQ TVFEAAKTVL GSSEIPEIDA
     PLHELGVDSI GAVEFSELIS AELQVDVEPT LLFNYPTLSD VIDFFVRELE GKGSEDLAVS
     LEKTGARETT TAIIGAACNL PGGTTSLNAF WDLLMCGVDA IAEVPRSRWD VEEYFDPDPG
     ADGKAYVREG GFIEDAEMFD AAFFRISPSE AKSMDPQQRL LLEVAYESLY DSGYTREGLQ
     RASVGVFVGC CSNDWYQVCS QLELGISSYT TTSYAPSIIA NRLSYTFGLL GPSMTIDTAC
     SSSLVALHIA AQELQGGSCM SAVVAGVNLM LSPHITVAFC KSRMLAPDAR CKTFDASANG
     YVRGEGLGSI VMKRLSQAEK DDDRILCILR GTAVNHGGRS ASLTAPSGPS QQRVIRTALR
     QAELTPGDIY FVETHGTGTS LGDPIEVGAL KSVFGASRDA KTKPLILGAA KTNIGHLEGA
     AGIAGVLKAM LALRYKHIPP NLHFKQMNPH IDVREFNVVV PTAPMAFPRE DGGRLFAGVS
     SFGFGGANGH VILQEALQQP SGDKPPCKRH SKRSIICFMF TGEGYEVSCM GRHLYETDDV
     FKEIITKCND ILKRWLNVSL LSILYPPADK KMEMDKMMCQ ARFGHTALFA FEYALAESMV
     SKGVKPDVVF GVGLGELVAA AFSGVMSLEE ALYVVTQRAK LIDNLPPFEG VMVACRVSEQ
     EVEMALSEME INSRQSISIA AVYGRKNLIL SGEQSEIDNV LMKLNIAGRW KYLPVHNVLT
     MNLVSDIIPP FTHLMSTVKL CRPKVRLVSS TTGEFVDKEA LSGRFWSRQV SQTVQLQQAV
     GNIVASGCSL FVEIGPQVVL TKLAQQSLNG NAKDVTWVPA YDTTSDDVRV FENAVMTCNE
     AIKIIGKERS LEEGQHGQEE LIFHRKPFPW TEIPHPFLSG HKFTEKDDVV FENAFPRRAI
     PLIADHAVNG TAMMPGAGML EMVGAAALAL DLRTTSGAVS LENISFERPM LIGAHRVTYE
     YNRRHSLGEG GGAGTSPRTV RTPRLVNHAY TRLGFREKSA EKAISETAGP ASTEGGRTPV
     AGHHHGSYTK AGVLEDIKDY AHNVLRRAAQ HAEEIDVHTP VPPSMAPPSI VARHPREGSY
     GSMSRTSGIM GSNDRGDGLT VPLSVEMAIR CRVTKMKEVE LSTVWDDEVT DHCFATIVIA
     DKKQETYSSL QEVRCRCTER MDVNEYYDML YDAGLQYGPR FRTIQELYRG EKEALARVHF
     VGNVKQDPFE SGFLIHPAVL DGALQAAGCL LEQSHSGNKV MVPVSVGSAT LTKVEVFQPC
     WAHVRLEETK SKAATLNVTI YDEVTREVVA SLQNAVLRQV DPSASVVATI PKDLLWKVNW
     QLVVEGDKEV VGRTPKVLFI GGNIELSEAL ENAMGSTCKT LSVDALPREK AELQKVLDAE
     QWQAIVYVEA VTAGVYDALD VMNAAVRLVQ AHYLNLNSGK AVGALWFVTR DTVFRDNGAD
     KPGVEIPAHA GLWGLAKAAR LEMEVSAGSP ISMGCLDVSG ADVSSMAQQI LHALKSKVLT
     MRETEVAVRE DGLYVSRLAP QTEVDVRGAI EFHMPERGAI SNLVLRPQAF ASRLPPEDGQ
     IELRVRAVGL NFRDVLNVMG LYPGDPGPPG ADCAGVVVAV GDNVEHIEVG DCVFGVAQGC
     LRTYVTTDSQ LLRRMPESMT FEQAAALPVV ATTVEYALHD LAKVKAGDKV LIHAVTGGVG
     LVAVQLCKRV GATVYGTCSG GKKDAFAKSM GVQYVSSSRD AKAFAEDMRT FLGEDGKVDV
     VLNCLIDDFI PESLKLLGPY GRFMELGKRG VWSKEKVAAE RPDVQYELIA VDEMMEKDPI
     WFGGMLVRVR GLVAEQKIQP LPMHVFDLAD TGENGGVAAF RFMQRAQHMG KVVLKIPSPV
     EEASKNGSIV ITGGLGALGM VVAGWLAEEG ARHICLLSRR EEAPQPGEIP GWDWLRTINC
     DVQFFSCDVS KYDSIRSVLR TIQTSVAPIT GIIHTAGALA DALLDSQTPE LMKKVYNGKV
     QGAWNLNRAL EELGLNEQLK LCVFFSSVTS LFGNFGQANY CSANACLDAL AQWRRAKGLC
     CQSIQWGPWI EQGMAAELKQ QVERNGMKGI SNEVGLRVLH DAIRQQEKAT PVIGCQAIKW
     KLFMHRYIEQ PPFFTNIEMD LQVTSHDSNT VLRNLPPSER RDYIKAQVAA AARQVLGSSS
     PPPFDTPLQD VGVDSLGAVE FRNVLSKKLG VKLSATTLFD YPTLNAIIDH LCEVFGESSQ
     KDNLSTKLGA LSENDLVQEG MAIVSMSCRL PGNSTCPDLF WDMLMRCTDC VCDIPLTRWN
     AEEVYSEDPD LPGKCYTRRG AFIENAEYFD NSFFGISPSE VKHMDPQQRL LLEVAYEAFA
     GAGLTKEKLL GANIGVFVGA CNHDWTYLIQ EDKISAYTGT GSAGSVIANR VSYCLGLRGP
     SVPVDTACSS SLVAADFAFE KLMLRDCTMA LVTGVNLMLT PHLFIAFSKA RMLSPDCKCK
     TFDASADGYV RGEGAGAVLM LPYADAKAKQ MTIQAVVKGT ACNHVGRSAS LTAPNGPSQA
     EVIRMALRRA GMKPNDLKFM ESHGTGTSLG DPIEVNAVRT VFAPGREKTK PLLIGAVKTN
     IGHLEGSSGI AGLIKACLVL SRREIPPNLH FKTLNPHINI EDFPVQFVTD FVSLDSVEST
     RKKNTAGVSS FGFGGVNAHV LLEEATRDQN ATRKEENRPK ICFMFTGQGS QYVGMGRHLY
     EKEPVFREAI TRCVEIAKEY LPLPLLSVIY PSPDERDKMQ EAINETLYSQ PAVFVISYAL
     LELWKSRGVE PDMVMGHSLG EYIAAVCSGV ISIQDGLKLV LKRAQIMSRA PSRNGVMVAC
     RATAEQVEEA IHTLFGDGEK NVALASVNGL KSVIVSGNQQ DVQAVLRQLG AASRAKMLRV
     SHAFHSPLVK DTVAPFKRVL ETVRLSKPRV MFVSTVTGKV ARDELQNTNY WAEHITKVVR
     FCDAVQACIA QGAKIMVEVG PRPTLSNMGK GCLPQTTADE VQWINSLEPE TLSSAVFDSL
     SALAKAGSGA MPAVFDRKFF PWTETCHPLL GGKNTRPDGS VVYSALLRDL VKALFLDHVV
     YGKALLPATC LIEMMASATT HMISNHDEEK LVALSQVVFE KPLLVPDETS LSVLVTVETR
     GQVRVASQVS NSEEEAEDHA VCQVNIVDAL PAAPALTEIR ARCKTEIEAS QLYAQLHNVG
     LQYGPRFQTI VKVFKGEDEV LGLLQAQNVT NFETGFMMHP AVLDGALQLS AVLAFEKGSS
     RAMVPFGVDT VLLKSCPPDK ELWARVVMNR KGVSSASATV HLYDFDGEPI AVLMGVTLRP
     IEFNTAADIP RELLWEVEWE SAGSYSMPQD GSLATPSGTD DSGAEDVTRT PSPLVTGENE
     NTPSREVPSA LPGLSTETPR ASKPLSTDET SGAVLNKSVH SVLLVDYAGG HREVVASIRD
     TCDVEVEVLS KPGSPELYAQ QLRQFAEVVA TKTETVPLST ILFLALDEAT PAEQLAGGLL
     ETVKVFAVEK FKKLPCIWVL TSGAQGPTRS VVSPKHAGLW GLMRTVRLEL EMQIGKMVRF
     GCVDLDKNLE VSVALHFILQ KRTEEPFEAE MILRCPSEDS EKEPGTAEVV NRRERSASFV
     SVGPEPVVPS TTDASAKSSV TSDRMALTPG ALSCFDETAN STEKLIFNAF CPRLHRSPIP
     VRGAVELHMP ERGAISNLVL RPQAFASRTA PQSHECEIRV RAIGLNFRDV LNVMGLYPGD
     PGPPGADCAG TVVAVGSGVR HLKVGDSVYG IAQGCLRTYV TTNGNLMRRM PESLTFEQSA
     ALPVVAATVE YALHDLAKVK EGDKVLIHAV TGGVGLAALQ FCKRVGATVY GTCSGGKKAA
     YAKSLGVKYL TSSRDASAFA EDMREFLGEE EKIDVVLNSL IDNFIPESVK LLAPGGRFME
     LGKRGIWTKA QMAAERPDVY YETIAVDTMM QEDTVWFGGM LDRIRDHVDS GNIPPLPLHV
     FEMTDPLAGG VAAFRFMQRA QHIGKVVIRI ASDLEVKIPS TTEPAATETA MLAEAGTQGN
     ASIGTSTQQI ADRTYLITGG MGGLGLVVAQ WLVEEGARHI VLLSRRGEPS DTVKQSSIWE
     YLTEKERTCV SVLPLKCDVS VKDDIVSAVK EIEARGFPPI RGIFHAAGVT ADASLANQTR
     EAIEQVYLPK VTGAWNLHEV CESMGLDKNL DMFVMFSSVA ALLGNFGQAN YSAANACLDA
     LAEYRRARGL CAQSIQWGPW IEQGMAADLR QALDKVGMRG ISNELGTRVL SEVVRNPRRA
     TTVLCQSVIW KKFLQRYERV PPLFKNTRAG QGATADSSLM LRTMTAEERR AYVEKTVREV
     VKQALGLADS PPMDLPMQEL GIDSIGAVEL RNALSSRLGV KMPATAMFDY PTLDAMVDYI
     NSTLAEQCAG TEQDVSAETQ MNLVPQIMHR LEGNIAVTSV AVHLPGSVHN AEEFWDMMHQ
     MRDCITEVPL TRWNMYTFFD PDPDTLDTSY AKLGGFIEDA DMFDNTCFNI SPAEVRVMDP
     QQRIILEVAY EAMVSAGYTS SGQASQQSIG CFIGCCNADW HMLDVPSGPF TGTGSSTSII
     SNRISYTLGL RGPSLTVDTA CSSSLVAMDN AMCKIIEGSC SAALVGGVNL MLSPHLFICF
     SKARMLSPDC RCRTFDHRAN GYARGEGAGA VFLKPLQAAR KDDSTILAVV RGSAINHDGR
     SASLTAPNGP AQQAVIRAAL QSGGVKPADI TVLESHGTGT QLGDPIEVGA IKAVFMHHRG
     ADNPIYIGAL KTNIGHLEGA AGIASFIKLV LCLRRRELPA NLHFQQLNPH IDMTDFSAVF
     PKATVDLPVT RKLFAGVSSF GFGGANAHVV IEEHFELSNA AEAPQKQTAP VKRERRSFPW
     YQAFHPMLGH ETKTEVDERK IDMEIRRDVF DLLAQHMVEG RPVMPAACYL ETLTSAACTS
     EFLHPCSRKM LRPNHHAAVT LEDIEFQRPM ILEAPEGHGH QIFQILTASL GPNGNASISS
     LKTGDEEQFI HATCRLSSVN DTTAFTRTPE ENLKFLDEDT NRVPVDVGEM YEKLWKNGLM
     LGPQFRTIHS IECSPRTAKV KLQLPQTPTS CELGFRMHPC VLDGTFQTVG PLVLAHDEQA
     RKDGSSSNVG TMKLMIPFMI QRITLTSMHG VSNALWAHVA LVKRESNQAI VNIDIMSMTG
     EPVCCLKELT MRRLDPVPVA EIPRELTWTV DWEEGQNVDP VPANAVEVVA IDPLGMQHTV
     IKELKQKGFA SVQVFERDSS VEEVLRVLRI SSSGDSPTSL TPASAAAGEA AKSTSLPHRV
     VLFLGTGGAN DAIEALDCLL QLSSGLEKAA KEDRQQEFYP IWILTKGSQS HRDKGRVSNP
     VHAGLWGFAR TARLEISAQT SKEASIGCVD LGPECRVADV VQHLAAIQSQ EKYEREILFD
     EVEEVEDQKR EKGKPPSGSK RLKHFVSRLA RSPLNVRGAL ELHMSDRGSI TNLTLRPQSY
     AARVTPKGDM VELRVRAVGL NFRDVLNVMN LYPGDPGPPG SDCAGTVVAV GDRVKHLKVG
     DSVYGVAPGC LKTYVTTNSQ IIRKMPASMS FEEAAALPVV ATTVEYVLND LAKVRAGEKV
     LIHAVSGGVG IAAVQFCKRV GAVVYGTCSS EKKREFVESL GVKYVASSRD PEKFTAEMKT
     FLGEEGRVDV VLNSLIDKFI PASLALLGKD GRFIELGKRG IWTHEQMRAQ RPDVYYEAVA
     IDVMISENPR WFGIMLDRVR TLVERKQLLS LPLTVFNMND PNEGGVAAFR FMQKAQHVGK
     VVVAIPSALD FERSTVGPPT TPQKTYVITG GMGGLGLVVA QWLVEEGARH IVLLSRRGEP
     SDTVKQSSIW EYLTEKERTC VSVLPLKCDV SVKDDIVSAV KEIEARGFPP IRGIFHAAGV
     TADASLANQT REAIEQVYLP KVTGAWNLHE VCESMGLDKN LDMFVMFSSV AALLGNFGQA
     NYSAANACLD ALAEYRRARG LCAQSIQWGP WIEQGMAADL KALAEKAGLR GISNELGIRV
     LVEAVRNSKT VVTFLAQSFI WKRFVQRFDV VPPFFSNIEF ETASPGAFRV LYSSMTPDEL
     HTYVSNVVLD TARQVLGTSE LPALDSPLQE LGIDSLGAVE LRNSLSQRLG VKLSATTLFD
     YPTIRAIIDY IVNQVSGEAS AAKATSGALV SAGLAAGRNS PIAVVGYSCR LPQGSETPDK
     LWEMLRRAQD CVVEVPLTRF DVDMFYDSDI DAKGKMYVRK ACFMDDADMF DNSFFNISAA
     EVTYMDPQQR VMLEVAYDAF YSAAYSREHL IGKNYGVWIG CCNSDWHFLE QQSNPDKSSS
     YSGPGGSGCL VSNRLSYVFG LKGPSMTIDT ACSSSLVAAD CGAQAMRFGY CDGALIGGVN
     LMLSPQLFLA FCKARMLSPD CRCATFDEKA NGYVRGEGAG AIMLRHLADA QREGKRILGI
     VRGTAVNHDG RSASLTAPNG PAQQDVIRSA LQIGGVDPLD VALVESHGTG TALGDPIEMG
     AIKAVYGAGR SADSPLVVGA LKSYIGHLEG SSGIAGILKV LLCLRHHEVP PNLHFDTPNP
     HMDLSDFHVV LPKKQMKLQP AQKSSTILGS VSSFGFGGAN AHVVFEEYPE PTMATDVGGE
     RRSTEAAESA GKKRLAFLFT GQGSQYPNMC KQLYEEEPVF RSHMDACFTV LEGKLDVKLR
     DVIYPTAEND VEMLNLLNQT GYSQPAIFCV EYALAQLMIS KGFSPSVVMG HSLGEYAAAA
     IAGVMDWKDA LLLVHERASI MQHIDPNDGV MYACRASEQD AQAAVEKALG EKAVAVTVSA
     INGPRSIVLA GTQPDVLAVL KAMDMQSRAK QLTVSHAFHC PLVGEAAALL LPKVESVKLS
     NPNEGITFIS TVTGKAVGAG ELTKPEYWAT HITKPVQFLQ GMRAAVTSGE AGIFIEIGPR
     PTLVNMGQQC VPRNDYTWVA PVDPKDSNRN SFPQALEKIT SNMVCTYTWK RHAFPWTVVV
     HPFVDKFVPD QDETKATSQK LLTPAVGELL HDHVVNGTAM MPGVGFLEAM AAGGFMMVDG
     KLPSTVVALR DCEFERPMLI PKIDPSTGEF TQAAKLMITV NNKDITLKSI MDGEEEEIFH
     SRCTFSLCPA TSLPTEPADL LTSLQGRITT PVPITTLYET LQSVGLQYGA RFQSIRECYA
     AASDEVLAKL RPKLPLHSFE RNFRIHPVLL DGALQTAAVL FAEMGHKRPL VPVGVKRALL
     ARVPAGSEVW SHVVVKSKDL RSATMDVTLF STKGTILGQL VDVAVRAFDA VAGAQIPKGL
     LWEVSWVPKA KAPVEDGVDT VENEASETIT PTAKLAVKPT GKVEGLARQP AVNGQPKWLL
     LKTPSSMLEG LQKALQGTPS AFVKELDEKH PNNAISSKIH NSEWDAIIYL GALSAAASST
     TCVADALLLA NTLGNTAAEK PLPNVLFVTQ GQHFIRECGD VPTTAVPTQT GLYGFCRSAQ
     LELENIIGRP VYLGTLDFAA KDPTLDDPAA FVSALQRVLA EADPTGAAGT CEPHTVIRGG
     KEMVPRLVKS SIECRGAVEL HMSDRGALSM LKLRPMPKSA RVPPPRDCVE IRVRAVGLNF
     RDVLNVMGLY PGDPGPPGAD CAGTVVAVGE GVEHLRVGDA VFGIAQGALK TFVTTSAHLV
     RPLPGRLTFE QAAALPVVAS TVEYALHDVA KVKKGDKVLI HAVTGGVGLA AVQFCKRVGA
     EIYGTCSGGK KEEFARSVGV KYITSSRDPK KFAEDMCEFL GNGKHKIDVV LNCLIENYIP
     ETLKFLAPNG RFVELGKRGI WTREQMGTVR PDVLYETVAI DTMMEEDPVW FGGMLDRIRN
     LVDGSMLQPL PLHTFEMTDS SEGGVAAFRF LQRAQHIGKV VIRLSSALEV KNFTTNSRQE
     TENEDSGERH TLSPATAVGK PLSQSVADQM HKTYIITGGT GGLGLVVAQW LIEEGAHNIV
     LLSRRGEPPS SVKAGDPVWK YLCGEEQPNC ISVATLKCDV SQKDDLLRVF KEIESRGLPP
     VAGIFHAAGV TADAALASQT ADSVDQVYLP KAIGAWNLHD ACEALGLNKS LEVFMMFSSV
     AALLGNFGQA NYSAANACLD ALAMYRQSLG LCGQSIQWGP WIEQGMAAQL TQHLEKVGMR
     GITNELGLRV LGDVMMHPTV AVVGAQSLQW RKFLRRYAFE MPSFFSLVPM AGAGSSEASV
     NLATITKEDL CELISSLAQA VSGAAEKPAV DTPLMDLGLD SLGAVEFRNS VADKVGVKLP
     QNLMFENPSI SSISDYILDK AAGKHGESGV TGANGGSGEG SDIPALDTPL DQWLMSVLEP
     AERFALYIDS FASEYGTLQK MAAEEDIITA LEDLGVEDGG DFERLHVAWD DLCNQINTAK
     RMAAKDIRPS STAVSVSDHP KPRMPHPMED VDTLLASLDF DVSTLRPATA PDKVKNVLLT
     GVTGFVGRVQ LASLMQLKQR PDLRVYCLVR ARNADHALSR IREATKEAKC WQETFVSRIV
     PVTGDFTQPL LGLSSEEFQE LARKIDIVYH TGGDVNLLSN YGRLRATNVL SVKAIIDLCT
     TYKVKPLHFA STLGQFPAFF AMFTGHFSGE SIREDSTPDV SQMENFYPPS RQGYPWSKWA
     AERVLEGARK RGLPVAIYRL PNTYVAYKTG YTNKTDYATA LMIASIQEGV FPIGAATAPL
     TPVDTICDML VEASFLEKPK HWVYHLFDPR LVTRADLEQW SQELGISYVG VKVDEFLEAI
     KKRGPESPVF KFVPLMQHWR RFWFDPEERT ESFPISNEHI FEDLPHMRWP SLREVFKNSL
     LYSVRMGFFP KNSKSIILDP QVALNEARRI CGLQLMGREG EDFFLHPWRV LQESARRECD
     LLFGGQLAVY RTVRHYFMNV MYLADAQAKC PEIEKQEIQA PLIIVGLNRT GTTFLQNLMS
     TDPSNRSLRY CEMIAPYGPD GNYRPKGLPN TEESWKKDPR IPFAQEILDS QLGLSEEWMA
     IHTQRAELPE EDFVIFEHCG RCYSICTEFS VPSYREWLAS DNYKETRKAY SFHKRFLQHL
     QWQRPAKRWL LKMPFHLFTL EALFETYPDA KIIFMHRDPK ETMGSWASLV KHAQQSLMDN
     VDPSALGREE LRAMSTMINR ALAFRRSHPE LGPRFLDVQY QDLVNNPIGV VKSIYKHFGI
     PLTLAAQTGI ASFCNENRKH RDKLTKHMYS LKDVDLTDGM VQEAFKEYYD SGLCKYLSSA
     R
//
DBGET integrated database retrieval system