ID A0A286Y679_CAVPO Unreviewed; 4925 AA.
AC A0A286Y679;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=[histone H3]-lysine(4) N-methyltransferase {ECO:0000256|ARBA:ARBA00023620};
DE EC=2.1.1.364 {ECO:0000256|ARBA:ARBA00023620};
GN Name=KMT2C {ECO:0000313|Ensembl:ENSCPOP00000033098.1};
OS Cavia porcellus (Guinea pig).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Hystricomorpha; Caviidae;
OC Cavia.
OX NCBI_TaxID=10141 {ECO:0000313|Ensembl:ENSCPOP00000033098.1, ECO:0000313|Proteomes:UP000005447};
RN [1] {ECO:0000313|Proteomes:UP000005447}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2N {ECO:0000313|Proteomes:UP000005447};
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSCPOP00000033098.1}
RP IDENTIFICATION.
RC STRAIN=2N {ECO:0000313|Ensembl:ENSCPOP00000033098.1};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=L-lysyl(4)-[histone H3] + S-adenosyl-L-methionine = H(+) +
CC N(6)-methyl-L-lysyl(4)-[histone H3] + S-adenosyl-L-homocysteine;
CC Xref=Rhea:RHEA:60264, Rhea:RHEA-COMP:15543, Rhea:RHEA-COMP:15547,
CC ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC ChEBI:CHEBI:59789, ChEBI:CHEBI:61929; EC=2.1.1.364;
CC Evidence={ECO:0000256|ARBA:ARBA00024515};
CC PhysiologicalDirection=left-to-right; Xref=Rhea:RHEA:60265;
CC Evidence={ECO:0000256|ARBA:ARBA00024515};
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAKN02032938; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02032939; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02032940; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02032941; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AAKN02032942; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR STRING; 10141.ENSCPOP00000033098; -.
DR Ensembl; ENSCPOT00000045682.1; ENSCPOP00000033098.1; ENSCPOG00000001857.4.
DR VEuPathDB; HostDB:ENSCPOG00000001857; -.
DR GeneTree; ENSGT00940000155281; -.
DR InParanoid; A0A286Y679; -.
DR OMA; MYCKHLE; -.
DR Proteomes; UP000005447; Unassembled WGS sequence.
DR Bgee; ENSCPOG00000001857; Expressed in cerebellum and 12 other cell types or tissues.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0044666; C:MLL3/4 complex; IEA:Ensembl.
DR GO; GO:0140999; F:histone H3K4 trimethyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd15696; ePHD1_KMT2C; 1.
DR CDD; cd15697; ePHD2_KMT2C; 1.
DR CDD; cd22026; HMG-box_KMT2C; 1.
DR CDD; cd15509; PHD1_KMT2C_like; 1.
DR CDD; cd15594; PHD2_KMT2C; 1.
DR CDD; cd15511; PHD3_KMT2C; 1.
DR CDD; cd15513; PHD5_KMT2C_like; 1.
DR CDD; cd15600; PHD6_KMT2C; 1.
DR CDD; cd19171; SET_KMT2C_2D; 1.
DR Gene3D; 3.30.160.360; -; 1.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 7.
DR InterPro; IPR034732; EPHD.
DR InterPro; IPR003889; FYrich_C.
DR InterPro; IPR003888; FYrich_N.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR041967; KMT2C_ePHD1.
DR InterPro; IPR041968; KMT2C_ePHD2.
DR InterPro; IPR047004; KMT2C_PHD2.
DR InterPro; IPR047005; KMT2C_PHD6.
DR InterPro; IPR037877; PHD3_KMT2C.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR001841; Znf_RING.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR45888:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE 2C; 1.
DR PANTHER; PTHR45888; HL01030P-RELATED; 1.
DR Pfam; PF05965; FYRC; 1.
DR Pfam; PF05964; FYRN; 1.
DR Pfam; PF00628; PHD; 4.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF13771; zf-HC5HC2H; 1.
DR Pfam; PF13832; zf-HC5HC2H_2; 1.
DR SMART; SM00542; FYRC; 1.
DR SMART; SM00541; FYRN; 1.
DR SMART; SM00249; PHD; 8.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 6.
DR SUPFAM; SSF47095; HMG-box; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50216; DHHC; 1.
DR PROSITE; PS51805; EPHD; 2.
DR PROSITE; PS51543; FYRC; 1.
DR PROSITE; PS51542; FYRN; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS50016; ZF_PHD_2; 6.
DR PROSITE; PS50089; ZF_RING_2; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Methylation {ECO:0000256|ARBA:ARBA00022481};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000005447};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00175}.
FT DOMAIN 159..263
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS51805"
FT DOMAIN 273..323
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 276..321
FT /note="RING-type"
FT /evidence="ECO:0000259|PROSITE:PS50089"
FT DOMAIN 320..370
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 396..452
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 884..937
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 934..984
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 1011..1066
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 4413..4521
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS51805"
FT DOMAIN 4785..4901
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 4909..4925
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 20..44
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 97..128
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 755..787
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 812..836
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1144..1251
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1406..1425
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1545..1571
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1649..1709
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1725..2367
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2539..2588
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2671..2690
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2778..2827
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2878..2911
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3161..3187
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3223..3376
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3483..3531
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3566..3723
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 3738..3841
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4035..4080
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4156..4180
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 97..120
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 769..785
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1152..1199
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1410..1425
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1546..1571
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1649..1670
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1671..1698
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1725..1793
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1813..1845
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1884..1916
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2006..2020
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2062..2082
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2091..2112
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2113..2132
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2133..2155
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2157..2176
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2255..2275
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2282..2330
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2340..2354
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2569..2588
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2779..2804
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2878..2894
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3223..3242
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3248..3270
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3271..3289
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3329..3345
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3358..3376
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3483..3515
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3588..3611
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3612..3626
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3635..3658
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3701..3718
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3740..3755
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3756..3772
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3799..3834
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4925 AA; 543200 MW; 843B0F19AB5D77A9 CRC64;
MDGLETTETE NIVETVEIKE HSAEEDAEAE VDSSKQPVPT LQQPASEESV NFLVSVGVEA
KISEQLCAFC YCGEKSSLGQ GDLKQFRVTP GFILPWQSQP SNKDIDDSSS GTYERVQNST
PRRQRGQRKE RALQQNMVSC MSVSTQTVAD DQAGKLWDEL SLVGLPDAID VQALFDSAGT
CWAHHRCVEW SLGVCQVEEP LLVNVDKAVV SGSTERCAFC KHLGATIKCC EEKCTQMYHY
PCAAGVGTFQ DFSHFFLLCP EHIDQAPERS KEDANCAVCD SPGDLLDQFF CTTCGQHYHG
MCLDIAVTPL KRAGWQCPEC KVCQNCKQSG EDSKMLVCDT CDKGYHTFCL QPVMKSVPTN
GWKCKNCRIC VECGTRSSSQ WHHSCLVCDA CYQQQDNLCP FCGKCYHPEL QKDMLHCNIC
KRWVHLECDK PTDHELDSQL KEEYICMYCK HLGAEMDPLQ PGDEMEMAEP ATDSNNEMEV
GGTEEQMVFL EQGVNKDVSD QESVPGIDPD VQVIHTEEQQ KNKLPEGIDK DCLLLSETSP
NKVNSELENE LSCGVVCEKM DLPSKRTHLH DKDEKEDKME VAVNIDAPTY QIVVQQELQL
LPDPQGVICK EAAQPLAAGV ESVTVPPVAL VSPSEESTSC SKEQLITERL HEEIEQKENS
EFSTEFMDFE MTSAVESCVK DGLCLGDKSL KLPTEVESSF SPDIGKTNVS SSPTLRLDLP
SHNMLHSYSS TLNSSGNIMP TTYISVTPKI GMGKPAITKR KFSPGRPRSK QGAWSTHNTV
SPPPWSPDIS EGREIFKPRQ LPGSAIWSIK VGRGSGFPGK RRPRGAGLSG RGGRGRSKLK
NGIGAVVLPG VSAADISSNK DEEENSMHNT VVLFSSSDKF TLHQDMCVVC GSFGQGAEGR
LLACSQCGQC YHPYCVSIKI TKVVLSKGWR CLECTVCEAC GKATDPGRLL LCDDCDISYH
TYCLDPPLQT VPKGGWKCKW CVWCRHCGAT SAGLRCEWQN NYTQCAPCAS LSSCPVCYRN
YREDDLILQC RQCDRWMHAV CQNLNTEEEV ENVADIGFDC SMCRPYIPAS NVPSSECCES
SLVAQIITKV KELDPPKTYT QDGVCLTESG MTQLQSLTVT VPRRKRSKPK LKLKIINQNS
VAVLQTPPDI QSEHSRDGEM DDSREAELMD CDGKSESSPE REAVDEETKG VEGTDGVKKR
KRKPYRPGIG GFMVRQRSRT GQGKTKRCVI RKDSSGSISE QLPSRDDGWG EQLPDTLIDE
SVSVAENTEK IKKRYRKRKN KLEETFPAYL QEAFFGKDLL DTSRQNKLNL DNVSEDAAQL
MYKTNTNTGF LEPSFDPLLN SSTPAKPGTQ GTADDPLADI SEVLNTDDDI LGIISDDLAK
SVSHSGLDLC TFQVESSPFP FDIGSIADDP SSLPQPSVSQ TSRPLTEEQL DGILSPELDK
MVTDGAILGK LYKIPELGGK DVEDLFTAVL SPVNTQPAPL PQPPPPPQLL PMHNPDVFSR
MPLMNGLIGP SPHLPHNSLP PGSGLGTFSA IQSHYTDARD KNAAFNPITN DPNSSWTPSA
PSVEGENDTM SNAQRSTLKW EKEEALGEMA TVAPVLYTNI NFPNLKVEFP DWTTRVKQIA
KLWRKASSQE RAPYVQKARD NRAALRINKV QMSNDSMKRQ QQQDSTDSSS RIDSDLFKDP
LKQRESEHEQ EWKFRQQMRQ KSKQQAKIEA TQKLEQVKNE QQQQQQQQQQ QQQQQQQQQH
GSQHLLMPSG SDTPSSGVQS PLTPQPSNGN MSPAQTFHKD LFSKQQPSAP ASASSDAVFV
KPQAPPPPST TSRVPVQDGL SQSQMSQPHS PQMFSPGSSN SRPPSPIDPY AKMVGTPRPP
PGGHSFCRRN SVAPVENCVP LSSVPRPIQM SETPANRPSP SRDLCSSSTT NNDPYAKPPD
TPRPMVTDQF PKPLGLPRSP MVSEQTAKGP PAAGTGDHFT KPSPRGDTFQ RQRIPDPYAR
PLLASAPADS SSGPFKTPLH PPPSSQDPYG SMSQASRRLS VDTYERPPLT PRPVDNFPHS
QSNDPYSQPP HIPHPAMNES FAHSSRAFSQ PGTISRSAPQ DPYSQPPGTP RPVVDSYSQP
SGTTRSNLDP YSQPPGTPRP TTVDPYSQQP PTPRPSIQTD MFVTSAANQR HSDPYSHPPG
TPRPGISTPY SQPPATPRPR TSEGFTRPSG ARPALVPNRD PFLQAAQHRA PALPSSLVRP
PDMCSQTPRP PGSGLPDTFS RVSPSAARDP YDQPPMTPRS QSDSFGSSQV TRDITDQAGA
GPEEGFNTSA NSAVSSQGPQ FSSIAQAPGP VSTSGGTDTQ NTVNMSQADT EKLRQRQKLR
EIILQQQQQK KIASRQEKGP QDTPAVPHPV PLPNWQPESI SQAFSRPPPP YPGNIRSPMV
PLPGPRYAVF PKDQRGPYPP DVAGMGMRPH GFRFGFSGSS HGTMSGQDRF LVPPQQIQGS
GIPPHLRRSV SVEMPRPLNN SQMNNPVGLP QHFPPQSLPV QQHNILGQAF IELRHRAPDG
RPQHPFPTTP GNVIEAPSHP RHGNFIPRPD FPGLKHTEPM RRPHQGLPSQ LSMHPNLEQV
PSQQEQGHPL HASSVVMRHM SHSLGTGFSE APLSTSVSAE AAPDNLQITS QASDSLEEKL
DSDDPSVKDL DVKDLEGVEV KDLDDEDLEN LNLDTEDGKG DELDTLDNLE TNDPNLDDLL
RSGEFDIIAY TDPELDLGDK KSMFNEELDL NVPIDDKLDN QCVSVEPPQK EQEDKIVVLS
ENHLPQKKSN IVSEIKTEVL SPDSKEEAKC ETEKNEKSDE IGDHVEPPCT QASAHTDLSD
GEKACLPPCE PELLEKRANR ETDDSSASII QGSTPLAAQD IVNSCDITGS TPVLSSLLAS
EKSDSSDNRS LGSPPPTVPA SPSNHVSSLP SALMAPPGPV LDNAMNSNVT VVSRVNHTFS
QGVQVNPGFI QGQSTVNHSL GKGKPTNQCA PVTSQSGSSA ISGSQQLMMP QTLGHQNRER
PLLLEEQPLL LQDLLDQERQ EQQQQRQMQA MIRQRSEPFF PNIDFDAITD PIMKAKMVAL
KGINKVMAQN NLGMPPMVMS RFPFMGPATA GAQNSEGQTP MPQTVTQDGS ITHQISRPNP
PNFGPGFVND SQRKQYEEWL QETQQLLQMQ QKYLEEQIGA HRKSKKALSA KQRTAKKAGR
EFPEEDAEQL KHVTEQQSMV QKQLEQIRKQ QKEHAELIED YRIKQQQQQQ QQQQQQQQQR
AMAPPLLMPG IQPQPPRLPG ATPPAINQPS FPMVPQQLQH QQHTAALSGH NSPARMPGLP
GWQSASAPAH LPLNPPRIQP PMAQLPVKAC TPTTGTVSNA NPQSGPPPRV EFDDNNPFSE
SFQERERKER LREQQERQRI QLMHEVDRQR ALQQRMEMEQ HGLIGSEVGN RTPVSQIPFY
GSDRPCDFMQ PPRPLQQSPQ HQQQMALVLQ QQNVQQGSVN SSPTQTFMQT NDRRQGALTS
FIPDSSSVPS GSPNFHSAKQ GHGNLSGTSF QQSPLRPPFA PALPTTSPVA NSSLPCGQDL
AVAHGQSYGS SQSLIQLYSD IIPEEKGKKK RTRKKKKEDD VESTKAPLTP HSDITAPLTP
SISETTSTPT VNTPNDPPPQ SEPEPAEPAG QPGTDLENTL PSADLSLETP NQEACANSEA
KTLPMEMPAK NEELKLENTE TESGLGQEET ESEEQTGCKA QDKSEASPVS SVQSPLDSVG
APVTKADLGN ELLKHLLKNK KSSSLLNQRS EGSFCSEDNC TKDSKTAEKQ NPEERMQTLG
VHVQGGFGCG NNQPKMDGGS ETKKQRNKRT QRTGEKAAPR SKKRKKDEEE KQPMYPNSDS
FTHLKQQLSL LPLMEPIIGV NFAHFLPYGS GQFNSGNRLL GTFGSATLEG VSDYYSQLIY
KQNNLSNPPT PPASLPPTPP PMACQKMANG FATTEELAGK AGVLVSHEVT KTLGPKPFPL
PFRPQDDLLA RAVAQGPKTV DVPASLPTPP HNNHEEIRIQ DHCGDRDTPD SFVPSSSPES
VVGVEVSRYP DLSVVKEEPP EPVPSPIIPI LPSTTGKSSE SRRNDIKTEP GTLFFTSPFG
SSSNGPRSGL ISVAITLHPT AAENINSVVA AFSDLLHVRI PNSYEVSSAP DVPPMGSTMG
LVSSPRVNPG LEYRPHLHLR GPPPGSANPP RLATSYRLKQ PNVPFPPASS GLPGYKDPSH
GAAGKAALRP QWCCHCKVVI FGSGVRKSFK DLAFMNKDSR ESNKRMEKDI VFCSNNCFIL
YSTAQSKNPE SKESLPPLPQ SPMREMPSRA FHQYSNNISA LDVHCLPQLQ EKASPPPSPP
IAFPPALEAA QVEAKPDELK VTVKLKPRLR TVHSGLDDCR PLTKKWRGMK WKKWSIHIVI
PKGTFKPPCE DEIDEFLKKL GTSLKPDPVP RDYRKCCFCH EEGDGLTDGP ARLLNLDLDL
WVHLNCALWS TEVYETQAGA LINVELALRR GLQMKCVFCH KTGATSGCHR FRCTNIYHFT
CAIKAQCMFF KDKTMLCPMH KPKGIHEQEL SYFAVFRRVY VQRDEVRQIA SIVQRGERDH
TFRVGSLIFH TIGQLLPQQM QAFHSTKALF PVGYEASRLY WSTRYANRRC RYLCSIEEKD
GRPVFVIRIV EQGHEDLVLS DTSPKGVWDK ILEPVACVRK KSEMLQLFPA YLKGEDLFGL
TVSAVARIAE SLPGVEACEN YTFRYGRNPL MELPLAVNPT GCARSEPKMS AHVKRFVLRP
HTLNSTSTSK SFQSTVTGEL NAPYSKQFVH SKSSQYRRMK TEWKSNVYLA RSRIQGLGLY
AARDIEKHTM VIEYIGTIIR NEVANRKEKL YESQNRGVYM FRMDNDHVID ATLTGGPARY
INHSCAPNCV AEVVTFERGH KIIISSNRRI QTGEELCYDY KFDFEDDQHK IPCHCGAVNC
RKWMN
//