ID A0A0B4KFP4_DROME Unreviewed; 1898 AA.
AC A0A0B4KFP4;
DT 01-APR-2015, integrated into UniProtKB/TrEMBL.
DT 01-APR-2015, sequence version 1.
DT 24-JAN-2024, entry version 62.
DE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-79 specific {ECO:0000256|ARBA:ARBA00020987, ECO:0000256|RuleBase:RU271113};
DE EC=2.1.1.360 {ECO:0000256|ARBA:ARBA00012190, ECO:0000256|RuleBase:RU271113};
DE AltName: Full=Histone H3-K79 methyltransferase {ECO:0000256|ARBA:ARBA00029821, ECO:0000256|RuleBase:RU271113};
GN Name=gpp {ECO:0000313|EMBL:AGB95699.1,
GN ECO:0000313|FlyBase:FBgn0264495};
GN Synonyms=0732/14 {ECO:0000313|EMBL:AGB95699.1}, anon-WO0118547.319
GN {ECO:0000313|EMBL:AGB95699.1}, BcDNA:SD05230
GN {ECO:0000313|EMBL:AGB95699.1}, CG10272 {ECO:0000313|EMBL:AGB95699.1},
GN CG33324 {ECO:0000313|EMBL:AGB95699.1}, dDot1
GN {ECO:0000313|EMBL:AGB95699.1}, dDOT1L {ECO:0000313|EMBL:AGB95699.1},
GN Dmel\CG42803 {ECO:0000313|EMBL:AGB95699.1}, DOT1L
GN {ECO:0000313|EMBL:AGB95699.1}, Gpp {ECO:0000313|EMBL:AGB95699.1}, grp
GN {ECO:0000313|EMBL:AGB95699.1}, l(3)03342
GN {ECO:0000313|EMBL:AGB95699.1}, l(3)S073214b
GN {ECO:0000313|EMBL:AGB95699.1}, l(3)sA3484
GN {ECO:0000313|EMBL:AGB95699.1};
GN ORFNames=CG42803 {ECO:0000313|EMBL:AGB95699.1,
GN ECO:0000313|FlyBase:FBgn0264495}, Dmel_CG42803
GN {ECO:0000313|EMBL:AGB95699.1};
OS Drosophila melanogaster (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila; Sophophora.
OX NCBI_TaxID=7227 {ECO:0000313|EMBL:AGB95699.1, ECO:0000313|Proteomes:UP000000803};
RN [1] {ECO:0000313|EMBL:AGB95699.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=10731132; DOI=10.1126/science.287.5461.2185;
RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D.,
RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F.,
RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N.,
RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., Brandon R.C.,
RA Rogers Y.H., Blazej R.G., Champe M., Pfeiffer B.D., Wan K.H., Doyle C.,
RA Baxter E.G., Helt G., Nelson C.R., Gabor G.L., Abril J.F., Agbayani A.,
RA An H.J., Andrews-Pfannkoch C., Baldwin D., Ballew R.M., Basu A.,
RA Baxendale J., Bayraktaroglu L., Beasley E.M., Beeson K.Y., Benos P.V.,
RA Berman B.P., Bhandari D., Bolshakov S., Borkova D., Botchan M.R., Bouck J.,
RA Brokstein P., Brottier P., Burtis K.C., Busam D.A., Butler H., Cadieu E.,
RA Center A., Chandra I., Cherry J.M., Cawley S., Dahlke C., Davenport L.B.,
RA Davies P., de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I.,
RA Dietz S.M., Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C.,
RA Dunn P., Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S.,
RA Fleischmann W., Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M.,
RA Glasser K., Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M.,
RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., Hostin D.,
RA Houston K.A., Howland T.J., Wei M.H., Ibegwam C., Jalali M., Kalush F.,
RA Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., Kimmel B.E., Kodira C.D.,
RA Kraft C., Kravitz S., Kulp D., Lai Z., Lasko P., Lei Y., Levitsky A.A.,
RA Li J., Li Z., Liang Y., Lin X., Liu X., Mattei B., McIntosh T.C.,
RA McLeod M.P., McPherson D., Merkulov G., Milshina N.V., Mobarry C.,
RA Morris J., Moshrefi A., Mount S.M., Moy M., Murphy B., Murphy L.,
RA Muzny D.M., Nelson D.L., Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R.,
RA Pacleb J.M., Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V.,
RA Reese M.G., Reinert K., Remington K., Saunders R.D., Scheeler F., Shen H.,
RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., Spier E.,
RA Spradling A.C., Stapleton M., Strong R., Sun E., Svirskas R., Tector C.,
RA Turner R., Venter E., Wang A.H., Wang X., Wang Z.Y., Wassarman D.A.,
RA Weinstock G.M., Weissenbach J., Williams S.M., WoodageT, Worley K.C.,
RA Wu D., Yang S., Yao Q.A., Ye J., Yeh R.F., Zaveri J.S., Zhan M., Zhang G.,
RA Zhao Q., Zheng L., Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S.,
RA Zhu X., Smith H.O., Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.;
RT "The genome sequence of Drosophila melanogaster.";
RL Science 287:2185-2195(2000).
RN [2] {ECO:0000313|EMBL:AGB95699.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537568;
RA Celniker S.E., Wheeler D.A., Kronmiller B., Carlson J.W., Halpern A.,
RA Patel S., Adams M., Champe M., Dugan S.P., Frise E., Hodgson A.,
RA George R.A., Hoskins R.A., Laverty T., Muzny D.M., Nelson C.R.,
RA Pacleb J.M., Park S., Pfeiffer B.D., Richards S., Sodergren E.J.,
RA Svirskas R., Tabor P.E., Wan K., Stapleton M., Sutton G.G., Venter C.,
RA Weinstock G., Scherer S.E., Myers E.W., Gibbs R.A., Rubin G.M.;
RT "Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster
RT euchromatic genome sequence.";
RL Genome Biol. 3:RESEARCH0079-RESEARCH0079(2002).
RN [3] {ECO:0000313|EMBL:AGB95699.1, ECO:0000313|Proteomes:UP000000803}
RP GENOME REANNOTATION.
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537572; DOI=10.1186/gb-2002-3-12-research0083;
RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S.,
RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E.,
RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P.,
RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A.,
RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., Stapleton M.,
RA Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., Lewis S.E.;
RT "Annotation of the Drosophila melanogaster euchromatic genome: a systematic
RT review.";
RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002).
RN [4] {ECO:0000313|EMBL:AGB95699.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537573;
RA Kaminker J.S., Bergman C.M., Kronmiller B., Carlson J., Svirskas R.,
RA Patel S., Frise E., Wheeler D.A., Lewis S.E., Rubin G.M., Ashburner M.,
RA Celniker S.E.;
RT "The transposable elements of the Drosophila melanogaster euchromatin: a
RT genomics perspective.";
RL Genome Biol. 3:RESEARCH0084.1-RESEARCH0084.20(2002).
RN [5] {ECO:0000313|EMBL:AGB95699.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=12537574;
RA Hoskins R.A., Smith C.D., Carlson J.W., Carvalho A.B., Halpern A.,
RA Kaminker J.S., Kennedy C., Mungall C.J., Sullivan B.A., Sutton G.G.,
RA Yasuhara J.C., Wakimoto B.T., Myers E.W., Celniker S.E., Rubin G.M.,
RA Karpen G.H.;
RT "Heterochromatic sequences in a Drosophila whole-genome shotgun assembly.";
RL Genome Biol. 3:RESEARCH0085-RESEARCH0085(2002).
RN [6] {ECO:0000313|EMBL:AGB95699.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=16110336; DOI=10.1371/journal.pcbi.0010022;
RA Quesneville H., Bergman C.M., Andrieu O., Autard D., Nouaud D.,
RA Ashburner M., Anxolabehere D.;
RT "Combined evidence annotation of transposable elements in genome
RT sequences.";
RL PLoS Comput. Biol. 1:166-175(2005).
RN [7] {ECO:0000313|EMBL:AGB95699.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569856; DOI=10.1126/science.1139815;
RA Smith C.D., Shu S., Mungall C.J., Karpen G.H.;
RT "The Release 5.1 annotation of Drosophila melanogaster heterochromatin.";
RL Science 316:1586-1591(2007).
RN [8] {ECO:0000313|EMBL:AGB95699.1, ECO:0000313|Proteomes:UP000000803}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Berkeley {ECO:0000313|Proteomes:UP000000803};
RX PubMed=17569867; DOI=10.1126/science.1139816;
RA Hoskins R.A., Carlson J.W., Kennedy C., Acevedo D., Evans-Holm M.,
RA Frise E., Wan K.H., Park S., Mendez-Lago M., Rossi F., Villasante A.,
RA Dimitri P., Karpen G.H., Celniker S.E.;
RT "Sequence finishing and mapping of Drosophila melanogaster
RT heterochromatin.";
RL Science 316:1625-1628(2007).
CC -!- FUNCTION: Histone methyltransferase that specifically trimethylates
CC histone H3 to form H3K79me3. This methylation is required for telomere
CC silencing and for the pachytene checkpoint during the meiotic cell
CC cycle by allowing the recruitment of RAD9 to double strand breaks.
CC Nucleosomes are preferred as substrate compared to free histone.
CC {ECO:0000256|RuleBase:RU271113}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=L-lysyl(79)-[histone H3] + 3 S-adenosyl-L-methionine = 3 H(+)
CC + N(6),N(6),N(6)-trimethyl-L-lysyl(79)-[histone H3] + 3 S-adenosyl-L-
CC homocysteine; Xref=Rhea:RHEA:60328, Rhea:RHEA-COMP:15549, Rhea:RHEA-
CC COMP:15552, ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC ChEBI:CHEBI:59789, ChEBI:CHEBI:61961; EC=2.1.1.360;
CC Evidence={ECO:0000256|ARBA:ARBA00001569,
CC ECO:0000256|RuleBase:RU271113};
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU271113}.
CC -!- MISCELLANEOUS: In contrast to other lysine histone methyltransferases,
CC it does not contain a SET domain, suggesting the existence of another
CC mechanism for methylation of lysine residues of histones.
CC {ECO:0000256|RuleBase:RU271113}.
CC -!- SIMILARITY: Belongs to the class I-like SAM-binding methyltransferase
CC superfamily. DOT1 family. {ECO:0000256|RuleBase:RU271113}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AE014297; AGB95699.1; -; Genomic_DNA.
DR RefSeq; NP_001262316.1; NM_001275387.1.
DR SMR; A0A0B4KFP4; -.
DR EnsemblMetazoa; FBtr0334322; FBpp0306435; FBgn0264495.
DR GeneID; 40793; -.
DR AGR; FB:FBgn0264495; -.
DR CTD; 40793; -.
DR FlyBase; FBgn0264495; gpp.
DR VEuPathDB; VectorBase:FBgn0264495; -.
DR GeneTree; ENSGT00390000013515; -.
DR BioGRID-ORCS; 40793; 0 hits in 1 CRISPR screen.
DR ChiTaRS; gpp; fly.
DR GenomeRNAi; 40793; -.
DR Proteomes; UP000000803; Chromosome 3R.
DR Bgee; FBgn0264495; Expressed in brain and 21 other cell types or tissues.
DR ExpressionAtlas; A0A0B4KFP4; baseline and differential.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0140956; F:histone H3K79 trimethyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0031507; P:heterochromatin formation; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR GO; GO:0051726; P:regulation of cell cycle; IEA:InterPro.
DR CDD; cd20902; CC_DOT1L; 1.
DR Gene3D; 1.10.260.60; -; 1.
DR Gene3D; 3.40.50.150; Vaccinia Virus protein VP39; 1.
DR InterPro; IPR025789; DOT1_dom.
DR InterPro; IPR021169; DOT1L/grappa.
DR InterPro; IPR030445; H3-K79_meTrfase.
DR InterPro; IPR029063; SAM-dependent_MTases_sf.
DR PANTHER; PTHR21451; HISTONE H3 METHYLTRANSFERASE; 1.
DR PANTHER; PTHR21451:SF0; HISTONE-LYSINE N-METHYLTRANSFERASE, H3 LYSINE-79 SPECIFIC; 1.
DR Pfam; PF08123; DOT1; 1.
DR PIRSF; PIRSF037123; Histone_H3-K79_MeTrfase_met; 1.
DR SUPFAM; SSF53335; S-adenosyl-L-methionine-dependent methyltransferases; 1.
DR PROSITE; PS51569; DOT1; 1.
PE 3: Inferred from homology;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853,
KW ECO:0000256|RuleBase:RU271113}; Coiled coil {ECO:0000256|SAM:Coils};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603,
KW ECO:0000256|RuleBase:RU271113};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU271113};
KW Reference proteome {ECO:0000313|Proteomes:UP000000803};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691,
KW ECO:0000256|RuleBase:RU271113};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000256|RuleBase:RU271113}.
FT DOMAIN 19..336
FT /note="DOT1"
FT /evidence="ECO:0000259|PROSITE:PS51569"
FT REGION 338..537
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 558..593
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 886..908
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 960..996
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1033..1075
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1165..1190
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1221..1333
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1345..1374
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1432..1463
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1486..1508
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1529..1559
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1573..1604
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1637..1714
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1731..1757
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1770..1794
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1848..1898
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 682..716
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 338..366
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 383..410
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 420..489
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 496..523
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 890..906
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1033..1050
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1169..1190
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1236..1259
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1275..1289
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1297..1332
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1531..1550
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1686..1708
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1875..1898
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT BINDING 142..145
FT /ligand="S-adenosyl-L-methionine"
FT /ligand_id="ChEBI:CHEBI:59789"
FT /evidence="ECO:0000256|PIRSR:PIRSR037123-1"
FT BINDING 165..174
FT /ligand="S-adenosyl-L-methionine"
FT /ligand_id="ChEBI:CHEBI:59789"
FT /evidence="ECO:0000256|PIRSR:PIRSR037123-1"
FT BINDING 192
FT /ligand="S-adenosyl-L-methionine"
FT /ligand_id="ChEBI:CHEBI:59789"
FT /evidence="ECO:0000256|PIRSR:PIRSR037123-1"
FT BINDING 228..229
FT /ligand="S-adenosyl-L-methionine"
FT /ligand_id="ChEBI:CHEBI:59789"
FT /evidence="ECO:0000256|PIRSR:PIRSR037123-1"
SQ SEQUENCE 1898 AA; 206534 MW; B821424B297F3710 CRC64;
MATPQVKDLV LRSPAGSSDV ISFAWPLQIG HGQDKHDNGI DIIDTIKFVC DELPSMSSAF
EETNLHQIDT ACYKTMTGLV DRFNKAVDSI VALEKGTSLP AERLNKFAHP SLLRHILQLV
YNAAVLDPDK LNQYEPFSPE VYGETSYELV QQMLKHVTVS KEDTFIDLGS GVGQVVLQMA
GSFPLKTCIG IEKADTPARY AERMDVIFRQ YMGWFGKRFC EYKLIKGDFL VDEHRENITS
STLVFVNNFA FGPTVDHQLK ERFADLRDGA RVVSSKSFCP LNFRITDRNL SDIGTIMHVS
EIPPLKGSVS WTCKPVSYYL HVIDRTILER YFQRLKTKGG NDHESVGTVR TTRDRAKREA
NVGQHHHNNH HSNNHANSNN HQRDREQSNG ATATAAHQQR HQSQSPANVS GAGIVLAASG
QQAASKTRQQ LQHQHNQQQR SLDMESSTES DGDATNGNGG NTTTATNTTS ASNGPMTRKV
WSDWCSSKGK SSQSDDEENN NSNSNGGSNG GSIGGGSVGR QARATTQKKR KKLTRKAAIA
SKSAAAAQRE AEAAAAAAVS VPSKESSSKE DPPRAASAGP GRKGRMKKGA RGRKSLKIVG
LEALHKQTVL STSLDTMTKK LPAAPGTVDQ QLTALLTENM SHAELDIPTA PQDTPYALQI
LLDVFRSQYT SMIEHMKSSA YVPQVQKQIA QEQERMARLK NRASQLDKQI KVLIDDSVAL
LKVRMNELGI HVNSPNDLIA QAKEIVGRHK DLQHTASRMR NEVTFYEGEQ KLLLNKQLKN
LPEYQKLCGT VNGKVKLEVP PELSETTAQE LVLKEIANTL SQRKKLYAQV STIEQETSVL
QKTAEERSTA ATLLAQGTNM IVSTGSSSSS STTVCASAVT AQSNKLNSVK NSRRNREHRA
RSQEWPEVPE VGKIQESNPE VLAQKIVETC RQIEAGKFQG AGAPSSQVNG KNKAIIEVPP
PPATAPVSIK SSPGHHYKDT TLMPAPKQQQ QQQMTLSQLP KCELPGLSTS RKQESPKVAN
FEDRLKSIIT TALNEDQEQR SKAVESSPSP SPLHSPAPKR SKQHPAGAIN PAQSLPNNLH
NIITVSTQGL MHLNANTTIS PITPPLPGPG AGATASTAPP PPANLPYGAY GGAVAKTTIS
GKYQAAKEPK YSPVRQAPLP PPPSHMASLY PAGQQTTPAD LGYQRRRSSV SATSYEHYMV
QQQQQLQQQQ LMLAAAAHAA QRQQMRVEEQ QQQQQHQHHH HHHHHHPQHR LPQHVQHQHP
HQHHPNEFKA PPADSHLQRS SSREQLIVEP PQTQPLELLP RASSANSDYS GYRIRPPSRP
SSNSSQPDYT QVSPAKMALR RHLSQEKLSQ HVTPQATPPL PGHGGAPTSG KTIGDLVNGE
IERTLEISHQ SIINAAVNMS TSGASFMERA FLNERSNDRL LINLNAQRPE RVHVRPLSEE
SQDPQPTSYA QERGPGLGAG GAAAGGNSNL ATLAHVAYAQ KAQGGARANA GTAPPATHSS
SARSGRDYQP VALPRAELKG SIEAYFHEEQ QQKQSKGAGS AGSSSLRGPR LNGANPPLEG
LAASLQDHVR ARKYKEETEE RQRRAAAAAS SSAGPPAGME LPTHYAHQAP PAHSYHHHGA
SINGTPHKVE LGIKRSSPLA PHQQPPRPSK LAHYEPPTTQ QQHAHAHLYA NGQVLPPPPA
HDATTPSPTP SSSSSSCGRR SNSNNGKLLV DPPLLMSPEI NSLLGDERPL QLSHHQQQQQ
QMLHHHQSQQ QQHLQLTQQQ LRVAHLGHGL SHGHSTMPTL GGQRNGNGNA ADDGKLYVDQ
AKNSTRVSVP KYAPQQAPPP AHVAFPLEPH ILHQHKYQRL ATQTVAEDLT MKRKEQESDS
ELEPEVVAAF VEDGGSHSGS SSSSSSSHSG SDSSSDVP
//