ID A0A3P9DHY1_9CICH Unreviewed; 918 AA.
AC A0A3P9DHY1;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE RecName: Full=[Histone H3]-lysine(4) N-trimethyltransferase {ECO:0008006|Google:ProtNLM};
OS Maylandia zebra (zebra mbuna).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Haplochromini; Maylandia; Maylandia zebra complex.
OX NCBI_TaxID=106582 {ECO:0000313|Ensembl:ENSMZEP00005033994.1, ECO:0000313|Proteomes:UP000265160};
RN [1] {ECO:0000313|Ensembl:ENSMZEP00005033994.1, ECO:0000313|Proteomes:UP000265160}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=25186727; DOI=10.1038/nature13726;
RA Brawand D., Wagner C.E., Li Y.I., Malinsky M., Keller I., Fan S.,
RA Simakov O., Ng A.Y., Lim Z.W., Bezault E., Turner-Maier J., Johnson J.,
RA Alcazar R., Noh H.J., Russell P., Aken B., Alfoldi J., Amemiya C.,
RA Azzouzi N., Baroiller J.F., Barloy-Hubler F., Berlin A., Bloomquist R.,
RA Carleton K.L., Conte M.A., D'Cotta H., Eshel O., Gaffney L., Galibert F.,
RA Gante H.F., Gnerre S., Greuter L., Guyon R., Haddad N.S., Haerty W.,
RA Harris R.M., Hofmann H.A., Hourlier T., Hulata G., Jaffe D.B., Lara M.,
RA Lee A.P., MacCallum I., Mwaiko S., Nikaido M., Nishihara H.,
RA Ozouf-Costaz C., Penman D.J., Przybylski D., Rakotomanga M., Renn S.C.P.,
RA Ribeiro F.J., Ron M., Salzburger W., Sanchez-Pulido L., Santos M.E.,
RA Searle S., Sharpe T., Swofford R., Tan F.J., Williams L., Young S., Yin S.,
RA Okada N., Kocher T.D., Miska E.A., Lander E.S., Venkatesh B., Fernald R.D.,
RA Meyer A., Ponting C.P., Streelman J.T., Lindblad-Toh K., Seehausen O.,
RA Di Palma F.;
RT "The genomic substrate for adaptive radiation in African cichlid fish.";
RL Nature 513:375-381(2014).
RN [2] {ECO:0000313|Ensembl:ENSMZEP00005033994.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P9DHY1; -.
DR Ensembl; ENSMZET00005035190.1; ENSMZEP00005033994.1; ENSMZEG00005025418.1.
DR GeneTree; ENSGT00940000166821; -.
DR Proteomes; UP000265160; LG18.
DR GO; GO:0044666; C:MLL3/4 complex; IEA:InterPro.
DR GO; GO:0042800; F:histone H3K4 methyltransferase activity; IEA:InterPro.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd15509; PHD1_KMT2C_like; 1.
DR CDD; cd15594; PHD2_KMT2C; 1.
DR CDD; cd15511; PHD3_KMT2C; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 3.
DR InterPro; IPR034732; EPHD.
DR InterPro; IPR000637; HMGI/Y_DNA-bd_CS.
DR InterPro; IPR047004; KMT2C_PHD2.
DR InterPro; IPR037877; PHD3_KMT2C.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR45888:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE 2C; 1.
DR PANTHER; PTHR45888; HL01030P-RELATED; 1.
DR Pfam; PF00628; PHD; 2.
DR Pfam; PF13771; zf-HC5HC2H; 1.
DR SMART; SM00249; PHD; 4.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 3.
DR PROSITE; PS51805; EPHD; 1.
DR PROSITE; PS00354; HMGI_Y; 1.
DR PROSITE; PS50016; ZF_PHD_2; 2.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000265160};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 208..330
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS51805"
FT DOMAIN 387..437
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 452..521
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT REGION 1..137
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 177..219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 529..553
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 642..672
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 764..789
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 857..918
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..28
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 83..102
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 119..137
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 201..219
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 918 AA; 99202 MW; 2E7E6EE606329546 CRC64;
MSSEDKTVEP SDQGPSPPSS SVGATSTGSP AHADKRPRGR PRKDAAALLP QAPPLSTSKN
RKKGRTRGRV VVDEEDSMDG TEITESIDPQ DTETQIQPVE TVEVTSEVPE EDRPSSPLAQ
SPLAENSAGP SVPTSREVKS SERLCAFCYC GGRSLLGQGD LQVFTVTSQL EALFSHKADG
SSTGDGSDGD KTAQPKMAEE TTSGQKEKNG SEGCEDEPDP ASRFWNELSH VGLPEDLNVQ
SLFESGQCWA HQSCALWSDG VCEGEGQSLL NVDRAIDSGS TKHCAYCKRL GASIKCCAEG
CAQLYHYPCA GAAGTFQDIR SLSLLCPEHI ELATHKFVDD INCVLCDSPG DLLDQLFCTS
CGLHYHGICL DMAVTPLRRA GWQCPECKIC QTCKNPGEDT KMLVCDMCDK GYHTFCLQPV
IDSLPTNGWR CQNCRVCLQC GTRTSGQWHH TSLLCENCVQ NQDPALCCPM CACILDPEHH
KDVVFCHTCK RWLHLECERQ NSGQAEIHPR EDYVCSNCRS PAAEQALHPE DMDTGPELSP
QPASMHTNSE TGVQLAQKHT DLELGNQLPP EMHKDPKPEL LAVPLHSDPE HLQATVQEEK
LATSDTKPDI SVGADIVERQ VTEPSVKIPA EIKSETKPAV RAQLASTESH ASSVTLSQES
IPAQHSTEGL QTQDTTIQIK PANTEACPEK GMVNPSTKDF KAIISTTKEP GTASVPFTDQ
PMVVALPQPS NMDVVRGSFL ETNPKELNSD RREGIFHRNL AETVKPSTST SEEMDTTPEK
PSMPFPSLSL EEKPLKTTVE RLAEMVSSPS NSSPLARGTS PRELTQTQIL TSMSDHSSVM
PTTTLIAFTP KIGMGKPAIT KRKFSPGRPR VKQGRGSGFP GRRRPRGAGL SGRAGRGRAR
GKNGVSPSIN PGVCAVSV
//