ID W5J3S8_ANODA Unreviewed; 2064 AA.
AC W5J3S8;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 24-JAN-2024, entry version 63.
DE RecName: Full=Histone-lysine N-methyltransferase EHMT2 {ECO:0008006|Google:ProtNLM};
GN ORFNames=AND_009901 {ECO:0000313|EMBL:ETN58496.1};
OS Anopheles darlingi (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN58496.1};
RN [1] {ECO:0000313|EMBL:ETN58496.1, ECO:0000313|Proteomes:UP000000673}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT the genome of the newly sequenced Anopheles darlingi.";
RL BMC Genomics 11:529-529(2010).
RN [2] {ECO:0000313|EMBL:ETN58496.1}
RP NUCLEOTIDE SEQUENCE.
RA Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ETN58496.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23761445;
RA Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA Camargo E.P., de Vasconcelos A.T.;
RT "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL Nucleic Acids Res. 41:7387-7400(2013).
RN [4] {ECO:0000313|EnsemblMetazoa:ADAC009901-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADMH02002131; ETN58496.1; -; Genomic_DNA.
DR STRING; 43151.W5J3S8; -.
DR EnsemblMetazoa; ADAC009901-RA; ADAC009901-PA; ADAC009901.
DR VEuPathDB; VectorBase:ADAC009901; -.
DR VEuPathDB; VectorBase:ADAR2_004667; -.
DR eggNOG; KOG1082; Eukaryota.
DR HOGENOM; CLU_232920_0_0_1; -.
DR OMA; GNFAMCK; -.
DR OrthoDB; 5481936at2759; -.
DR Proteomes; UP000000673; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt.
DR GO; GO:0002039; F:p53 binding; IEA:InterPro.
DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd20905; EHMT_ZBD; 1.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 2.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR043550; EHMT1/EHMT2.
DR InterPro; IPR047762; EHMT_CRR.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR46307; G9A, ISOFORM B; 1.
DR PANTHER; PTHR46307:SF4; G9A, ISOFORM B; 1.
DR Pfam; PF00023; Ank; 1.
DR Pfam; PF12796; Ank_2; 1.
DR Pfam; PF21533; EHMT1-2_CRR; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR PRINTS; PR01415; ANKYRIN.
DR SMART; SM00248; ANK; 6.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF48403; Ankyrin repeat; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50297; ANK_REP_REGION; 3.
DR PROSITE; PS50088; ANK_REPEAT; 4.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|PROSITE-ProRule:PRU00023};
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Reference proteome {ECO:0000313|Proteomes:UP000000673}.
FT REPEAT 1613..1635
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 1656..1688
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 1689..1721
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 1722..1754
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT DOMAIN 1847..1910
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 1913..2026
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 38..106
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 119..143
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 188..285
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 321..343
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 382..730
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 791..823
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 954..1138
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1151..1198
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1461..1503
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 38..59
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 75..105
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 122..141
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 188..208
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 209..225
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 262..277
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 417..443
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 451..488
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 512..530
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 531..577
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 613..627
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 673..690
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 713..730
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 802..817
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 977..995
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 996..1024
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1069..1110
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1155..1198
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1478..1503
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2064 AA; 219276 MW; 120FDCB0EBAAE66D CRC64;
MDFIGNLLNQ MSSAFNQETA IPKKEEISMD ETLKWRALKN NQFASRTRQP SSSGGTGGGS
GSGKGGRKVE ANGQPRPGSS ATSSGLATMM TMSSAENSND SGPAASALAI SEGRRSAPIL
DGSLDDTNSS ALLSNGGASS TPDLMIAGIE EDDDLEDGIE IEIAKDISGE ELQEALGLSQ
LVVETIDDLS QQSQSSNSVP PGSHSKTTPI MKSEDEKDED PIVRRDAAIG SSLAAEESSI
ADDLHLPATK TEKPEVVATT NATEEQEERE KERKEVVPGS VSSQQVDASP VAVIADADAA
DDDVVIIAPS TVQEVVVLVA EDGTKEEEEE EQERGACASA ASESAAATHM TTVASSAAST
KCTEKKQAVL TDADPVAATS VASVDEVERK YRPQENDSVR ANATMAETAG GPSVDDLKLE
ESLEKKTDPD GAGDGEEKLA EAARPPSAGR ATRSKKTSIS SATTVTPPPL AGVANAANSS
SQRRSQRFVK DSGVAASGKL NGEQQSRSSP VVVKEEQEEP AIKKKTNDES SKQANVSMAT
TTKVAAIDRS SSSSTATLKS DRSLRSKQLQ SSQGVGGPAA SAIAGGRRAS ETVKSACQED
SNDSIPVAES EPAEKPPNRR GRKRLSHQEQ TAIATPQSVV VHEEPTVASE ESQQRASGSG
TRSVFPLDPG ENDSGGKQQQ QQQQQQAPVA VPAPNRRGRK RKNPIPVDAT VNTGPVTAAS
AAAASSTSSL PYGKRMLRMS RDGSEGILAS ALARRDKVDS QGRSSRPIKL SAKMLANEEL
RQGFEQHNNG RIIIASDSVN TDDTTTDRDR ERASETSSRK QSATVAAAAA ATAAAAAASA
AVKAGSIGAQ QRAERAGSGS SLSSSSEDVT LVSVTKVVPT VAPVATSVKR GSAEAKETKR
IQDSATLPVS TEATVKGTSN EGIRTTIQPQ QQQAPTARCP DLQTFLQEIR SMRLGTNRSP
EENPKLNRRQ IKRLGKLKEK HLLALGLRRK SKENQRNGVS TSAADQSDGA SVIPSDTESS
GSEAEFVPSG KIGTVGKPSV TLRLRKPETL LENPRSLAGG RPASLPSGPS TGPARSGGTK
AKQQQQQQQT NGRNIANAPG SSSRRQSAIV LPAAATTLPG PFSAPPPSKD METEHLQRSL
KRLGCEVTII PQAPRPQAPI SSSSAPRATS VARSGAGVGA GSSQRKQQRH NKTPKGISQA
AQALRTLSSV SPSLEIVLDK RPIGLATAPF GNKRTPFEAS GSTKTVVEAS GTTGGTVDGG
LVCLCAQISD VYVRRPAGNG YCTAIDDIDG QPIGCCNELT EDEVIMLRPS ASVSFQVFCN
MHRKRLEDHG CCAVCGHFCT QGNFAMCKNV HLFHPNCAKK YILNTPYNSN RPDEPPTAPI
LVLQCPHCAR ECPNGEIQVN IQLTTPPVLL PSRSNVVKPA KMTVSKPDSS STNGGTVGED
IFRTKVNALV PSSVKNMLAT SSSGGGGING NGSGSLAAAP RAGSSSGRSE RSNNGTGVGG
SRRKTTFTKQ DFYRAITTQH NDVDRVSEII ASGFDIETRF ADIHGGTCLH LVAHYGTITS
AYLIISRARS VDYLNIADNV LRTAMMCALE QKKFEIIKLL LDCGADATVK GPDGMTVLHI
AARHGHHEAV RTILESVRKR LTARELSSFL NRGDGGRWTA LAWAAENRHK ETIQQLLELG
ADVNVCDLEN NTSLHWATLA GCTDTLYLLL NKCCDTNVQN TSGDTPLHIA CRLGHASSCI
LLMAKGASLT IRNNAGEQPM DAIGDPDSEC ASILGANLKM RLLAKNTKET RVLSSDISNG
RERYPVQVVQ TVGANDRLQA LPKFKYVKRT VQVECSVQMD TNLRNMRLCS CTDDCSSEGA
NCVCSERGWY NADGRLVDDF NYHHPPEIVE CGDACDCNRL VCRNRVVQRG LLVPLQIFHS
AGKGWSVRTL VRIAKGSFLV EYVGELLTDE AADRRPDDSY IFDLGAGYCM DASAYGNVSR
FFNHSCKPNV SPVRVFYEHQ DTRFPKVAMF ACRDIEPQEE ICFDYGDKFW MVKNRTVCCQ
CNASECRYRT VQQAGDGCPV TVVL
//