ID A0A484BI50_DRONA Unreviewed; 1351 AA.
AC A0A484BI50;
DT 05-JUN-2019, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=Histone-lysine N-methyltransferase eggless {ECO:0008006|Google:ProtNLM};
GN ORFNames=AWZ03_005854 {ECO:0000313|EMBL:TDG47710.1};
OS Drosophila navojoa (Fruit fly).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; Ephydroidea;
OC Drosophilidae; Drosophila.
OX NCBI_TaxID=7232 {ECO:0000313|EMBL:TDG47710.1, ECO:0000313|Proteomes:UP000295192};
RN [1] {ECO:0000313|EMBL:TDG47710.1, ECO:0000313|Proteomes:UP000295192}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Navoj_Jal97 {ECO:0000313|EMBL:TDG47710.1};
RC TISSUE=Whole organism {ECO:0000313|EMBL:TDG47710.1};
RX PubMed=30423125; DOI=10.1093/jhered/esy059;
RA Vanderlinde T., Dupim E.G., Nazario-Yepiz N.O., Carvalho A.B.;
RT "An Improved Genome Assembly for Drosophila navojoa, the Basal Species in
RT the mojavensis Cluster.";
RL J. Hered. 110:118-123(2019).
CC -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000256|ARBA:ARBA00004286}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:TDG47710.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LSRL02000040; TDG47710.1; -; Genomic_DNA.
DR RefSeq; XP_017960874.1; XM_018105385.1.
DR STRING; 7232.A0A484BI50; -.
DR GeneID; 108654234; -.
DR KEGG; dnv:108654234; -.
DR OMA; LLCCDCE; -.
DR OrthoDB; 2877903at2759; -.
DR Proteomes; UP000295192; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt.
DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:UniProt.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd01395; HMT_MBD; 1.
DR CDD; cd10517; SET_SETDB1; 1.
DR CDD; cd20382; Tudor_SETDB1_rpt1; 1.
DR CDD; cd21181; Tudor_SETDB1_rpt2; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR047232; SETDB1/2-like_MBD.
DR InterPro; IPR041292; Tudor_4.
DR InterPro; IPR041291; TUDOR_5.
DR PANTHER; PTHR46024; HISTONE-LYSINE N-METHYLTRANSFERASE EGGLESS; 1.
DR PANTHER; PTHR46024:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE EGGLESS; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF18358; Tudor_4; 1.
DR Pfam; PF18359; Tudor_5; 1.
DR SMART; SM00391; MBD; 1.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000295192};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Repressor {ECO:0000256|ARBA:ARBA00022491}.
FT DOMAIN 909..975
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 1037..1109
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 1112..1326
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 1..124
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 150..311
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1179..1237
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 37..83
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 84..123
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 181..195
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 209..255
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 275..304
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1184..1200
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1202..1217
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1351 AA; 152618 MW; 0E1DEE3754AC3C82 CRC64;
MESESAAVDC SENDDAAAVD PSIEATSLPA GKASTAKEAE AEAEQGEKVE GGDKMSLDED
DHDEPSKASS EYKESENSDG TEQDKNITEI TSEITEESSI LKLDSTKPIE TDMDSSIELI
SSPISEQLEA ADKNIKGSTD ESDAVKECVA PLMDSNDAHK STEDDLDSSI ELIESPALKE
PSETDNIKDS KSDSDSSIEL ISSPKNTEGN NEEESESKSD DKESVNCNPD KDDKDKGESE
DKQIIPEHIE ESKTNTEDAT IVGVKFDSAM PTESPTVKEE PQSMTVAEEP QQPDESSNAE
MKSIQDEPME TEARQEVESS CLAKLHEEDY KETKPEGDSK ESVDYEEDNE IYYKKDCLNC
NCEKLHKQYV RASLAALNYY KVPRKAHKRQ YICMSCYDTA MDMYEDYAGH LMAKQPLLLR
SFNQNHADFV ALDSSDEEED DDEQSKPEFS ANDLQLIENE LEDAIKTVIN RVDLDDQMAW
SKSILQAKSQ RIGQEIEMVN DELCKLQNMA DKMHFALYSS CQVVHKHLPP LDLYHNMNDY
VQVPPAGEIE RPPIQLNETY YAVKNKAIAS WVSVKVIEFT ESSTVGGNVI KSYKIKYLNT
PYQMVKTVTA KHLAYFEPPP VRLTIGTRVI AYFNGSTLSR GKEKGVVPSA FYPGIIAEPL
KQANRFRYLI FYDDGYTQYV QHNDVRLVCM ASEKVWEDVH PGSRDFIQKY VEKYSVDRPM
VQCTRGQNMN TESNGTWLYA RVIDIDCSLV QMQFDDDKNH TEWIYRGSLR LGPVFKETQN
ALNSANAMQQ HRVPRRTEPF IRYTKEMETS SRQVNQQMRA IARKSSSAPQ NVNMSSSSSA
AAAASNAAAA AAAARNTVRH LNNSTIYVDD ENRPKGHVVY FTAKRNLPPK MYTPHECTPA
CLFKIVHRLD SYSPLAKPLL SGWERLVFKQ KAKRNVVYRG PCGKSFRNLA EVHKYLRATD
NVLNVENFDF TPDLRCLAEY SIDPTIVKEA DISKGQEKMA IPLVNYYDNT LPPPCTYAKQ
RIPTEGVHLN LDEEFLVGCD CEDDCSDKSK CACWQLTVAG VRYCNPNKPI EEIGYQYKRL
HEHVPTGIYE CNSRCKCKKN CLNRVVQHSL EMKLQVFKTS NRGWGLRCVN DIPKGAFICI
YAGHLLTETM ANEGGLDAGD EYFADLDYIE VAEQLKEGYE SDVEHSPAEE EEDTYVPDPE
DDADFMPAKY YQPRKKDKLR VSRSHSTQST EQDSQERAVI NFNPNADLDE TVRENSVRRL
FGKDEAPYIM DAKTTGNLGR YFNHSCAPNL FVQNVFVDTH DLRFPWVAFF SANHIRSGTE
LTWNYNYEVG VVPGKVLYCQ CGATNCRIRL L
//