ID A0A2I3GY12_NOMLE Unreviewed; 549 AA.
AC A0A2I3GY12;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE SubName: Full=Methyl-CpG binding domain protein 1 {ECO:0000313|Ensembl:ENSNLEP00000036191.1};
GN Name=MBD1 {ECO:0000313|Ensembl:ENSNLEP00000036191.1};
OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hylobatidae;
OC Nomascus.
OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000036191.1, ECO:0000313|Proteomes:UP000001073};
RN [1] {ECO:0000313|Ensembl:ENSNLEP00000036191.1, ECO:0000313|Proteomes:UP000001073}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Gibbon Genome Sequencing Consortium;
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSNLEP00000036191.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADFV01039189; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01039190; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_003267547.1; XM_003267499.3.
DR AlphaFoldDB; A0A2I3GY12; -.
DR Ensembl; ENSNLET00000055882.1; ENSNLEP00000036191.1; ENSNLEG00000011661.3.
DR GeneID; 100592659; -.
DR CTD; 4152; -.
DR GeneTree; ENSGT00950000183005; -.
DR OrthoDB; 5262043at2759; -.
DR Proteomes; UP000001073; Chromosome 4.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd01396; MeCP2_MBD; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR002857; Znf_CXXC.
DR PANTHER; PTHR12396; METHYL-CPG BINDING PROTEIN, MBD; 1.
DR PANTHER; PTHR12396:SF46; METHYL-CPG-BINDING DOMAIN PROTEIN 1; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF02008; zf-CXXC; 2.
DR SMART; SM00391; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS51058; ZF_CXXC; 2.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000001073};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00509}.
FT DOMAIN 1..69
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 169..216
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT DOMAIN 217..263
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT REGION 80..123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 269..309
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 331..402
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 461..517
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 80..95
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..120
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 286..309
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 368..382
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 484..504
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 549 AA; 60044 MW; 8729F390D6E0BD38 CRC64;
MAEDWLDCPA LGPGWKRREV FRKSGATCGR SDTYYQSPTG DRIRSKVELT RYLGPACDLT
LFDFKQGILC YPAPKAHPVA VASKKRKKPS RPAKTRKRQV GPQSGEVRKE APRDETKADT
DTAPASFPAP GCCENCGISF SGDGTQRQRL KTLCKDCRAQ RIAFNREQRM FKRVGCGECA
ACQVTEDCGA CSTCLLQLPH DVASGLFCKC ERRRCLRIVE RSRGCGVCRG CQTQEDCGRC
PICLRPPRPG LRRQWKCVQR RCLRGKHARR KGGCDSKMAA RRRPGSQALP PPPPSQSPEP
TEPHPRALAP SPPAEFIYYC VDEDELKRLM PSVWSESDDG AGSPPPYRRR KRPSSARRHH
LGPTLKPTLA TRTAQPDHTQ APTKQEAGGG FVLPPPGTDL VFLREGASSP VQVPGPVAAS
TEALLQEAQC SGLSWVVALP QVKQEKADTQ DEWTPGTAVL TSPVLVPGCP SKAVDPGLPS
VKQEPPDPEE DKEENKDDSA SKLAPEEEAG GAGTPVITEI FSLGGTRFRD TTVWLPRSKD
LKKPGARKQ
//