ID H2P9D2_PONAB Unreviewed; 573 AA.
AC H2P9D2; A0A2J8TVI0; A0A6D2W214;
DT 21-MAR-2012, integrated into UniProtKB/TrEMBL.
DT 21-MAR-2012, sequence version 1.
DT 27-MAR-2024, entry version 59.
DE RecName: Full=Methyl-CpG-binding domain protein 4 {ECO:0000256|PIRNR:PIRNR038005};
DE EC=3.2.2.- {ECO:0000256|PIRNR:PIRNR038005};
GN Name=MBD4 {ECO:0000313|Ensembl:ENSPPYP00000015005.2};
GN ORFNames=CR201_G0032119 {ECO:0000313|EMBL:PNJ37036.1};
OS Pongo abelii (Sumatran orangutan) (Pongo pygmaeus abelii).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pongo.
OX NCBI_TaxID=9601 {ECO:0000313|Ensembl:ENSPPYP00000015005.2, ECO:0000313|Proteomes:UP000001595};
RN [1] {ECO:0000313|Ensembl:ENSPPYP00000015005.2, ECO:0000313|Proteomes:UP000001595}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Wilson R.K., Mardis E.;
RT "A 6x draft sequence assembly of the Pongo pygmaeus abelii genome.";
RL Submitted (FEB-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:PNJ37036.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Susie {ECO:0000313|EMBL:PNJ37036.1};
RA Pollen A., Hastie A., Hormozdiari F., Dougherty M., Liu R., Chaisson M.,
RA Hoppe E., Hill C., Pang A., Hillier L., Baker C., Armstrong J.,
RA Shendure J., Paten B., Wilson R., Chao H., Schneider V., Ventura M.,
RA Kronenberg Z., Murali S., Gordon D., Cantsilieris S., Munson K., Nelson B.,
RA Raja A., Underwood J., Diekhans M., Fiddes I., Haussler D., Eichler E.;
RT "High-resolution comparative analysis of great ape genomes.";
RL Submitted (DEC-2017) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|Ensembl:ENSPPYP00000015005.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Mismatch-specific DNA N-glycosylase involved in DNA repair.
CC Has thymine glycosylase activity and is specific for G:T mismatches
CC within methylated and unmethylated CpG sites. Can also remove uracil or
CC 5-fluorouracil in G:U mismatches. Has no lyase activity. Was first
CC identified as methyl-CpG-binding protein.
CC {ECO:0000256|PIRNR:PIRNR038005}.
CC -!- SUBUNIT: Interacts with MLH1. {ECO:0000256|PIRNR:PIRNR038005}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PIRNR:PIRNR038005}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NDHI03003481; PNJ37036.1; -; Genomic_DNA.
DR STRING; 9601.ENSPPYP00000015005; -.
DR Ensembl; ENSPPYT00000015605.3; ENSPPYP00000015005.2; ENSPPYG00000013417.3.
DR eggNOG; KOG4161; Eukaryota.
DR GeneTree; ENSGT00530000063687; -.
DR HOGENOM; CLU_034167_0_0_1; -.
DR OMA; MTECHKS; -.
DR TreeFam; TF329176; -.
DR Proteomes; UP000001595; Chromosome 3.
DR GO; GO:0016607; C:nuclear speck; IEA:Ensembl.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0008263; F:pyrimidine-specific mismatch base pair DNA N-glycosylase activity; IEA:Ensembl.
DR GO; GO:0006281; P:DNA repair; IEA:UniProtKB-KW.
DR CDD; cd01396; MeCP2_MBD; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR011257; DNA_glycosylase.
DR InterPro; IPR017352; MBD4.
DR InterPro; IPR045138; MeCP2/MBD4.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR PANTHER; PTHR15074:SF7; METHYL-CPG-BINDING DOMAIN PROTEIN 4; 1.
DR PANTHER; PTHR15074; METHYL-CPG-BINDING PROTEIN; 1.
DR Pfam; PF01429; MBD; 1.
DR PIRSF; PIRSF038005; Methyl_CpG_bd_MBD4; 1.
DR SMART; SM00391; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR SUPFAM; SSF48150; DNA-glycosylase; 1.
DR PROSITE; PS50982; MBD; 1.
PE 4: Predicted;
KW DNA damage {ECO:0000256|PIRNR:PIRNR038005};
KW DNA repair {ECO:0000256|PIRNR:PIRNR038005};
KW DNA-binding {ECO:0000256|PIRNR:PIRNR038005};
KW Hydrolase {ECO:0000256|PIRNR:PIRNR038005};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PIRNR:PIRNR038005};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000001595}.
FT DOMAIN 76..148
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT REGION 1..38
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 573 AA; 65431 MW; FC7043E12A2025C9 CRC64;
MGTTGLESLS LGDRGAAPTV TSSERLVPDP PSDLRKEDVA MELERVGEDE EQMMIKRSSK
CNPLLQEPIT SAQFGATAGT ECHKSVPCGW ERVVKQRLFG KTAGRFDVYF ISPEGLKFRS
KSSLANYLHK NGETSLKPED FDFTVLSKRG IKSRYKDCSM AALTSHLQNQ SNNSNWNLRT
RSKCKKDVFM PPNSSSELRE SRGLSNFTSS HLLLKEDEGV EDVNFRKVRK PKGKVTILKG
IPIKKTKKGC RKSCSGFVQS DSKRESVCNK ADAESEPVAQ KSQLDRTVCI SDAGACDETL
SVTSEENSLV KERSLSSGSN FCSEQKTSGI INKFCSAKDS EHNEKYEDTF LESEEIRTKV
EEVVERKEHL HTDILKRGSE MDNNCSPTRK DFTEDTIPRT QIERRKTSLY FSSKYNKEAL
SPPRRKAFKK WTPPRSPFNL IQETLFHDPW KLLIATIFLN RTSGKMAIPV LWKFLEKYPS
AEVARTADWR DVSELLKPLG LYDLRAKTIV KFSDEYLTKQ WKYPIELHGI GKYGNDSYRI
FCVNEWKQVH PEDHKLNKYH DWLWENHEKL SLS
//