ID A0A3Q2HTZ1_HORSE Unreviewed; 494 AA.
AC A0A3Q2HTZ1;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 27.
DE RecName: Full=Methyl-CpG-binding protein 2 {ECO:0000256|PIRNR:PIRNR038006};
DE Short=MeCp-2 protein {ECO:0000256|PIRNR:PIRNR038006};
DE Short=MeCp2 {ECO:0000256|PIRNR:PIRNR038006};
GN Name=MECP2 {ECO:0000313|Ensembl:ENSECAP00000038544.3,
GN ECO:0000313|VGNC:VGNC:20066};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000038544.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000038544.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000038544.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000038544.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000038544.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- FUNCTION: Chromosomal protein that binds to methylated DNA. It can bind
CC specifically to a single methyl-CpG pair. It is not influenced by
CC sequences flanking the methyl-CpGs. Binds both 5-methylcytosine (5mC)
CC and 5-hydroxymethylcytosine (5hmC)-containing DNA, with a preference
CC for 5-methylcytosine (5mC). {ECO:0000256|PIRNR:PIRNR038006}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PIRNR:PIRNR038006}.
CC Note=Colocalized with methyl-CpG in the genome.
CC {ECO:0000256|PIRNR:PIRNR038006}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3Q2HTZ1; -.
DR Ensembl; ENSECAT00000055563.3; ENSECAP00000038544.3; ENSECAG00000018208.4.
DR VGNC; VGNC:20066; MECP2.
DR GeneTree; ENSGT00530000063687; -.
DR Proteomes; UP000002281; Chromosome X.
DR Bgee; ENSECAG00000018208; Expressed in blood and 23 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0010385; F:double-stranded methylated DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000122; P:negative regulation of transcription by RNA polymerase II; IEA:UniProtKB-UniRule.
DR CDD; cd01396; MeCP2_MBD; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR017353; Me_CpG-bd_MeCP2.
DR InterPro; IPR045138; MeCP2/MBD4.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR PANTHER; PTHR15074; METHYL-CPG-BINDING PROTEIN; 1.
DR PANTHER; PTHR15074:SF6; METHYL-CPG-BINDING PROTEIN 2; 1.
DR Pfam; PF01429; MBD; 1.
DR PIRSF; PIRSF038006; Methyl_CpG_bd_MeCP2; 1.
DR SMART; SM00391; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR PROSITE; PS50982; MBD; 1.
PE 1: Evidence at protein level;
KW DNA-binding {ECO:0000256|PIRNR:PIRNR038006};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PIRNR:PIRNR038006};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Proteomics identification {ECO:0007829|PeptideAtlas:A0A3Q2HTZ1};
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Repressor {ECO:0000256|PIRNR:PIRNR038006};
KW Transcription {ECO:0000256|PIRNR:PIRNR038006};
KW Transcription regulation {ECO:0000256|PIRNR:PIRNR038006}.
FT DOMAIN 98..170
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT REGION 17..127
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 155..222
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 257..283
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 330..494
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 17..57
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..117
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 259..275
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 385..408
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 456..494
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 494 AA; 53438 MW; 8CA93987E2E2C13B CRC64;
MPFEKLEIQA IQGLMLREEK SEDQDLQGLK DKPLKFKKVK KDKKEDKEGK HEPLQPPAHH
SAEPAEAGKA ETSEGSGSAP AVPEASASPK QRRSIIRDRG PMYDDPTLPE GWTRKLKQRK
SGRSAGKYDV YLINPQGKAF RSKVELIAYF EKVGDTSLDP NDFDFTVTGR GSPSRREQKP
PKKPKSPKAP GTGRGRGRPK GSGTARPKAA ASEGVQVKRV LEKSPGKLLV KMPFQASPGS
KAEGGGATTS AQVMVIKRPG RKRKAEADPQ AIPKKRGRKP GSVVAAAAAE AKKKAVKESS
IRSVQETVLP IKKRKTRETV SIEVKEVVKP LLVSTLGEKS GKGLKTCKSP GRKSKESSPK
GRSSSTSSPP KKEHHHHHHH AEPPRAPAPL LPPPPPPPPE PQSSEDPTSP PEPQDLSSSV
CKEEKMPRGG SLESDGCPKE PAKTQPAVAT AATAAEKYKH RGEGERKDIV SSSMPRPNRE
EPVDSRTPVT ERVS
//