ID A0A2Y9FC96_PHYMC Unreviewed; 674 AA.
AC A0A2Y9FC96;
DT 12-SEP-2018, integrated into UniProtKB/TrEMBL.
DT 12-SEP-2018, sequence version 1.
DT 24-JAN-2024, entry version 26.
DE SubName: Full=Methyl-CpG-binding domain protein 1 isoform X9 {ECO:0000313|RefSeq:XP_007119393.1};
GN Name=MBD1 {ECO:0000313|RefSeq:XP_007119393.1};
OS Physeter macrocephalus (Sperm whale) (Physeter catodon).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Odontoceti;
OC Physeteridae; Physeter.
OX NCBI_TaxID=9755 {ECO:0000313|Proteomes:UP000248484, ECO:0000313|RefSeq:XP_007119393.1};
RN [1] {ECO:0000313|RefSeq:XP_007119393.1}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_007119393.1};
RG RefSeq;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_007119393.1; XM_007119331.4.
DR AlphaFoldDB; A0A2Y9FC96; -.
DR GeneID; 102976647; -.
DR CTD; 4152; -.
DR OrthoDB; 5262043at2759; -.
DR Proteomes; UP000248484; Chromosome 19.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd01396; MeCP2_MBD; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR002857; Znf_CXXC.
DR PANTHER; PTHR12396; METHYL-CPG BINDING PROTEIN, MBD; 1.
DR PANTHER; PTHR12396:SF46; METHYL-CPG-BINDING DOMAIN PROTEIN 1; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF02008; zf-CXXC; 3.
DR SMART; SM00391; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS51058; ZF_CXXC; 3.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000248484};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00509}.
FT DOMAIN 1..69
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 169..216
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT DOMAIN 249..294
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT DOMAIN 387..435
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT REGION 80..123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 231..252
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 304..364
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 447..507
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 574..630
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 80..94
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..120
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 324..338
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 339..364
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 601..615
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 674 AA; 74335 MW; C7E0FF5FADF81639 CRC64;
MAEDWLDCPA LGPGWKRREV FRKSGATCGR SDTYYQSPTG DRIRSKVELT RYLGPACDLT
LFDFKQGILC YPAPKAHSLA IPSRKRKKPS RPAKAQKRQV GPPKSEVRKE ASRDETKADA
DTVPASLPAP GCCENCGISF SGDGTRRQRL KTLCKDCRAQ RIAFNREQRM FKRVGCGECA
ACQVTEDCGA CSTCLLQLPH DVASGLFCKC ERRRCLRIVE RVSRAGGVGP RLTCTPDPHS
PGPMRHTSGP PQSRGCGVCR GCQTREDCGR CRVCLRPPRP GLRRQWRCVQ RRCLRHLAHR
LRRHHQRCQR RPPLAVAPPA GKHGRRRGGC DSKVAPRRRP PRTQPPPALP PSQPPESPEL
HPRALAPSPP AEFIYYCVDE DELQPYTNRR QNRKCGACAA CLRRMDCGHC DFCCDKPKFG
GSNQKRQKCR WRQCLQFAMK RLLPSVWAGS EDGAGPPAPY PRRKRPGSAR RPRLGQTPKP
PLATPMAQPD RAQTPVKQEA GSGFVLPPPG TDLVFLREGA SSPVQVPGPA PASTASLLQE
AQCPGLSWVV ALPQVKQEKA DAQEDWTPGT AILTSPVLLP GCPSKAADPG LPPVKQEPPD
PEEDKEEENK DDSTSDLAPE EEAGGAGTPV ITEIFSLGGT RLRDTAVWLP RLRKLLAVNE
NEYFTELQLK EEAL
//