ID A0A340XZG1_LIPVE Unreviewed; 683 AA.
AC A0A340XZG1;
DT 10-OCT-2018, integrated into UniProtKB/TrEMBL.
DT 10-OCT-2018, sequence version 1.
DT 24-JAN-2024, entry version 27.
DE SubName: Full=Methyl-CpG-binding domain protein 1 isoform X2 {ECO:0000313|RefSeq:XP_007465637.1};
GN Name=MBD1 {ECO:0000313|RefSeq:XP_007465637.1};
OS Lipotes vexillifer (Yangtze river dolphin).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Odontoceti;
OC Lipotidae; Lipotes.
OX NCBI_TaxID=118797 {ECO:0000313|Proteomes:UP000265300, ECO:0000313|RefSeq:XP_007465637.1};
RN [1] {ECO:0000313|RefSeq:XP_007465637.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_007465637.1; XM_007465575.1.
DR AlphaFoldDB; A0A340XZG1; -.
DR GeneID; 103086301; -.
DR CTD; 4152; -.
DR OrthoDB; 5262043at2759; -.
DR Proteomes; UP000265300; Unplaced.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd01396; MeCP2_MBD; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR002857; Znf_CXXC.
DR PANTHER; PTHR12396; METHYL-CPG BINDING PROTEIN, MBD; 1.
DR PANTHER; PTHR12396:SF46; METHYL-CPG-BINDING DOMAIN PROTEIN 1; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF02008; zf-CXXC; 3.
DR SMART; SM00391; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS51058; ZF_CXXC; 3.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000265300};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00509}.
FT DOMAIN 1..69
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 169..216
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT DOMAIN 217..263
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT DOMAIN 357..405
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT REGION 80..123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 274..334
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 417..478
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 553..600
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 625..666
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 80..94
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..120
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 294..308
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 310..334
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 567..587
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 683 AA; 75561 MW; 6D8C57BC869674E9 CRC64;
MAEDWLDCPA LGPGWKRREV FRKSGATCGR SDTYYQSPTG DRIRSKVELT RYLGPACDLT
LFDFKQGILC YPAPKAHSLA IPSRKRKKPS KPAKAQKRQV GPSKSEVRKE APRDETKADA
DTVPASLPAP GCCENCGISF SGDGTRRQRL KTLCKDCRAQ RIAFNREQRM FKRVGCGECA
ACQVTEDCGA CSTCLLQLPH DVASGLFCKC ERRRCLRIVE RSRGCGVCRG CQTREDCGRC
RVCLRPPRPG LRRQWRCVQR RCLRHLAHRL RRRHHQRCQR RPPLAVAPPA GKHGRRRGGC
DSKVAPRRRP PRTQPLPALP PSQPPESPEL HPRALAPSPP AEFIYYCVDE DELQPYTNRR
QNRKCGACAA CLRRMDCGHC DFCCDKPKFG GSNQKRQKCR WRQCLQFAMK RLLPSVWAGS
EDGTRPPTPY PRRKRPGSAR RPRLGQTPKP PLATPMAQPD RAQTPVKQEA GSGFVLPPPG
TDLVFLREGA SSPVQVPGPA PASTTALLQE AQCPGLSWVV ALPQVKQEKA DAQEDWTPGT
AILTSPVLLP GCPSKAVDPG LPPVKQEPPD PEEDKEEANK DDPTADLAPE EEAGGAGTPV
ITEIFSLGGT RLRDTAVWLP SLQGRQSGRE DGCKEWETEE TLAPTSTSCK PRGWPGTHVS
LSPPPTSMMW VSCRRSWCPS SQT
//