ID A0A384BC50_BALAS Unreviewed; 657 AA.
AC A0A384BC50;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE SubName: Full=Methyl-CpG-binding domain protein 1 isoform X6 {ECO:0000313|RefSeq:XP_007197297.1};
GN Name=MBD1 {ECO:0000313|RefSeq:XP_007197297.1};
OS Balaenoptera acutorostrata scammoni (North Pacific minke whale)
OS (Balaenoptera davidsoni).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Whippomorpha; Cetacea; Mysticeti;
OC Balaenopteridae; Balaenoptera.
OX NCBI_TaxID=310752 {ECO:0000313|Proteomes:UP000261681, ECO:0000313|RefSeq:XP_007197297.1};
RN [1] {ECO:0000313|RefSeq:XP_007197297.1}
RP IDENTIFICATION.
RC TISSUE=Muscle {ECO:0000313|RefSeq:XP_007197297.1};
RG RefSeq;
RL Submitted (JAN-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_007197297.1; XM_007197235.1.
DR AlphaFoldDB; A0A384BC50; -.
DR CTD; 4152; -.
DR OrthoDB; 5262043at2759; -.
DR Proteomes; UP000261681; Unplaced.
DR GO; GO:0005654; C:nucleoplasm; IEA:UniProt.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd01396; MeCP2_MBD; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR002857; Znf_CXXC.
DR PANTHER; PTHR12396; METHYL-CPG BINDING PROTEIN, MBD; 1.
DR PANTHER; PTHR12396:SF59; METHYL-CPG-BINDING DOMAIN PROTEIN 1; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF02008; zf-CXXC; 3.
DR SMART; SM00391; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS51058; ZF_CXXC; 3.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000261681};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00509}.
FT DOMAIN 1..69
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 169..216
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT DOMAIN 217..263
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT DOMAIN 331..379
FT /note="CXXC-type"
FT /evidence="ECO:0000259|PROSITE:PS51058"
FT REGION 80..123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 266..308
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 391..449
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 525..574
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 597..640
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 80..94
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..120
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 284..308
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 545..559
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 657 AA; 72426 MW; B104D98B2C42EC86 CRC64;
MAEDWLDCPA LGPGWKRREV FRKSGATCGR SDTYYQSPTG DRIRSKVELT RYLGPACDLT
LFDFKQGILC YPAPKAHSLA IPSRKRKKPS RPAKAQKRQV GPPKSEVRKE APRDETKADA
DTAPASLPAP GCCENCGISF SGDGTRRQRL KTLCKDCRAQ RIAFNREQRM FKRVGCGECA
ACQVTEDCGA CSTCLLQLPH DVASGLFCKC ERRRCLRIVE RSRGCGVCRG CQTREDCGRC
RVCLRPPRPG LRRQWRCVQR RCLRGKHGRR RGGCDSKVAP RRRPPRTQPL PALPPSQPPE
SPELHPRALA PSPPAEFIYY CVDEDELQPY TNRRQNRKCG ACAACLRRMD CGHCDFCCDK
PKFGGSNQKR QKCRWRQCLQ FAMKRLLPSV WAGSEDGAGP PAPYPRRKRP GSARRPRLGQ
TPKPPLATPM AQPDRARTPV KQEAGSGFVL PPPGTDLVFL REGASSPVQV PGPAPASTAA
LLQEAQCPGL SWVVALPQVK QEKADAQEDW TPGTAILTSP VLLPGCPSKA VDPGLPPVKQ
EPPDPEEDKE EENKDDSTSD LAPEEEAGGA GTPVITEIFS LGGTRLRDTA VWLPSLQGRQ
SGREDGCKEW ETEETLAPPS TSWKPRGWPG THVSLSPPPT SMMWVSCRRS WCPSSQS
//