ID A0A1R3IEW0_COCAP Unreviewed; 336 AA.
AC A0A1R3IEW0;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE SubName: Full=Methyl-CpG DNA binding protein {ECO:0000313|EMBL:OMO81136.1};
GN ORFNames=CCACVL1_12588 {ECO:0000313|EMBL:OMO81136.1};
OS Corchorus capsularis (Jute).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=210143 {ECO:0000313|EMBL:OMO81136.1, ECO:0000313|Proteomes:UP000188268};
RN [1] {ECO:0000313|EMBL:OMO81136.1, ECO:0000313|Proteomes:UP000188268}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. CVL-1 {ECO:0000313|Proteomes:UP000188268};
RC TISSUE=Whole seedling {ECO:0000313|EMBL:OMO81136.1};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M.;
RT "Corchorus capsularis genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMO81136.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWWV01010214; OMO81136.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1R3IEW0; -.
DR STRING; 210143.A0A1R3IEW0; -.
DR EnsemblPlants; OMO81136; OMO81136; CCACVL1_12588.
DR Gramene; OMO81136; OMO81136; CCACVL1_12588.
DR OMA; HPGENTD; -.
DR OrthoDB; 622866at2759; -.
DR Proteomes; UP000188268; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR CDD; cd00122; MBD; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR039622; MBD10/11.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR PANTHER; PTHR33729; METHYL-CPG BINDING DOMAIN CONTAINING PROTEIN, EXPRESSED; 1.
DR PANTHER; PTHR33729:SF6; METHYL-CPG-BINDING DOMAIN-CONTAINING PROTEIN 10; 1.
DR Pfam; PF01429; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR PROSITE; PS50982; MBD; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000188268};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 16..84
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT REGION 66..336
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 100..292
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 293..336
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 336 AA; 36410 MW; DEFAEBAB73B11621 CRC64;
MSSSVEKEKK EMEVAKEDVA CLELPAPSGW KKKFMPKKGT KKNEIIFIAP TGEEFNNRKQ
LEQYLKAHPG GPAISEFDWG TGETPRRSAR ISEKVKVTPT PESEPPKKRG RKSSASKKDN
KESETAPEGA EETEDAHMEE AEKSGKENVE GETGKVAIKE DENEKENENK DKTQDADGKT
ESTSQEVKHG EDANISTNIE EGKEAAEAVS EKLKAPQDGV EADASGVDRK EKEGLEGAAS
ERKVEQPVAE AEKGLGSGEH DKPDAGITEE TKKEVEGLEK ENHDKSTTES EGTIKGKESA
NCNEGQNTSG VNETNKKTEE AVQNGSNGSN TGEVKP
//