ID A0A453JK56_AEGTS Unreviewed; 452 AA.
AC A0A453JK56;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE RecName: Full=MBD domain-containing protein {ECO:0000259|PROSITE:PS50982};
OS Aegilops tauschii subsp. strangulata (Goatgrass).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Pooideae; Triticodae; Triticeae; Triticinae; Aegilops.
OX NCBI_TaxID=200361 {ECO:0000313|EnsemblPlants:AET5Gv20092400.2, ECO:0000313|Proteomes:UP000015105};
RN [1] {ECO:0000313|Proteomes:UP000015105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. AL8/78 {ECO:0000313|Proteomes:UP000015105};
RX PubMed=25035499; DOI=10.1126/science.1250092;
RG International Wheat Genome Sequencing Consortium,;
RA Marcussen T., Sandve S.R., Heier L., Spannagl M., Pfeifer M.,
RA Jakobsen K.S., Wulff B.B., Steuernagel B., Mayer K.F., Olsen O.A.;
RT "Ancient hybridizations among the ancestral genomes of bread wheat.";
RL Science 345:1250092-1250092(2014).
RN [2] {ECO:0000313|Proteomes:UP000015105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. AL8/78 {ECO:0000313|Proteomes:UP000015105};
RX PubMed=29158546; DOI=10.1038/s41477-017-0067-8;
RA Zhao G., Zou C., Li K., Wang K., Li T., Gao L., Zhang X., Wang H., Yang Z.,
RA Liu X., Jiang W., Mao L., Kong X., Jiao Y., Jia J.;
RT "The Aegilops tauschii genome reveals multiple impacts of transposons.";
RL Nat. Plants 3:946-955(2017).
RN [3] {ECO:0000313|EnsemblPlants:AET5Gv20092400.2}
RP IDENTIFICATION.
RG EnsemblPlants;
RL Submitted (MAR-2019) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A453JK56; -.
DR STRING; 200361.A0A453JK56; -.
DR EnsemblPlants; AET5Gv20092400.2; AET5Gv20092400.2; AET5Gv20092400.
DR Gramene; AET5Gv20092400.2; AET5Gv20092400.2; AET5Gv20092400.
DR Proteomes; UP000015105; Chromosome 5D.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR039622; MBD10/11.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR PANTHER; PTHR33729; METHYL-CPG BINDING DOMAIN CONTAINING PROTEIN, EXPRESSED; 1.
DR PANTHER; PTHR33729:SF2; METHYL-CPG BINDING DOMAIN CONTAINING PROTEIN, EXPRESSED; 1.
DR Pfam; PF01429; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR PROSITE; PS50982; MBD; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000015105};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 41..108
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 83..452
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..15
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 83..110
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 111..135
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 141..170
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 187..342
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 418..452
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 452 AA; 47191 MW; D3918F8155E2877D CRC64;
IPTRPSFPLF PHPKPTPRSS CVRAQNPETS AAMATGGDRA AEELVSVEMP APEGWTKKFT
PQSRGRSEIV FVSPTGEEIK NKRQLNSYLK ANPGGPTSSE FDWSTGDTPR RSARISEKVK
VFDSPEGEKI PKRSRNSSGR KGKQEKKEDP ETEEDKEAEA GKEAPSEDVA KSTDVEMTVA
EAIDAAKSTD VEMKPAEAND AAKNADVEMK VAEEVKAAPS EDAGKTEDST PAPVEHKEDV
KPAESDVAPA PAAEDKKEDV KPAETDAPPA PTVEDKKEDV KPAEADAPPA PAVEDKEDAK
PAEADAPPAP AVEDKEDAKP AEADAAPAPA VEDKKEDVKP SEADAAPLVS SEKVKTEEAP
LVSSEEVKTE EAPPVSSEGA KTEEVAPPAS KPTENSVAAP SEPAIAPAPA AVSETKSDAA
AVDSQPGAAT NESPSAVNNG QLSPGASTVK CT
//