ID W1NE73_AMBTC Unreviewed; 365 AA.
AC W1NE73;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE RecName: Full=MBD domain-containing protein {ECO:0000259|PROSITE:PS50982};
GN ORFNames=AMTR_s00004p00169140 {ECO:0000313|EMBL:ERM93673.1};
OS Amborella trichopoda.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Amborellales; Amborellaceae; Amborella.
OX NCBI_TaxID=13333 {ECO:0000313|EMBL:ERM93673.1, ECO:0000313|Proteomes:UP000017836};
RN [1] {ECO:0000313|Proteomes:UP000017836}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24357323;
RG Amborella Genome Project;
RT "The Amborella genome and the evolution of flowering plants.";
RL Science 342:1241089-1241089(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI397628; ERM93673.1; -; Genomic_DNA.
DR RefSeq; XP_006826436.1; XM_006826373.2.
DR AlphaFoldDB; W1NE73; -.
DR STRING; 13333.W1NE73; -.
DR EnsemblPlants; ERM93673; ERM93673; AMTR_s00004p00169140.
DR GeneID; 18421624; -.
DR Gramene; ERM93673; ERM93673; AMTR_s00004p00169140.
DR KEGG; atr:18421624; -.
DR eggNOG; ENOG502RYIM; Eukaryota.
DR HOGENOM; CLU_759416_0_0_1; -.
DR OMA; HEERKEY; -.
DR OrthoDB; 622866at2759; -.
DR Proteomes; UP000017836; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR CDD; cd01396; MeCP2_MBD; 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR039622; MBD10/11.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR PANTHER; PTHR33729; METHYL-CPG BINDING DOMAIN CONTAINING PROTEIN, EXPRESSED; 1.
DR PANTHER; PTHR33729:SF6; METHYL-CPG-BINDING DOMAIN-CONTAINING PROTEIN 10; 1.
DR Pfam; PF01429; MBD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR PROSITE; PS50982; MBD; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000017836};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 21..91
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 73..210
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 244..365
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 94..117
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 144..210
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 267..293
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 306..330
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 365 AA; 39127 MW; C6E24729030D1E34 CRC64;
MATLVERDEG MVQETHDNND KGEVVSVELP APQGWKKKFI PKKGGTPKRN EIVFVAPTGE
EIRNKKQLDQ YLKAHPGGPA VSEFDWGTGD TPRRSARISE KAKALDFPES EPKSKRARKS
SSSKKGPKAK KANDEENEAL EEEADKEKSD VADAAGVGKE HDASKEVEMQ DAEDGRGKVD
NKVIAGEESA KHEVDTVPKP AEDGSIKVED KVTTVEENAG NEGDSGTKLV GDAGKSVVEK
VGGSCDTNYM ESMKDKGVTT TEDVGPGDSK ANEEVKENVL PDDHNKEEKN ATADCEVGSV
NTDDAKAFEA TKESAATEEK SEEEKGLPEN GATTREGGKE GSTTKDMHSL NCADGQRPEP
SPVSC
//