ID A0A2U1LWI7_ARTAN Unreviewed; 663 AA.
AC A0A2U1LWI7;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=DNA-binding domain-containing protein {ECO:0000313|EMBL:PWA53330.1};
GN ORFNames=CTI12_AA445810 {ECO:0000313|EMBL:PWA53330.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA53330.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA53330.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA53330.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA53330.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01007455; PWA53330.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1LWI7; -.
DR STRING; 35608.A0A2U1LWI7; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR038945; MBD13-like.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR PANTHER; PTHR34067; OS04G0193200 PROTEIN; 1.
DR PANTHER; PTHR34067:SF20; OS08G0206700 PROTEIN; 1.
DR Pfam; PF01429; MBD; 4.
DR SUPFAM; SSF54171; DNA-binding domain; 5.
DR PROSITE; PS50982; MBD; 4.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000313|EMBL:PWA53330.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 106..180
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 206..282
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 338..409
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 442..515
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT REGION 33..116
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 169..193
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 292..332
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 401..437
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 537..600
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 637..663
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 44..112
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 401..418
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 422..437
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 568..591
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 663 AA; 72706 MW; 9BFFCEF4EF6730F3 CRC64;
MSSDDWPEWL PGDWTVQIRK IDQKKVKCYV DPQGHKCYSK PQVLDHLSKT NKSTANESAH
TNDGPESTPR SRSARRATGS GNNTPNGTDG TLVEAGDRSS SEPHTSPNEV GMSNWLPDGW
TEEVKIRKGG RSAGTKYRIY TDPITGQKFF SRPQVQKYLG TLNDSPAVVK TPTDNAEKRQ
SATEISQSGS VKKESASARF SSEYLVVSRT TTVEGLPEGW IKEVRTRKRG GANRKDPFYL
DPSSDYAFFS KKDALRYLET GDVTKCVIKP VRRNADDSPD VTLTLTEGLA SPTIPLKGTE
TSEAVPNGDE NGSQNKTPAR SISGGTFPTP DTKEIRSISG GTVTSDWLPE GWTVEVLAKA
TGQKYKVFKE TATGKKFYSK PQVLKYLGIA DDSSISNKRK ETALSVTPVS SVAPTSAEGS
QGKRPKRSKT KKDDTQNLDF TEEITTTAAD GLPAGWIKET RTKIFATHKR TDPFYTDPAT
GYIFRSKLDA LRFLETGDVN ICAIRPKVKD KDGNEVFVYT HDVQKPGQAT TGEQLLEGKE
DVPTDGAVPM TQVPARGRGR PPKLNSRSSK RQKGINLETE PSLNPAEDNE TETGLNLEKQ
ADDERLSFQI EEAENWTDQC LDFAVKTLTD EILFNGQPAS SSLQDGNGEV DNGVKETPTQ
ANQ
//