ID A0A2U1NNI8_ARTAN Unreviewed; 319 AA.
AC A0A2U1NNI8;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE SubName: Full=Homeobox protein knotted-1-like 2 {ECO:0000313|EMBL:PWA75010.1};
GN ORFNames=CTI12_AA247340 {ECO:0000313|EMBL:PWA75010.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA75010.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA75010.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA75010.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108}.
CC -!- SIMILARITY: Belongs to the TALE/KNOX homeobox family.
CC {ECO:0000256|PROSITE-ProRule:PRU00559}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA75010.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01002476; PWA75010.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1NNI8; -.
DR STRING; 35608.A0A2U1NNI8; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR005539; ELK_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR008422; Homeobox_KN_domain.
DR InterPro; IPR005540; KNOX1.
DR InterPro; IPR005541; KNOX2.
DR PANTHER; PTHR11850:SF279; HOMEOBOX DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR11850; HOMEOBOX PROTEIN TRANSCRIPTION FACTORS; 1.
DR Pfam; PF03789; ELK; 1.
DR Pfam; PF05920; Homeobox_KN; 1.
DR Pfam; PF03790; KNOX1; 1.
DR Pfam; PF03791; KNOX2; 1.
DR SMART; SM01188; ELK; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM01255; KNOX1; 1.
DR SMART; SM01256; KNOX2; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51213; ELK; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
PE 3: Inferred from homology;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000245207}.
FT DOMAIN 216..236
FT /note="ELK"
FT /evidence="ECO:0000259|PROSITE:PS51213"
FT DOMAIN 236..299
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DNA_BIND 237..300
FT /note="Homeobox; TALE-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 39..64
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 319 AA; 37364 MW; 10161A978F1D1B74 CRC64;
MMLMKMIMII DRSLRPNIQI MEEMYGLTNF QQGGVQQDNY QPRRMYGSGS NIQTTSSNDK
QQNNNELAEE LEDDLLRENR AKILSHPLYP KLLQVYIACQ KVGAPTDIAN LFDEIFKDND
FSRRSSSCSC LGDDPELDEF METYCEMLDK YRSDLARPFD EATTFLNNMQ TQLNNLCKGT
TVTYNTDESV ERSEEDLISR GETEVLKANW THEDRALKEK LLRKYSKYIS SLKHEFSNKK
KKGKLPKEAR QVLLDWWNIH YRWPYPTEAD KIALAESTGL DQKQINNWFI NQRKRHWKPS
ENMQFAVMDT LCGSFLVNN
//