ID A0A2U1QBP5_ARTAN Unreviewed; 687 AA.
AC A0A2U1QBP5;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE SubName: Full=START domain, Homeodomain-like, START-like domain protein {ECO:0000313|EMBL:PWA95428.1};
GN ORFNames=CTI12_AA050080 {ECO:0000313|EMBL:PWA95428.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA95428.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA95428.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA95428.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- SIMILARITY: Belongs to the HD-ZIP homeobox family. Class IV subfamily.
CC {ECO:0000256|ARBA:ARBA00006789}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA95428.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01000242; PWA95428.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1QBP5; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0008289; F:lipid binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR CDD; cd08875; START_ArGLABRA2_like; 1.
DR Gene3D; 3.30.530.20; -; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR042160; GLABRA2/ANL2/PDF2/ATML1-like.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR023393; START-like_dom_sf.
DR InterPro; IPR002913; START_lipid-bd_dom.
DR PANTHER; PTHR45654; HOMEOBOX-LEUCINE ZIPPER PROTEIN MERISTEM L1; 1.
DR PANTHER; PTHR45654:SF104; HOMEOBOX-LEUCINE ZIPPER PROTEIN-RELATED; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF01852; START; 1.
DR SMART; SM00389; HOX; 1.
DR SMART; SM00234; START; 1.
DR SUPFAM; SSF55961; Bet v1-like; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS50848; START; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054, ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207}.
FT DOMAIN 15..75
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 198..430
FT /note="START"
FT /evidence="ECO:0000259|PROSITE:PS50848"
FT DNA_BIND 17..76
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..26
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 74..101
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 687 AA; 75463 MW; AEBA5E3A05DE1817 CRC64;
MEAPSGDDQD PSQRPNKKKR YHRHTQHQIQ ELEAFFKECP HPDDKQRKEL GRRLTLEPLQ
VKFWFQNKRT QMKAQHERHE NSSLRNDNEK LRMENIRYKE ALANATCPNC GGPAAIGEMS
FDEQQLRMEN ARLRDEIDRI SGIAAKYVGK PLLTYPNLSQ NGPPRSLDLP IASFNSQPSM
VDDMFGTSNL LRSVSGPSEA DKPVIIELAV AAMEELVRMA QSGEPLWIPS SDNSSETLNE
DEYSQTFPRG IGPKQMGLQS EASRESQVVI MNHISLVEIL MDVNQWSNVF AGIVSRAMTL
EVLSIGVAGN YNGALQVMTA EYQVPSPLVP TREYYFVRYC KQHADGTWAV VDVSLDNLRP
GVMSRSRRRP SGCLIQELPN GYSKVTWVEH VEFDDRAVHD IYRSLVNSGL AFGAQRWVAT
LERQCERLAS AMANNIPAGD VGVIPTLEGR KSMLKLSERM VLSFCSGVGA STTHTWTTLS
GSGADDVRVM TRKSVDDPGR PPGIVLSAAT SFWIPVPPKR VFDFLRDENS RSEWDILSNG
GLVQEMAHIA NGRDPGNCVS LLRVNSANSS QGNMLILQES SSDSTGSYVI YAPVDIAAMN
VVLSGGDPAY VALLPSGFAI LPDGAGKHQG GRIVEVGNGG SLLTVAFQIL VDSVPTAKLS
LGSVATVNNL IKCTVERIKA AVALENP
//