ID A0A2U1PCL2_ARTAN Unreviewed; 1617 AA.
AC A0A2U1PCL2;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=Homeodomain-like transcriptional regulator {ECO:0000313|EMBL:PWA83504.1};
GN ORFNames=CTI12_AA124010 {ECO:0000313|EMBL:PWA83504.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA83504.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA83504.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA83504.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PROSITE-ProRule:PRU00108, ECO:0000256|RuleBase:RU000682}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA83504.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01001342; PWA83504.1; -; Genomic_DNA.
DR STRING; 35608.A0A2U1PCL2; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR007759; Asxl_HARE-HTH.
DR InterPro; IPR018501; DDT_dom.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR044977; RLT1-3.
DR InterPro; IPR028942; WHIM1_dom.
DR InterPro; IPR028941; WHIM2_dom.
DR PANTHER; PTHR36968; HOMEOBOX-DDT DOMAIN PROTEIN RLT2; 1.
DR PANTHER; PTHR36968:SF5; HOMEOBOX-DDT DOMAIN PROTEIN RLT2; 1.
DR Pfam; PF02791; DDT; 1.
DR Pfam; PF05066; HARE-HTH; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF15612; WHIM1; 1.
DR Pfam; PF15613; WSD; 1.
DR SMART; SM00571; DDT; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS50827; DDT; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51913; HTH_HARE; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Homeobox {ECO:0000256|PROSITE-ProRule:PRU00108,
KW ECO:0000256|RuleBase:RU000682};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW Transcription {ECO:0000256|ARBA:ARBA00023163}.
FT DOMAIN 24..84
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 522..581
FT /note="DDT"
FT /evidence="ECO:0000259|PROSITE:PS50827"
FT DOMAIN 702..771
FT /note="HTH HARE-type"
FT /evidence="ECO:0000259|PROSITE:PS51913"
FT DNA_BIND 26..85
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 1..26
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 78..150
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 247..275
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 806..831
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1394..1480
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1568..1617
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 369..438
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 251..275
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 806..827
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1451..1466
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1568..1584
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1617 AA; 179715 MW; 5F4D0099D227B221 CRC64;
MDGGDGVVGG GESGAAAGGG SEGGEVKVKR KMKTAFQLEV LEKTYAGEQY PSEELRGELS
KQLGLSDRQL QMWFCHRRLK EKKPQPEKRS KKGISSSAGV GDVMVVSGEG LGGNEVGGSG
GSGSGSSPFG HGERGVSRAG GPGVSGSVSG GGRVAVAAST EMPSVKMYYE SPQAVSEARA
VAFVEAQLGE RLRDDGPILG MEFDPLPPGA FGAPIVQQNL AVRSYDAKPF DAKPVKGGVK
AVHEYQFLPE QPSGRSDSYE RATTSQYHGS PIDAASSRTH ITPNSGFGFK GPVNLLPQQG
IETPTSVAAH PITGIENPFA TPVTHEEAMA RIARKRKSEE ARIAKEVEAN QKRIRKELEK
QDVLRRKREE QMRKEMERHD RERRKEEERL LREKQREEER YQREQRREME RRMKFLQKES
MRAEKLRLKE DMRKEKEAAR LKAANDRAAA RRLAKESLEL IDDERLELME IAASNKGLPS
MLSLDSEALE SLESLRDMLP EFPPKSVHLK RPFKFEPWTD SEENIGNLLM VWKFLITFAD
VLGLWPFTLD EFVQAFHDHD PRLLGEIHVA LLKLIVKDIE DVARAPSSLA ANQIGPPNPG
GGHPQIVEGA FAWGFDICSW QRHLNPLTWP EILRQFALAA GYGPKLKKRN VDRELPHEEN
EGVDGGDMIS NIRSGAAAKS ALAKMQARGF SNRRSRHRLT PGTVKYAAFH VLSLEGSRGL
SILDVADKIQ KSGLRDLTTS KTPEASIAAA LSRDTKLFER TAPSTYCLKS PYRRDPADGE
AIINAAKEKI HTFKNGLLDG EEAFDAERDD AEKEEDSDSD VPDDPEGDDI GILSVSDVQS
ETVGDIKVES LVDQPNIIGV NHVTNKGVAA PDEDAVVDES ISGEVWVQGL MEGEYSDLTV
EERLNALVAL ISVANEGNSI RVLLEERLET AMAIKKQMLA DAQVDRRRMR EDFMIKIQYP
SVDDRIDYMS NNHQENVGDL HDDLGNLNER LLEPAYAAEK SRAQLKFFIS HKAEELYVYR
SLPLGQDRRR NRYWQFITSA SQNDPGCGRI FVELCDGRWR LIDNEESFNV LLASLDVRGN
REAQLHSMLQ RIEARFKEFV SKMWVDVPEA GPMGSGSPSS TVSIPTTDAS EFSSTFAIEH
GRNGNEMTNA VKRYQNFEKW MWRECLNLRA MKFGSTRCEN ILEVCDNCLD LSFFENGQCS
SCHKLCETFF GSNLAFAKHL SRFKEKLKSE SVLCFHDRES GLPVRFRLIK ALLALIEASL
PLEALQPSWT DECRKTWCVK LVNAVTVDAL LEALTLLESS IKRDYLSLDF ETTDELLGSD
NSAALLASKD CSTVSVLPWL PETTSSVALR LMELDTSLYY LLSQKEDAEK DKEATNVLTL
HPKFAAVMKN GEEGDQGESL YGSGNLHDPW ADPVTSGRGR GRGRGRGRAR GGRSQRSVAA
SGSRSNNPKG RPRGRGGRKR GRRGGQTKPH NTVQITEHDS SRDYLYEESP VVGFQDWNAE
EIADFHATEN ASSSEYEIEN ENENTGVDEY DDDMMADDGY HQKAYNNIPS REYIGEYGLE
EDDEVMDDYE DEDEQAGVDV DGFFNDDADD NRDLDVGGGH VGNPGDETEL SSSGYSD
//