ID A0A2U1PC30_ARTAN Unreviewed; 730 AA.
AC A0A2U1PC30;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE SubName: Full=Basic helix-loop-helix leucine zipper transcription factor {ECO:0000313|EMBL:PWA83315.1};
GN ORFNames=CTI12_AA166840 {ECO:0000313|EMBL:PWA83315.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA83315.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA83315.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA83315.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA83315.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01001363; PWA83315.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1PC30; -.
DR STRING; 35608.A0A2U1PC30; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0046983; F:protein dimerization activity; IEA:InterPro.
DR CDD; cd11445; bHLH_AtPIF_like; 1.
DR Gene3D; 4.10.280.10; Helix-loop-helix DNA-binding domain; 1.
DR InterPro; IPR011598; bHLH_dom.
DR InterPro; IPR036638; HLH_DNA-bd_sf.
DR InterPro; IPR047265; PIF1-like_bHLH.
DR InterPro; IPR044273; PIF3-like.
DR PANTHER; PTHR46807; TRANSCRIPTION FACTOR PIF3; 1.
DR PANTHER; PTHR46807:SF1; TRANSCRIPTION FACTOR PIF3; 1.
DR Pfam; PF00010; HLH; 1.
DR SMART; SM00353; HLH; 1.
DR SUPFAM; SSF47459; HLH, helix-loop-helix DNA-binding domain; 1.
DR PROSITE; PS50888; BHLH; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 672..721
FT /note="BHLH"
FT /evidence="ECO:0000259|PROSITE:PS50888"
FT REGION 95..122
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 415..442
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 549..591
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 604..686
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 239..280
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 95..109
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 604..631
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 632..659
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 730 AA; 81313 MW; 2312C5A6764765D6 CRC64;
MPGTYVAKPD VVKFSTKPIL PSPNALYRKL DLDKAHGHIL RYGFLPGYTE WTVHGEHTIS
LAPSQSSYVN VEETSLGQED IIGLVRDALG INSLPSDNTQ LGDTTMEGDT GESTKADDHG
DEGVSYKKLL EECDKELSPN DLESFDICYK TTDDTYIQEA TAEMMVIANQ EISRKKLELV
GPEGNIEPAL EAEIAREVLN KLFGNEEPRC FGAGVTKSQI TKFCCDLRMM RGEVLANENR
FLLEKVDNQS KEIATQKKQL ETQKNKVESY SKQVNTLVSQ LNNMGQQLNE VYGMLKVFQT
AFPDLYNTAS TSAAASTCDK QPSSSASPIM DHYSPVMDHY PELIFIKIKA IKNWLTSWLV
FMPICSCRNE RLYLLRRKTG HQMTKLLITA FSHDWGGDED IMELLWHNGQ VVMQSQNQRS
SGSKKLETKP AVRSAEQTAH QTGPSDLFMQ EDEISSWLHY PIEYPANENS LEGYIYNNDL
LFPTPPPNPV TTAPITPTTL LPPPPPSVVV PSPRPPVAPI WRNRVDIQPQ SRQQPKYPNF
LHFSRPNKAR TLESGPSAPV TEAPESRASR VSEKPPPISA GGESVSGVGL VGTSSMGREV
ETCDTSMMSS PDGSGASGSI EPSTQMPPPS TNDRKRKGRD TEDTECHSED VECEYPDAKK
QSHGSTSTKR SRAAEVHNLS ERRRRDRINE KMKALQELIP RCNKSDKASM LDEAIEYLKS
LQMQVQVNKP
//