ID A0A2U1N5Z1_ARTAN Unreviewed; 791 AA.
AC A0A2U1N5Z1;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE SubName: Full=NIN-like protein {ECO:0000313|EMBL:PWA68930.1};
GN ORFNames=CTI12_AA301100 {ECO:0000313|EMBL:PWA68930.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA68930.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA68930.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA68930.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA68930.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01003537; PWA68930.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1N5Z1; -.
DR STRING; 35608.A0A2U1N5Z1; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR InterPro; IPR045012; NLP.
DR InterPro; IPR000270; PB1_dom.
DR InterPro; IPR003035; RWP-RK_dom.
DR PANTHER; PTHR32002:SF35; PROTEIN NLP6; 1.
DR PANTHER; PTHR32002; PROTEIN NLP8; 1.
DR Pfam; PF00564; PB1; 1.
DR Pfam; PF02042; RWP-RK; 1.
DR SMART; SM00666; PB1; 1.
DR SUPFAM; SSF54277; CAD & PB1 domains; 1.
DR PROSITE; PS51745; PB1; 1.
DR PROSITE; PS51519; RWP_RK; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000245207}.
FT DOMAIN 629..714
FT /note="RWP-RK"
FT /evidence="ECO:0000259|PROSITE:PS51519"
FT DOMAIN 709..789
FT /note="PB1"
FT /evidence="ECO:0000259|PROSITE:PS51745"
FT REGION 24..69
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 618..640
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 24..43
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 54..69
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 791 AA; 88777 MW; 7380EAEE2114CD88 CRC64;
MSATLKVPTL GVVACGVVAW TRKKGANRDQ HSHDQHSRDQ RARSGGLRSG GLDLKRGANR
DQRDVDSSKG GVLLRSHIAQ AQAATVYMYF ESCPVSMALF GQSQSVDSAY REKPSCSGPR
SVLSTDQNKD LLWNKVSLEP PTVSFSGESR YLTPLWVSRK EEPLNELYEI IRAVLKQLVL
RAQHVLVQFW SPRVVGKRQI LTTLDQPFGF GLPVEELYFY RKESEQNLFL VCKDDEEQDI
SPPARVFRRG LPEWTPDVTN YLPKHFPQQD CAIRCNLHGY LALPIFDSTT PLCIGVLELL
MSSKNTDYTF EVQQVHKALK LQNLTCPQTF DCATPQVLSE CRQNVWDKIR CILKSVCDIH
KLPLAQTWAV SPHNSFVSHE KTLQKSCSSF DTKCIGKVCM STTSLPFYVR DMGMWPFREA
CKMQHLDKSR SFVGRAMLAH GSSYCEDITQ LCEDEYPLVY NARMSGLTGC FTIFLHSIEG
DNGDYVLEFF LPLNSKDSRH VLNLVQTLKQ MIVVASGFEL GEISPIQITE SPRDETCLSL
SVEPQSIHIS STTTTKTLAF GMDSTDSESV LANVVKTDSA DGQSQCSSKE NYTNDMSDNV
NIVNSRENDN AASYSIVTNQ NPSDTITDAG EKSKKRGRKR KIDSLTMEAV VQHVGKPISQ
AAESFGVCRS TLKRFCRENG ILSWSKLCQS KKTSCDTESK SESIYQVAQL MVKATFKGDM
IKFRFPISSG LLELENEVAQ RLDLKGKTLI IKYKDEENDW LRITCDDDLQ SLPEFLASNT
TIRLIVELAS N
//