ID A0A2U1N1Z6_ARTAN Unreviewed; 1016 AA.
AC A0A2U1N1Z6;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 15.
DE SubName: Full=NIN-like protein {ECO:0000313|EMBL:PWA67541.1};
GN ORFNames=CTI12_AA319280 {ECO:0000313|EMBL:PWA67541.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA67541.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA67541.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA67541.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA67541.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01003815; PWA67541.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1N1Z6; -.
DR STRING; 35608.A0A2U1N1Z6; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR InterPro; IPR045012; NLP.
DR InterPro; IPR000270; PB1_dom.
DR InterPro; IPR003035; RWP-RK_dom.
DR PANTHER; PTHR32002:SF49; NIN-LIKE PROTEIN; 1.
DR PANTHER; PTHR32002; PROTEIN NLP8; 1.
DR Pfam; PF00564; PB1; 1.
DR Pfam; PF02042; RWP-RK; 1.
DR SMART; SM00666; PB1; 1.
DR SUPFAM; SSF54277; CAD & PB1 domains; 1.
DR PROSITE; PS51745; PB1; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000245207}.
FT DOMAIN 938..1016
FT /note="PB1"
FT /evidence="ECO:0000259|PROSITE:PS51745"
FT REGION 889..932
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 909..928
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1016 AA; 116201 MW; ED36236148250A0E CRC64;
MNYQEGRYAH VHRDSCTRYQ KCITTSQYKI LIQKSYQNSC KHSLMVSNLN TIINSGLSVY
NFQDRIPYKT INGLKLARDQ IQEALKIVCQ SNLVTFAQVW IASLDENHVH FSSSWEEAQT
RRLLGLKLTG YNAIFANPAR YCWTFQNYYN ACDLIPIQPG KELFLKTLQD SEPRFCKNIS
QLGNFKFYRE PGSAGVGFTI CMRSIHTGDF NYVFEFLWKH YSFYPILLEE ILLDIKRCLP
GFKFASGKEI GDKLNVIEVD YSTDKEIKKF DIFQFKSLSP IPEGEVGNEQ VAVDYISPLE
TNCKTAPVLI PQQVIEQQFG TKDFLAAENI CGDGNRMADR DDERLSEFLE TLPTYILHKK
PEIKGSLWVF CSKDEGSQKN LSSDDLGTRM ICDKIKSAFS KVKNDEKLIV QFWAPVTSGN
RRVLSTSGQP FAISKYSEGL KEYRGRCVGY ECDIDMNNDN NKVIKNEIEQ HCDGNPMTTM
SGPPTNAFLN RLPEAVLDTK NHREDSLMRY ASECRLWTSY TLPIGCPSQY HSSCIGVVEC
SSISSINLEI MNTMRRALEQ EGLNVFNVQD RIPYSAINGL TLVRDEIQVA LKIVCESHKI
CFAQVWISYE DENHMPFSFS SEDTQTTCRL ALKLTGYNSV DENSTASHWR FKEYYDACDM
VPLKMGEELV EKTLQDNQPR FCENISQLGT DMLMAWVSTD DVACSGFTIC MSIDTGDFSC
AFEFIWRHNP DYVLLLEVLL LALKRHLPRF KFASGAELGD QLHVIDVENS TKSETKFFEI
FKEKRLSPIR EAIDKGKNAI DVNYNTPILE AMNKGKKAIV VNCNAPIPEE KNKGKKAIVV
NYNALSKEKR VTTEIELSRE EIEQHYGKTE KQAAKELRVS LSTLKRKRNK LGMSGWQGPN
LPQRKAYNSN KNRSKESHTH EKDNGVIQDP SPIKRNENAV IKAEYADDII KLHLGISEAT
FVTVENEIGK KFKLKQGTFK IKYLDEDEEW ILMTSDQDLS DCIQNSRRLR VLLHNQ
//