ID A0A2U1N4W0_ARTAN Unreviewed; 688 AA.
AC A0A2U1N4W0;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 24-JAN-2024, entry version 15.
DE RecName: Full=NUC153 domain-containing protein {ECO:0000259|Pfam:PF08159};
GN ORFNames=CTI12_AA303820 {ECO:0000313|EMBL:PWA68558.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA68558.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA68558.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA68558.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the ESF1 family.
CC {ECO:0000256|ARBA:ARBA00009087}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA68558.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01003610; PWA68558.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1N4W0; -.
DR STRING; 35608.A0A2U1N4W0; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0006364; P:rRNA processing; IEA:InterPro.
DR InterPro; IPR039754; Esf1.
DR InterPro; IPR012580; NUC153.
DR PANTHER; PTHR12202:SF0; ESF1 HOMOLOG; 1.
DR PANTHER; PTHR12202; UNCHARACTERIZED; 1.
DR Pfam; PF08159; NUC153; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|ARBA:ARBA00023054};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207}.
FT DOMAIN 592..616
FT /note="NUC153"
FT /evidence="ECO:0000259|Pfam:PF08159"
FT REGION 1..168
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 245..270
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 395..461
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 475..688
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 7..44
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..110
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 111..125
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 137..168
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 245..260
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 407..422
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 512..550
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 574..590
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 651..666
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 688 AA; 78149 MW; 181D4859BA54C5C9 CRC64;
MAPKNKKNKN KNVSTSSMAA PSATRVYGDD TNAITDSRFI QSQKDPRFQE VPKHKNKVKI
DSRFKRMLTD KHFTSSSSRV DKRGRVKKND DSTVNNLKEY YRIENDSELA KNEGNEDEDE
VESEESDEKL LKSGSEESGD EDDDEEEEVE EVDDDYNSTD TDEDEGGYLE MEEDGLQIEE
ENVPEIDKET HRLAVVNLDW SQVQAVDLFV VLNSFLPKSG QILSVSVYPS EFGLKRMEEE
AVKGPVGLFD DENGKNTKDD SDESDDDEEI DNEKLRAYEL SRLRYYFAVV VCDSVATADY
LYKSCDGIEF ERSSNKLDLR FIPDSMEFKH PARDVATEAP ANYEGIDFQT RALQQSKIDL
TWDENEPQRS KKLDKINADE ESEYVKYENE LKDFIASSDS ESDEDDSDGN KPGKRQKTDR
YRALLQSGDG SDDNEDDEGM DMEVTFNTGL EDLSNKILEK KKDKNSETVW DAYLRKKKEK
KKARKNKSKD SSDDESDNSD GDMIEEAGDF FAGDTTQKKK ASRDKKAQKS ETGLSKEEAE
ASRAELELLL ADDDGGDANV KGYNMKRKKA KGKKGKQDDI DEEKIPTVDY EDPRFSSLYT
RPEYALDPTD PQFKRSAAYV RQVAHKQNKD DVAEKGGQPN DSPVEAQAAE PKKTNSKKDH
ETSMLIKSIK MKSQQLTSDA KKSRRNAK
//