ID A0A2U1L820_ARTAN Unreviewed; 529 AA.
AC A0A2U1L820;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 03-MAY-2023, entry version 16.
DE SubName: Full=Cleavage stimulating factor 64 {ECO:0000313|EMBL:PWA45149.1};
GN ORFNames=CTI12_AA513740 {ECO:0000313|EMBL:PWA45149.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA45149.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA45149.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA45149.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA45149.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01010913; PWA45149.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1L820; -.
DR STRING; 35608.A0A2U1L820; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0031124; P:mRNA 3'-end processing; IEA:InterPro.
DR CDD; cd12398; RRM_CSTF2_RNA15_like; 1.
DR Gene3D; 1.25.40.630; -; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 1.10.20.70; Transcription termination and cleavage factor, C-terminal domain; 1.
DR InterPro; IPR025742; CSTF2_hinge.
DR InterPro; IPR026896; CSTF_C.
DR InterPro; IPR038192; CSTF_C_sf.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR PANTHER; PTHR45735; CLEAVAGE STIMULATION FACTOR SUBUNIT 2; 1.
DR PANTHER; PTHR45735:SF2; RRM DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF14327; CSTF2_hinge; 1.
DR Pfam; PF14304; CSTF_C; 1.
DR Pfam; PF00076; RRM_1; 1.
DR SMART; SM00360; RRM; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR PROSITE; PS50102; RRM; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00176}.
FT DOMAIN 7..85
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT REGION 83..121
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 246..333
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 362..410
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 246..260
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 267..333
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 362..384
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 529 AA; 57349 MW; 5660E014BCFD8B3B CRC64;
MSNPQHRCVF VGNIPYDATE EQLVQICEEV GPVVNFRLVI DRETNKPKGY GFCEYKDEET
ALSARRNLHG YDINGRQLRV DFAENDKNSD RTREQGRVGP GLPTNSAPQK QVGGPAVQGD
PSFHQPIGHQ VAMAAAVVMA GALGGGQSNS NISQNGIQSQ PTLGTDPLTL HLAKIPKAQL
TEVLNEVKTM ATQNKEQARQ LLMAHPQLSK AILQAHIMLK SVPPHMLQMP NARPLPGQHT
HLPVQVGQQS SIQPLSGLPP LAQNKVQPGF MPQSQSQPST SQYSAPSFPL QPRFQTPLPG
QHQVLQNPTV SGLPATGNLQ TLHPQHPGSL PNRVENQLAT SSSMLQYPGQ HASANLRHNS
PLVSSIQPSI PSHQPSSSYV SAIPNNGKKD HDNPQHLMQN PAWGRKVDPH PNMASRLHEN
ASFVNDRDQI GHPSKLLKME DGRAASLASA DVNLSASVMG PSQAVPLSGN PIPKAAAVDT
QKQNHELPPG IDSDLLQQVL NLTPEQLSSL PPDQQQQVIQ LQQMLRQSK
//