ID A0A2U1PK29_ARTAN Unreviewed; 403 AA.
AC A0A2U1PK29;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 03-MAY-2023, entry version 15.
DE SubName: Full=Nucleotide-binding, alpha-beta plait {ECO:0000313|EMBL:PWA86072.1};
GN ORFNames=CTI12_AA144300 {ECO:0000313|EMBL:PWA86072.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA86072.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA86072.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA86072.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA86072.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01001058; PWA86072.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1PK29; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd21618; RRM_AtNSRA_like; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 2.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR10501:SF73; RRM DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR10501; U1 SMALL NUCLEAR RIBONUCLEOPROTEIN A/U2 SMALL NUCLEAR RIBONUCLEOPROTEIN B; 1.
DR Pfam; PF00076; RRM_1; 1.
DR Pfam; PF00098; zf-CCHC; 2.
DR SMART; SM00360; RRM; 1.
DR SMART; SM00343; ZnF_C2HC; 3.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR PROSITE; PS50102; RRM; 1.
DR PROSITE; PS50158; ZF_CCHC; 3.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00176};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 137..223
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 323..338
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 344..359
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 362..377
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT REGION 1..37
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 232..301
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 380..403
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 249..301
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 403 AA; 45423 MW; 113D977093574CB9 CRC64;
MADPYWRYAA DRGSGPPPTY PGYIPSEPST LSSQPLWTSH NHVSSSSDFL QNDILSSRPR
PYGVDDYMGI RRPETGISGY AAGTSRNGHP PVWEDPYPLG RRDVMQGIRP EIPDIVNERP
ASLRMADGPP VGVRESNVLF VDALPSDCSR REVSHLFRPF LGFQELRLVH KEPRHGADRG
LVLCFVEFSD SKHALTALEA LQGYKFDNKK PDSPALKIQF AHFPFQLPPD REEQRHANSF
QLPSNREDQG RLTPRREPSI EDEEPRGRKQ FEKRKFEGSP QTSKKRKNVE STADSSDSRK
GRSFCIKCNR RHFGECKADL KGCFSCGKLD HKSWDCPSES AKACFNCGKS DHKSKDCRNK
LCHGCKEPGH LAAECPKLNF NGSKREKKGN ASEPMVKDEP QKT
//