ID A0A2P6TF03_CHLSO Unreviewed; 1300 AA.
AC A0A2P6TF03;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=Nucleolar MIF4G domain-containing 1 isoform A {ECO:0000313|EMBL:PRW32550.1};
GN ORFNames=C2E21_8247 {ECO:0000313|EMBL:PRW32550.1};
OS Chlorella sorokiniana (Freshwater green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX NCBI_TaxID=3076 {ECO:0000313|EMBL:PRW32550.1, ECO:0000313|Proteomes:UP000239899};
RN [1] {ECO:0000313|EMBL:PRW32550.1, ECO:0000313|Proteomes:UP000239899}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=UTEX 1602 {ECO:0000313|Proteomes:UP000239899};
RX PubMed=29178410; DOI=10.1111/tpj.13789;
RA Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA Barney B.M.;
RT "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL Plant J. 93:566-586(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus, nucleolus
CC {ECO:0000256|ARBA:ARBA00004604}.
CC -!- SIMILARITY: Belongs to the CWC22 family.
CC {ECO:0000256|ARBA:ARBA00006856}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PRW32550.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHPG02000019; PRW32550.1; -; Genomic_DNA.
DR Proteomes; UP000239899; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0005730; C:nucleolus; IEA:UniProtKB-SubCell.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR Gene3D; 1.25.40.180; -; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR003891; Initiation_fac_eIF4g_MI.
DR InterPro; IPR003890; MIF4G-like_typ-3.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR PANTHER; PTHR18034; CELL CYCLE CONTROL PROTEIN CWF22-RELATED; 1.
DR PANTHER; PTHR18034:SF4; NUCLEOLAR MIF4G DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02847; MA3; 1.
DR Pfam; PF02854; MIF4G; 1.
DR Pfam; PF00076; RRM_1; 1.
DR SMART; SM00544; MA3; 1.
DR SMART; SM00543; MIF4G; 1.
DR SMART; SM00360; RRM; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR PROSITE; PS51366; MI; 1.
DR PROSITE; PS50102; RRM; 1.
PE 3: Inferred from homology;
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000239899};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00176};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1063..1087
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1223..1243
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1264..1291
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 28..106
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 804..920
FT /note="MI"
FT /evidence="ECO:0000259|PROSITE:PS51366"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 100..483
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 102..123
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 140..209
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 210..240
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 241..261
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 276..299
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 300..321
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 347..376
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 408..436
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 437..453
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1300 AA; 140309 MW; 8256175F77B4C3F9 CRC64;
MSYRDRSRSR SRSRERGRGR EDRGGERVSV LVRNLPLDAR AEEVRAKFER YGPVRDIYLP
KDFYSGRPKG FGFIEFLDRR DGEEAIYNLD RTMFGNREIQ VCMSKEGRKT PRDMMDRERG
PGRGPPPPRG GGGYGGGRGR SRSPRRRSRS RSRSRDYRRR SRSRSRSRSR DYRRRSRSRS
ADRRRRSPSR SRSPRRSPAR SRSRSPSKSP RRSPARSVSR SPSRGRSRSP SRSPSRSPRR
SASPAREENG AGDDKFGELL PDHLKAAAGL DPEMQLQRDL ARKLGLKRGK TKMGGEDGLD
EFLDELGGID DLFEGGQDDD GAADEAPPGR QTGQKRKAAA AAAAADSGSE EEEEGGFSEE
DDEGSDGLDG LLDSSEEEGE EHGLTSGSSE GEEGASGGSG SDLEGGGGSD SEGEEEEEDS
LLAGLESDDE GGSSEGEGEE QQWSGSEGEQ EAGTSDSDAA DERQQQDQQG QQPAAAAAPS
AAKYVPPAAR RAAAAAAGGA ANEAAARVER RVRGLLNRLA EANIQGIVGD LAALYNEEGR
RLVSDAVCAE LIAAAAEGPR ASDRFAAVTA TAVAGLAGSV RSAEIVANFL DRLGARLEAA
VAAGDGLACH NLTMVAAHLF LSGALRPDVV HSMLDTWRER FTETDVSMIV TLLQACGLQL
RAADPVAMKE FVVAVHARAG EAGGLGRLTK RAQILLELVV DVKNNRTREE AKRKAVVLAP
AIGKWLRQCG APDAAVGGIS WRKVLQPGKK GIWWIPEATD AQPAQLGSGA ALRAAAAAAA
AGGEGGLSGP ELLKLAAAMR MNTDVRRAVF CVVMGSEDCV DAVEKLLRLG LKGEQEREVV
RVTVECCLQE KAWNPYYAHL LARLCGAGKS HKMTLQFCFW DQLKELDGME VRRLTNLARL
MASVIATQAL PATMLKGFDF AAALSARETL FWRIFFEHLL AACKSSDDSS ALFKRISAQP
ELKPLRVALA AFLRRSVGPW LAAKPPGAGG LSAEQLGELL RRLRGAERAL GPETEAAYVR
DRWTLLAKMD SQVGCILGLP SAAWAARLAG LVWRQQGSPL RQLAWIAGPA AAQAALHALP
LLFILLWGRM AYATHRSRVL LAQRLLMLPL LTVTQQLPPA MVAADLPSLL GQLLLATRAQ
PLLLFPLGFQ LPPGRSAALQ ALATAVGLRH LNQACSAAIF ELSSTKAVLT RLVSLLGRVP
SPMPLPPVSI PDDYPHTCWL LGAWIQICWL LVLPCALVTA WDLRARQQWA LRHPPGPHLP
AAECAWWAAL ALPCAWHQLL LLLPAFSLAV WRFMTMAAGG
//