ID C1ML55_MICPC Unreviewed; 424 AA.
AC C1ML55;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 26-MAY-2009, sequence version 1.
DT 27-MAR-2024, entry version 54.
DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EEH59873.1};
GN ORFNames=MICPUCDRAFT_31728 {ECO:0000313|EMBL:EEH59873.1};
OS Micromonas pusilla (strain CCMP1545) (Picoplanktonic green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; Mamiellophyceae; Mamiellales;
OC Mamiellaceae; Micromonas.
OX NCBI_TaxID=564608 {ECO:0000313|Proteomes:UP000001876};
RN [1] {ECO:0000313|EMBL:EEH59873.1, ECO:0000313|Proteomes:UP000001876}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1545 {ECO:0000313|EMBL:EEH59873.1,
RC ECO:0000313|Proteomes:UP000001876};
RX PubMed=19359590; DOI=10.1126/science.1167222;
RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L.,
RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E.,
RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M.,
RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H.,
RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., Gready J.E.,
RA John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., Moreau H.,
RA Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., Piegu B.,
RA Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., Zelensky A.,
RA Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., Van de Peer Y.,
RA Grigoriev I.V.;
RT "Green evolution and dynamic adaptations revealed by genomes of the marine
RT picoeukaryotes Micromonas.";
RL Science 324:268-272(2009).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the RRM CWC2 family.
CC {ECO:0000256|ARBA:ARBA00008024}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG663736; EEH59873.1; -; Genomic_DNA.
DR RefSeq; XP_003056497.1; XM_003056451.1.
DR AlphaFoldDB; C1ML55; -.
DR STRING; 564608.C1ML55; -.
DR GeneID; 9681397; -.
DR KEGG; mpp:MICPUCDRAFT_31728; -.
DR eggNOG; KOG0118; Eukaryota.
DR OrthoDB; 929875at2759; -.
DR Proteomes; UP000001876; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR CDD; cd12360; RRM_cwf2; 1.
DR Gene3D; 3.30.70.330; -; 1.
DR InterPro; IPR039171; Cwc2/Slt11.
DR InterPro; IPR034181; Cwc2_RRM.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR032297; Torus.
DR InterPro; IPR000571; Znf_CCCH.
DR PANTHER; PTHR14089:SF2; PRE-MRNA-SPLICING FACTOR CWC2; 1.
DR PANTHER; PTHR14089; PRE-MRNA-SPLICING FACTOR RBM22; 1.
DR Pfam; PF00076; RRM_1; 1.
DR Pfam; PF16131; Torus; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR PROSITE; PS50103; ZF_C3H1; 1.
PE 3: Inferred from homology;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723, ECO:0000256|PROSITE-
KW ProRule:PRU00723}; mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000001876};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PROSITE-ProRule:PRU00723};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00723}.
FT DOMAIN 92..119
FT /note="C3H1-type"
FT /evidence="ECO:0000259|PROSITE:PS50103"
FT ZN_FING 92..119
FT /note="C3H1-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00723"
FT REGION 59..89
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 65..89
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 424 AA; 48561 MW; C3458F680D1486C8 CRC64;
MRPESDSLSN GREVRGATFM WTTEAVERAL ERRAIPQSQP EDSISCDLWP SNERNLRHRS
KYTNENRSNT QATALSRCNS RTDSGRTKGS ISKQQHFCLH FARGRCVKGY ECLYLHHLPT
GFHEAAKPAL YDIFGREKHR LEREDNGGTG SHMRQCRTLF VYFGGAGEWG GQRLRQLISK
SFGEWGPVED IHVVPSKCIS FVRYKFIVSA EFAKEAMMGQ NLFGGKESLT VRWANDDPNP
TAIHRVKRER EDVIVDASER ANTRTPSWHD NERRQYAYLQ TTRGQNFPYQ NSLHTQDYPD
TEAQYEAHLN NVSTQSSTKH ALQDVKMMHD ENPDSVEEDY EIRGARPFSK SRTIMPQVFE
LEENSQKVSN QAEINEISRR TQLDSEILRA QHVAYRAYLD AGGKPDDWNP DSLSTFVAAP
GSFC
//