ID C1GA21_PARBD Unreviewed; 221 AA.
AC C1GA21;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 07-JAN-2015, sequence version 2.
DT 27-MAR-2024, entry version 54.
DE RecName: Full=Sm domain-containing protein {ECO:0000259|PROSITE:PS52002};
GN ORFNames=PADG_04107 {ECO:0000313|EMBL:EEH48023.2};
OS Paracoccidioides brasiliensis (strain Pb18).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Onygenales; Onygenales incertae sedis; Paracoccidioides.
OX NCBI_TaxID=502780 {ECO:0000313|EMBL:EEH48023.2, ECO:0000313|Proteomes:UP000001628};
RN [1] {ECO:0000313|EMBL:EEH48023.2, ECO:0000313|Proteomes:UP000001628}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Pb18 {ECO:0000313|EMBL:EEH48023.2,
RC ECO:0000313|Proteomes:UP000001628};
RX PubMed=22046142; DOI=10.1371/journal.pgen.1002345;
RA Desjardins C.A., Champion M.D., Holder J.W., Muszewska A., Goldberg J.,
RA Bailao A.M., Brigido M.M., Ferreira M.E., Garcia A.M., Grynberg M.,
RA Gujja S., Heiman D.I., Henn M.R., Kodira C.D., Leon-Narvaez H.,
RA Longo L.V.G., Ma L.-J., Malavazi I., Matsuo A.L., Morais F.V., Pereira M.,
RA Rodriguez-Brito S., Sakthikumar S., Salem-Izacc S.M., Sykes S.M.,
RA Teixeira M.M., Vallejo M.C., Walter M.E., Yandava C., Young S., Zeng Q.,
RA Zucker J., Felipe M.S., Goldman G.H., Haas B.J., McEwen J.G., Nino-Vega G.,
RA Puccia R., San-Blas G., Soares C.M., Birren B.W., Cuomo C.A.;
RT "Comparative genomic analysis of human fungal pathogens causing
RT paracoccidioidomycosis.";
RL PLoS Genet. 7:E1002345-E1002345(2011).
CC -!- SUBUNIT: Component of the heptameric LSM1-LSM7 complex, which consists
CC of LSM1, LSM2, LSM3, LSM4, LSM5, LSM6 and LSM7. Component of the
CC heptameric LSM2-LSM8 complex, which consists of LSM2, LSM3, LSM4, LSM5,
CC LSM6, LSM7 and LSM8. The LSm subunits form a seven-membered ring
CC structure with a doughnut shape. {ECO:0000256|ARBA:ARBA00025892}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the snRNP SmB/SmN family.
CC {ECO:0000256|ARBA:ARBA00009123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KN275960; EEH48023.2; -; Genomic_DNA.
DR RefSeq; XP_010759320.1; XM_010761018.1.
DR AlphaFoldDB; C1GA21; -.
DR STRING; 502780.C1GA21; -.
DR GeneID; 22583298; -.
DR KEGG; pbn:PADG_04107; -.
DR VEuPathDB; FungiDB:PADG_04107; -.
DR eggNOG; KOG3168; Eukaryota.
DR HOGENOM; CLU_076902_1_1_1; -.
DR InParanoid; C1GA21; -.
DR OMA; KMINYRM; -.
DR OrthoDB; 5475294at2759; -.
DR Proteomes; UP000001628; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0032991; C:protein-containing complex; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR CDD; cd01717; Sm_B; 1.
DR Gene3D; 2.30.30.100; -; 1.
DR InterPro; IPR010920; LSM_dom_sf.
DR InterPro; IPR047575; Sm.
DR InterPro; IPR001163; Sm_dom_euk/arc.
DR PANTHER; PTHR10701:SF0; SMALL NUCLEAR RIBONUCLEOPROTEIN-ASSOCIATED PROTEIN B; 1.
DR PANTHER; PTHR10701; SMALL NUCLEAR RIBONUCLEOPROTEIN-ASSOCIATED PROTEIN B AND N; 1.
DR Pfam; PF01423; LSM; 1.
DR SMART; SM00651; Sm; 1.
DR SUPFAM; SSF50182; Sm-like ribonucleoproteins; 1.
DR PROSITE; PS52002; SM; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000001628};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274}.
FT DOMAIN 1..91
FT /note="Sm"
FT /evidence="ECO:0000259|PROSITE:PS52002"
FT REGION 116..221
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 138..213
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 221 AA; 21940 MW; 76D156CA9D37669A CRC64;
MPYQIQINYR MRVTLNDGRQ MTGQMLAFDK HMNLVLADTE EFRRVKRKST KTTQAPGSST
PSLVEVEEKR TLGLTIVRGT HVVSCSVDGP PPADPAARLG TTVPSVSGAG PTTLAAGPGI
SRPAGRGLPV GLGGPAAGVG GPPPPGGFGG FPPAGFPGAP PPGFPGRGGP PGGPPGFGPP
PGFGGPAGVP GAPPGGFQPP TGFQPPPGQG RGFPPPGFGG R
//