ID V5GIF3_KALBG Unreviewed; 542 AA.
AC V5GIF3;
DT 22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT 22-JAN-2014, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=Sm domain-containing protein {ECO:0000259|PROSITE:PS52002};
GN ORFNames=PSEUBRA_SCAF4g04896 {ECO:0000313|EMBL:EST05757.1};
OS Kalmanozyma brasiliensis (strain GHG001) (Yeast) (Pseudozyma brasiliensis).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina;
OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Kalmanozyma.
OX NCBI_TaxID=1365824 {ECO:0000313|EMBL:EST05757.1, ECO:0000313|Proteomes:UP000019377};
RN [1] {ECO:0000313|Proteomes:UP000019377}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=GHG001 {ECO:0000313|Proteomes:UP000019377};
RX PubMed=24356824; DOI=10.1128/genomea.00920-13;
RA Oliveira J.V.D.C., dos Santos R.A.C., Borges T.A., Riano-Pachon D.M.,
RA Goldman G.H.;
RT "Draft genome sequence of Pseudozyma brasiliensis sp. nov. strain GHG001, a
RT high producer of endo-1,4-xylanase isolated from an insect pest of
RT sugarcane.";
RL Genome Announc. 1:E0092013-E0092013(2013).
CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytosol
CC {ECO:0000256|ARBA:ARBA00004514}.
CC -!- SIMILARITY: Belongs to the snRNP core protein family.
CC {ECO:0000256|ARBA:ARBA00008146}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI545884; EST05757.1; -; Genomic_DNA.
DR RefSeq; XP_016290746.1; XM_016438277.1.
DR AlphaFoldDB; V5GIF3; -.
DR STRING; 1365824.V5GIF3; -.
DR GeneID; 27420970; -.
DR eggNOG; KOG2190; Eukaryota.
DR eggNOG; KOG3172; Eukaryota.
DR HOGENOM; CLU_022670_4_0_1; -.
DR OMA; MQTANTH; -.
DR OrthoDB; 241824at2759; -.
DR Proteomes; UP000019377; Unassembled WGS sequence.
DR GO; GO:0005829; C:cytosol; IEA:UniProtKB-SubCell.
DR GO; GO:0005681; C:spliceosomal complex; IEA:InterPro.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000387; P:spliceosomal snRNP assembly; IEA:InterPro.
DR CDD; cd01721; Sm_D3; 1.
DR Gene3D; 2.30.30.100; -; 1.
DR Gene3D; 3.30.1370.10; K Homology domain, type 1; 3.
DR InterPro; IPR004087; KH_dom.
DR InterPro; IPR004088; KH_dom_type_1.
DR InterPro; IPR036612; KH_dom_type_1_sf.
DR InterPro; IPR010920; LSM_dom_sf.
DR InterPro; IPR047575; Sm.
DR InterPro; IPR001163; Sm_dom_euk/arc.
DR InterPro; IPR034099; SmD3.
DR PANTHER; PTHR10288; KH DOMAIN CONTAINING RNA BINDING PROTEIN; 1.
DR PANTHER; PTHR10288:SF342; PAB1-BINDING PROTEIN 2; 1.
DR Pfam; PF00013; KH_1; 3.
DR Pfam; PF01423; LSM; 1.
DR SMART; SM00322; KH; 3.
DR SMART; SM00651; Sm; 1.
DR SUPFAM; SSF54791; Eukaryotic type KH-domain (KH-domain type I); 3.
DR SUPFAM; SSF50182; Sm-like ribonucleoproteins; 1.
DR PROSITE; PS50084; KH_TYPE_1; 3.
DR PROSITE; PS52002; SM; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000019377};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00117}.
FT DOMAIN 6..78
FT /note="Sm"
FT /evidence="ECO:0000259|PROSITE:PS52002"
FT REGION 154..235
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 400..428
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 158..181
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 199..235
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 542 AA; 56724 MW; 9909AFE7274988A6 CRC64;
MGTIGIPVKL IHEAVGHVIT VELKGGASYR GTLYDAEDNF NIAMKDITVT APDGKQSHLE
NVYIRGNMLR FIIVPDMLQQ APMFKRIGPN AMKGRGIGSA RGRATILREP VRTIISSPAQ
TKSLFGASSR PTIHSIVSKG SPTTIRLNVD ANVKAKSKND GNDTPASPDR QSNSTTKKMA
SDASGKPEEM DAVDLGNVTA DGDNNNADTS ALSNDQDGDQ VADSAGSVGD SSETQATQIS
MRTLIVTSDA SIIIGKSGKH INEIRDKSNA RLNISEIIQG NPERILTVSG PLDAVSKAFG
LIVRRINDEP FDQPSVPGSK SVTIRFIVPN SRMGSVIGKQ GSKIKEIQEA SGARLTAGEA
MLPGSTERVL SISGVADAVH IAVYYVGTIL LEHQDRNANN LPYRPTAGGP STRPPAPGGN
PYAAPQQAFG YGAPAPPFGA APAGAGGAPQ LPPGSQTQQI FIPNDLVGCI IGKGGSKINE
IRSMSASHIK IMEPGAGIAA GGSGNERLVT ITGPPPNIQM AVSLLYQRLE QEKMRLAQGG
AP
//