GenomeNet

Database: UniProt
Entry: V5GIF3_KALBG
LinkDB: V5GIF3_KALBG
Original site: V5GIF3_KALBG 
ID   V5GIF3_KALBG            Unreviewed;       542 AA.
AC   V5GIF3;
DT   22-JAN-2014, integrated into UniProtKB/TrEMBL.
DT   22-JAN-2014, sequence version 1.
DT   27-MAR-2024, entry version 39.
DE   RecName: Full=Sm domain-containing protein {ECO:0000259|PROSITE:PS52002};
GN   ORFNames=PSEUBRA_SCAF4g04896 {ECO:0000313|EMBL:EST05757.1};
OS   Kalmanozyma brasiliensis (strain GHG001) (Yeast) (Pseudozyma brasiliensis).
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina;
OC   Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Kalmanozyma.
OX   NCBI_TaxID=1365824 {ECO:0000313|EMBL:EST05757.1, ECO:0000313|Proteomes:UP000019377};
RN   [1] {ECO:0000313|Proteomes:UP000019377}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=GHG001 {ECO:0000313|Proteomes:UP000019377};
RX   PubMed=24356824; DOI=10.1128/genomea.00920-13;
RA   Oliveira J.V.D.C., dos Santos R.A.C., Borges T.A., Riano-Pachon D.M.,
RA   Goldman G.H.;
RT   "Draft genome sequence of Pseudozyma brasiliensis sp. nov. strain GHG001, a
RT   high producer of endo-1,4-xylanase isolated from an insect pest of
RT   sugarcane.";
RL   Genome Announc. 1:E0092013-E0092013(2013).
CC   -!- SUBCELLULAR LOCATION: Cytoplasm, cytosol
CC       {ECO:0000256|ARBA:ARBA00004514}.
CC   -!- SIMILARITY: Belongs to the snRNP core protein family.
CC       {ECO:0000256|ARBA:ARBA00008146}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KI545884; EST05757.1; -; Genomic_DNA.
DR   RefSeq; XP_016290746.1; XM_016438277.1.
DR   AlphaFoldDB; V5GIF3; -.
DR   STRING; 1365824.V5GIF3; -.
DR   GeneID; 27420970; -.
DR   eggNOG; KOG2190; Eukaryota.
DR   eggNOG; KOG3172; Eukaryota.
DR   HOGENOM; CLU_022670_4_0_1; -.
DR   OMA; MQTANTH; -.
DR   OrthoDB; 241824at2759; -.
DR   Proteomes; UP000019377; Unassembled WGS sequence.
DR   GO; GO:0005829; C:cytosol; IEA:UniProtKB-SubCell.
DR   GO; GO:0005681; C:spliceosomal complex; IEA:InterPro.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000387; P:spliceosomal snRNP assembly; IEA:InterPro.
DR   CDD; cd01721; Sm_D3; 1.
DR   Gene3D; 2.30.30.100; -; 1.
DR   Gene3D; 3.30.1370.10; K Homology domain, type 1; 3.
DR   InterPro; IPR004087; KH_dom.
DR   InterPro; IPR004088; KH_dom_type_1.
DR   InterPro; IPR036612; KH_dom_type_1_sf.
DR   InterPro; IPR010920; LSM_dom_sf.
DR   InterPro; IPR047575; Sm.
DR   InterPro; IPR001163; Sm_dom_euk/arc.
DR   InterPro; IPR034099; SmD3.
DR   PANTHER; PTHR10288; KH DOMAIN CONTAINING RNA BINDING PROTEIN; 1.
DR   PANTHER; PTHR10288:SF342; PAB1-BINDING PROTEIN 2; 1.
DR   Pfam; PF00013; KH_1; 3.
DR   Pfam; PF01423; LSM; 1.
DR   SMART; SM00322; KH; 3.
DR   SMART; SM00651; Sm; 1.
DR   SUPFAM; SSF54791; Eukaryotic type KH-domain (KH-domain type I); 3.
DR   SUPFAM; SSF50182; Sm-like ribonucleoproteins; 1.
DR   PROSITE; PS50084; KH_TYPE_1; 3.
DR   PROSITE; PS52002; SM; 1.
PE   3: Inferred from homology;
KW   Reference proteome {ECO:0000313|Proteomes:UP000019377};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00117}.
FT   DOMAIN          6..78
FT                   /note="Sm"
FT                   /evidence="ECO:0000259|PROSITE:PS52002"
FT   REGION          154..235
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          400..428
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        158..181
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        199..235
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   542 AA;  56724 MW;  9909AFE7274988A6 CRC64;
     MGTIGIPVKL IHEAVGHVIT VELKGGASYR GTLYDAEDNF NIAMKDITVT APDGKQSHLE
     NVYIRGNMLR FIIVPDMLQQ APMFKRIGPN AMKGRGIGSA RGRATILREP VRTIISSPAQ
     TKSLFGASSR PTIHSIVSKG SPTTIRLNVD ANVKAKSKND GNDTPASPDR QSNSTTKKMA
     SDASGKPEEM DAVDLGNVTA DGDNNNADTS ALSNDQDGDQ VADSAGSVGD SSETQATQIS
     MRTLIVTSDA SIIIGKSGKH INEIRDKSNA RLNISEIIQG NPERILTVSG PLDAVSKAFG
     LIVRRINDEP FDQPSVPGSK SVTIRFIVPN SRMGSVIGKQ GSKIKEIQEA SGARLTAGEA
     MLPGSTERVL SISGVADAVH IAVYYVGTIL LEHQDRNANN LPYRPTAGGP STRPPAPGGN
     PYAAPQQAFG YGAPAPPFGA APAGAGGAPQ LPPGSQTQQI FIPNDLVGCI IGKGGSKINE
     IRSMSASHIK IMEPGAGIAA GGSGNERLVT ITGPPPNIQM AVSLLYQRLE QEKMRLAQGG
     AP
//
DBGET integrated database retrieval system