ID A0A0F4YFC1_TALEM Unreviewed; 765 AA.
AC A0A0F4YFC1;
DT 24-JUN-2015, integrated into UniProtKB/TrEMBL.
DT 24-JUN-2015, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=Sm domain-containing protein {ECO:0000259|PROSITE:PS52002};
GN ORFNames=T310_9561 {ECO:0000313|EMBL:KKA16844.1};
OS Rasamsonia emersonii CBS 393.64.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Eurotiales; Trichocomaceae; Rasamsonia.
OX NCBI_TaxID=1408163 {ECO:0000313|EMBL:KKA16844.1, ECO:0000313|Proteomes:UP000053958};
RN [1] {ECO:0000313|EMBL:KKA16844.1, ECO:0000313|Proteomes:UP000053958}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 393.64 {ECO:0000313|EMBL:KKA16844.1,
RC ECO:0000313|Proteomes:UP000053958};
RA Heijne W.H., Fedorova N.D., Nierman W.C., Vollebregt A.W., Zhao Z., Wu L.,
RA Kumar M., Stam H., van den Berg M.A., Pel H.J.;
RL Submitted (APR-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBUNIT: Component of the heptameric LSM1-LSM7 complex, which consists
CC of LSM1, LSM2, LSM3, LSM4, LSM5, LSM6 and LSM7. Component of the
CC heptameric LSM2-LSM8 complex, which consists of LSM2, LSM3, LSM4, LSM5,
CC LSM6, LSM7 and LSM8. The LSm subunits form a seven-membered ring
CC structure with a doughnut shape. {ECO:0000256|ARBA:ARBA00025892}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KKA16844.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LASV01000733; KKA16844.1; -; Genomic_DNA.
DR RefSeq; XP_013323456.1; XM_013468002.1.
DR AlphaFoldDB; A0A0F4YFC1; -.
DR STRING; 1408163.A0A0F4YFC1; -.
DR GeneID; 25321493; -.
DR OrthoDB; 1381922at2759; -.
DR Proteomes; UP000053958; Unassembled WGS sequence.
DR GO; GO:0043229; C:intracellular organelle; IEA:UniProt.
DR GO; GO:0032991; C:protein-containing complex; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR GO; GO:0000956; P:nuclear-transcribed mRNA catabolic process; IEA:InterPro.
DR CDD; cd01723; LSm4; 1.
DR Gene3D; 2.30.30.100; -; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 3.
DR InterPro; IPR034101; Lsm4.
DR InterPro; IPR010920; LSM_dom_sf.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR047575; Sm.
DR InterPro; IPR001163; Sm_dom_euk/arc.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR47939; MEMBRANE-ASSOCIATED SALT-INDUCIBLE PROTEIN-LIKE; 1.
DR PANTHER; PTHR47939:SF1; OS04G0684500 PROTEIN; 1.
DR Pfam; PF01423; LSM; 1.
DR Pfam; PF13812; PPR_3; 1.
DR SMART; SM00651; Sm; 1.
DR SUPFAM; SSF50182; Sm-like ribonucleoproteins; 1.
DR PROSITE; PS51375; PPR; 1.
DR PROSITE; PS52002; SM; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000053958}.
FT DOMAIN 2..75
FT /note="Sm"
FT /evidence="ECO:0000259|PROSITE:PS52002"
FT REPEAT 663..697
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REGION 223..275
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 252..275
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 765 AA; 86702 MW; FA09D2401683E2AA CRC64;
MLPLGLLTAA QGHPMLVELK NGETLNGHLV NCDNWMNLTL KEVVQTSPEG DRFFRLPEVY
VRGNNVRDNR SAARNKCLRI GAVVVHRSEA TAARETETVV EDEEDGPVVV AEVEPARPIK
SIEKNEIRYP CDKSSEWMCD DTFGQIRQAW YWRIVAAYQE KGGTFWDSRL EMRRGNLRVD
GLWYCLCPSF CHSSLRQPFP SFVSRRASPT RCRVSNLAPD AAPGIRRYSS AGGSDTAQTE
RASEELSKNH ARESVSSAQS PTSAEEVQQQ QDDQPKIRIT RYFARDPALV RVPQSISQKS
TEDLENYLNQ VTSEKSPKML TVTAVLRALI AERHIEPKVR HYRALILANT DPRYGSPENV
RWLLQEMEEN GIAADSSTLH AALQALAVHP DYLLRQEVLR TLRDRWLSLS PTGWHHLVAG
LIREQQFELA LDHLDHMARM EIPVQSWLHA LLVYKMCELE DFDQVYDLMS SRASQQQDIS
MDLWLYVLDE ASEALHHKTT RYVWKQVVEL GYLNPSYGVC GNVLTVASRT GDTDLAASVI
RFLVKTEVPL TLEDYEKLAE THVMAGDLQA AFDTLCLMHN AGIQLEESST RSILTYMVQK
KLKPQLAWET LKRLKAQGRN VPVGCANVVI EYCEHELARD QGAVDKAIKF YKELYDLCSS
PADVTTYNCL ISLSRRAKRS DACMFVVKEM AALGVTPDIT TFELLILMCL ELENFRSAYM
YFQDLLDRGW TLSDSTCAKI RELCQGSEDE FAVRLVSHPM LSILG
//