ID G0S5H2_CHATD Unreviewed; 769 AA.
AC G0S5H2;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 24-JAN-2024, entry version 50.
DE RecName: Full=Sm domain-containing protein {ECO:0000259|PROSITE:PS52002};
GN ORFNames=CTHT_0032950 {ECO:0000313|EMBL:EGS21437.1};
OS Chaetomium thermophilum (strain DSM 1495 / CBS 144.50 / IMI 039719)
OS (Thermochaetoides thermophila).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Chaetomiaceae; Thermochaetoides.
OX NCBI_TaxID=759272 {ECO:0000313|Proteomes:UP000008066};
RN [1] {ECO:0000313|EMBL:EGS21437.1, ECO:0000313|Proteomes:UP000008066}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 1495 / CBS 144.50 / IMI 039719
RC {ECO:0000313|Proteomes:UP000008066};
RX PubMed=21784248; DOI=10.1016/j.cell.2011.06.039;
RA Amlacher S., Sarges P., Flemming D., van Noort V., Kunze R., Devos D.P.,
RA Arumugam M., Bork P., Hurt E.;
RT "Insight into structure and assembly of the nuclear pore complex by
RT utilizing the genome of a eukaryotic thermophile.";
RL Cell 146:277-289(2011).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the snRNP Sm proteins family.
CC {ECO:0000256|ARBA:ARBA00006850}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL988041; EGS21437.1; -; Genomic_DNA.
DR RefSeq; XP_006693733.1; XM_006693670.1.
DR AlphaFoldDB; G0S5H2; -.
DR STRING; 759272.G0S5H2; -.
DR GeneID; 18257333; -.
DR KEGG; cthr:CTHT_0032950; -.
DR eggNOG; KOG3293; Eukaryota.
DR eggNOG; KOG4197; Eukaryota.
DR HOGENOM; CLU_008514_1_1_1; -.
DR OMA; TEIREIC; -.
DR OrthoDB; 1381922at2759; -.
DR Proteomes; UP000008066; Unassembled WGS sequence.
DR GO; GO:0097525; C:spliceosomal snRNP complex; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR GO; GO:0000956; P:nuclear-transcribed mRNA catabolic process; IEA:InterPro.
DR CDD; cd01723; LSm4; 1.
DR Gene3D; 2.30.30.100; -; 1.
DR Gene3D; 1.25.40.10; Tetratricopeptide repeat domain; 2.
DR InterPro; IPR034101; Lsm4.
DR InterPro; IPR027141; LSm4/Sm_D1/D3.
DR InterPro; IPR010920; LSM_dom_sf.
DR InterPro; IPR002885; Pentatricopeptide_rpt.
DR InterPro; IPR047575; Sm.
DR InterPro; IPR001163; Sm_dom_euk/arc.
DR InterPro; IPR011990; TPR-like_helical_dom_sf.
DR PANTHER; PTHR23338; SMALL NUCLEAR RIBONUCLEOPROTEIN SM; 1.
DR PANTHER; PTHR23338:SF16; U6 SNRNA-ASSOCIATED SM-LIKE PROTEIN LSM4; 1.
DR Pfam; PF01423; LSM; 1.
DR Pfam; PF13812; PPR_3; 1.
DR SMART; SM00651; Sm; 1.
DR SUPFAM; SSF50182; Sm-like ribonucleoproteins; 1.
DR PROSITE; PS51375; PPR; 1.
DR PROSITE; PS52002; SM; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008066};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 2..75
FT /note="Sm"
FT /evidence="ECO:0000259|PROSITE:PS52002"
FT REPEAT 512..546
FT /note="PPR"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00708"
FT REGION 204..225
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 209..225
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 769 AA; 87566 MW; 19CFEED6E3E08765 CRC64;
MLPLGLLTAA QGHPMLVELK NGETLNGHLV QCDTWMNLTL REVVQTSPEG DKFMRLPEVY
VKGNNIKYLR VPDEIIDIVK EQQQQQHRGA VVGEEEVALR EVDGVARGVH ECDSCWDSLN
GTLATGMGVV QRHRHLQRQQ QQLLLLRDYE TTTWRNSFWG TSLGSRFGEK ARQEDVGAQQ
FEHAVGDRCM TRRKHDWEVW GEGSRGNAFS PQPLPQQASV SSSQGPARVQ MQHENAPVTA
DDLIKSEASF PQRTFQASTE VIYEALRGLR ERRDGAIKIR RFVEHLVKER GEQPNEALYE
ALVVANWHTT TGSAGELRAI MKEMKKLGIK FSESFYHSAL RLLAIHPDYI TRTIILDKMN
EEGILLNDDG KCSVALGYLR DGQYEMALDY LDQICHEGVD VPGWMFDIFF FVLTRHGFLD
EALHLLQYLV DRAEGFLNAV PLNSWYYFLD ECSKAFYYEG TKFVWENLVR PGILHPSDGV
MINVLNTASR YNDAELATKV IEQLSARKIK LTANHYEALL DCYANIGDLQ NAFQVLSIMA
DAGIMPDQSS TRSLYLFLKS HPERADEVVK ILNELSKDHR IPVAAMNVVL EALLKAGDMV
KAMHVYRDLR YLCKGGPNQQ TFQMLLEQCK SSEVAGFLTS EMHQFSVRSS PEILDHVIRC
FAFDGLLDAV LSYLSEWNRS PMRRGWISTP TLTAVLERCY RARDVRVWRV VDEAQRRNVR
IDPQIMEVLE AEIPRADAYR ELPAHGWATS DEAYQSKEVS SDSKDYASR
//