ID D4AY15_ARTBC Unreviewed; 192 AA.
AC D4AY15;
DT 18-MAY-2010, integrated into UniProtKB/TrEMBL.
DT 18-MAY-2010, sequence version 1.
DT 27-MAR-2024, entry version 51.
DE RecName: Full=Sm domain-containing protein {ECO:0000259|PROSITE:PS52002};
GN ORFNames=ARB_01084 {ECO:0000313|EMBL:EFE32193.1};
OS Arthroderma benhamiae (strain ATCC MYA-4681 / CBS 112371) (Trichophyton
OS mentagrophytes).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Onygenales; Arthrodermataceae; Trichophyton.
OX NCBI_TaxID=663331 {ECO:0000313|EMBL:EFE32193.1, ECO:0000313|Proteomes:UP000008866};
RN [1] {ECO:0000313|Proteomes:UP000008866}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-4681 / CBS 112371 {ECO:0000313|Proteomes:UP000008866};
RX PubMed=21247460; DOI=10.1186/gb-2011-12-1-r7;
RA Burmester A., Shelest E., Gloeckner G., Heddergott C., Schindler S.,
RA Staib P., Heidel A., Felder M., Petzold A., Szafranski K., Feuermann M.,
RA Pedruzzi I., Priebe S., Groth M., Winkler R., Li W., Kniemeyer O.,
RA Schroeckh V., Hertweck C., Hube B., White T.C., Platzer M., Guthke R.,
RA Heitman J., Woestemeyer J., Zipfel P.F., Monod M., Brakhage A.A.;
RT "Comparative and functional genomics provide insights into the
RT pathogenicity of dermatophytic fungi.";
RL Genome Biol. 12:R7.1-R7.16(2011).
CC -!- SUBUNIT: Component of the heptameric LSM1-LSM7 complex, which consists
CC of LSM1, LSM2, LSM3, LSM4, LSM5, LSM6 and LSM7. Component of the
CC heptameric LSM2-LSM8 complex, which consists of LSM2, LSM3, LSM4, LSM5,
CC LSM6, LSM7 and LSM8. The LSm subunits form a seven-membered ring
CC structure with a doughnut shape. {ECO:0000256|ARBA:ARBA00025892}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the snRNP SmB/SmN family.
CC {ECO:0000256|ARBA:ARBA00009123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EFE32193.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ABSU01000017; EFE32193.1; -; Genomic_DNA.
DR RefSeq; XP_003012833.1; XM_003012787.1.
DR AlphaFoldDB; D4AY15; -.
DR STRING; 663331.D4AY15; -.
DR GeneID; 9522911; -.
DR KEGG; abe:ARB_01084; -.
DR eggNOG; KOG3168; Eukaryota.
DR HOGENOM; CLU_076902_1_1_1; -.
DR OMA; KMINYRM; -.
DR OrthoDB; 5475294at2759; -.
DR Proteomes; UP000008866; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0032991; C:protein-containing complex; IEA:UniProt.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR CDD; cd01717; Sm_B; 1.
DR Gene3D; 2.30.30.100; -; 1.
DR InterPro; IPR010920; LSM_dom_sf.
DR InterPro; IPR047575; Sm.
DR InterPro; IPR001163; Sm_dom_euk/arc.
DR PANTHER; PTHR10701:SF0; SMALL NUCLEAR RIBONUCLEOPROTEIN-ASSOCIATED PROTEIN B; 1.
DR PANTHER; PTHR10701; SMALL NUCLEAR RIBONUCLEOPROTEIN-ASSOCIATED PROTEIN B AND N; 1.
DR Pfam; PF01423; LSM; 1.
DR SMART; SM00651; Sm; 1.
DR SUPFAM; SSF50182; Sm-like ribonucleoproteins; 1.
DR PROSITE; PS52002; SM; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000008866};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274}.
FT DOMAIN 1..83
FT /note="Sm"
FT /evidence="ECO:0000259|PROSITE:PS52002"
FT REGION 36..57
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 136..192
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 136..181
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 192 AA; 18676 MW; E338D25C3AF5766A CRC64;
MRVTLTDGRQ MTGQMLAFDK HMNLVLADTE EFRRVKKKSG KGGQAAPGSS STTPLVEAEE
KRTLGLTIVR GTHVVSCSVD GPPPAEPAAR LGTTAAGLAG VSATLAAGPG ISKPAGRGLP
VGLGGPAAGV GGPPPPAGFG AFPPPAGFPG GPPPGFGGRG GPPGGPPGGF QPPPGFQPPG
QGRGFPPGMG GR
//