ID G0S8T1_CHATD Unreviewed; 534 AA.
AC G0S8T1;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 36.
DE RecName: Full=WSC domain-containing protein {ECO:0000259|PROSITE:PS51212};
GN ORFNames=CTHT_0040230 {ECO:0000313|EMBL:EGS20284.1};
OS Chaetomium thermophilum (strain DSM 1495 / CBS 144.50 / IMI 039719)
OS (Thermochaetoides thermophila).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Chaetomiaceae; Thermochaetoides.
OX NCBI_TaxID=759272 {ECO:0000313|Proteomes:UP000008066};
RN [1] {ECO:0000313|EMBL:EGS20284.1, ECO:0000313|Proteomes:UP000008066}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 1495 / CBS 144.50 / IMI 039719
RC {ECO:0000313|Proteomes:UP000008066};
RX PubMed=21784248; DOI=10.1016/j.cell.2011.06.039;
RA Amlacher S., Sarges P., Flemming D., van Noort V., Kunze R., Devos D.P.,
RA Arumugam M., Bork P., Hurt E.;
RT "Insight into structure and assembly of the nuclear pore complex by
RT utilizing the genome of a eukaryotic thermophile.";
RL Cell 146:277-289(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL988042; EGS20284.1; -; Genomic_DNA.
DR RefSeq; XP_006694433.1; XM_006694370.1.
DR AlphaFoldDB; G0S8T1; -.
DR GeneID; 18258061; -.
DR KEGG; cthr:CTHT_0040230; -.
DR eggNOG; KOG4157; Eukaryota.
DR HOGENOM; CLU_509962_0_0_1; -.
DR OMA; ANECFCA; -.
DR OrthoDB; 2475704at2759; -.
DR Proteomes; UP000008066; Unassembled WGS sequence.
DR GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:InterPro.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR InterPro; IPR003305; CenC_carb-bd.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR002889; WSC_carb-bd.
DR PANTHER; PTHR24269; KREMEN PROTEIN; 1.
DR PANTHER; PTHR24269:SF16; PROTEIN SLG1; 1.
DR Pfam; PF02018; CBM_4_9; 1.
DR Pfam; PF01822; WSC; 3.
DR SMART; SM00321; WSC; 3.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS51212; WSC; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008066};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..534
FT /note="WSC domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003408912"
FT DOMAIN 37..125
FT /note="WSC"
FT /evidence="ECO:0000259|PROSITE:PS51212"
FT DOMAIN 136..229
FT /note="WSC"
FT /evidence="ECO:0000259|PROSITE:PS51212"
FT DOMAIN 242..332
FT /note="WSC"
FT /evidence="ECO:0000259|PROSITE:PS51212"
FT REGION 333..370
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 343..370
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 534 AA; 57624 MW; 73F6A9421710E83F CRC64;
MSFQRLLPLG VLAATASATA LPQFPKPYPK IARDVTPYTP LGCFVDSGER IFPHRVISSP
DMTAAKCAEN CEGYDYFGTQ WSSECYCGSV APTVPADPSE CNMPCSGDPN EICGAGMRLT
VYQFDKAPVS YPDVNGYEYQ GCYTDNMSLR VLGGNTFGGA NMTLESCAAF CSGYGYSIFG
TENGNECFCG AFLDEDSVKV SEAECPMTCK GNENQKCGGP SRLSVYKLPN SNPPFVPAVV
DEFRYESCWV DDVNDRALTS VDWRDDTMTI DKCAEHCKDY LYFGLEYGRE CYCANEISSG
QAVPEKECAM LCPGDATLFC GGGSRLTLYK REDCDEPEPS TTGAPEPTDT AVPTETAEPT
APVTTTATAT ATAEPTATPI LDGTFESGLG SFTVVDSELN LDYEITDTLT HAGQGALLVK
NKDENIGVLI LETTVTVEPS ATYAFSLYYW HTNPDAYTTL YLSAGGDVNN SDEEADLQQS
PANQWLSQSV TFTTEADQTS IKLEIIVGAV GFSGNGSPEY SDDLYIDDVT LVRI
//