ID T5AML1_OPHSC Unreviewed; 1275 AA.
AC T5AML1;
DT 13-NOV-2013, integrated into UniProtKB/TrEMBL.
DT 13-NOV-2013, sequence version 1.
DT 24-JAN-2024, entry version 26.
DE SubName: Full=Carbohydrate-binding WSC {ECO:0000313|EMBL:EQL03636.1};
GN ORFNames=OCS_00640 {ECO:0000313|EMBL:EQL03636.1};
OS Ophiocordyceps sinensis (strain Co18 / CGMCC 3.14243) (Yarsagumba
OS caterpillar fungus) (Hirsutella sinensis).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Ophiocordycipitaceae; Ophiocordyceps.
OX NCBI_TaxID=911162 {ECO:0000313|EMBL:EQL03636.1, ECO:0000313|Proteomes:UP000019374};
RN [1] {ECO:0000313|EMBL:EQL03636.1, ECO:0000313|Proteomes:UP000019374}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Co18 / CGMCC 3.14243 {ECO:0000313|Proteomes:UP000019374};
RC TISSUE=Fruit-body {ECO:0000313|EMBL:EQL03636.1};
RX DOI=10.1007/s11434-013-5929-5;
RA Hu X., Zhang Y., Xiao G., Zheng P., Xia Y., Zhang X., St Leger R.J.,
RA Liu X., Wang C.;
RT "Genome survey uncovers the secrets of sex and lifestyle in caterpillar
RT fungus.";
RL Chin. Sci. Bull. 58:2846-2854(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KE652200; EQL03636.1; -; Genomic_DNA.
DR AlphaFoldDB; T5AML1; -.
DR eggNOG; KOG4157; Eukaryota.
DR HOGENOM; CLU_000702_0_0_1; -.
DR Proteomes; UP000019374; Unassembled WGS sequence.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 1.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR011047; Quinoprotein_ADH-like_supfam.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR002889; WSC_carb-bd.
DR PANTHER; PTHR45964:SF5; WSC DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR45964; WSCD FAMILY MEMBER CG9164; 1.
DR Pfam; PF01822; WSC; 3.
DR SMART; SM00321; WSC; 3.
DR SUPFAM; SSF50998; Quinoprotein alcohol dehydrogenase-like; 1.
DR PROSITE; PS51212; WSC; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000019374};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..1275
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004606116"
FT DOMAIN 915..1006
FT /note="WSC"
FT /evidence="ECO:0000259|PROSITE:PS51212"
FT DOMAIN 1034..1126
FT /note="WSC"
FT /evidence="ECO:0000259|PROSITE:PS51212"
FT DOMAIN 1167..1259
FT /note="WSC"
FT /evidence="ECO:0000259|PROSITE:PS51212"
FT REGION 1125..1159
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1126..1159
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1275 AA; 133950 MW; 2CBDCD8576988B7B CRC64;
MKFSGAVIAS ALVATVSALA STDTITWGGD NSRSGFQTNH GMDPAIVGSS QFGQVFRTLL
PGTYVGQPEQ IFSQPLVYTP NGGTRQYVYF ATTQNNVYKL DAKTGEIVAS RALHIPFLTA
DLDGCRDIEP TIGVTSTGVI DASSDTLYLT AKTYVSQFGD KPQGRPAGRY FVHALDVNDL
SERPNFPVDL EGVPSRNGAS RKFTGGIHLQ RAALLHTGQY IYAGFASHCV QYNFTGWIIG
WDKSTGKIVE GWTTQGQGIP NTIPGAGIWM SGGGLSSDDA GSIFAATGNG YASQLSTIPV
KGFNPPTALE EAALHLTMNA DGSLNVVDFF MPWEKQALDG GDQDLGTTPL EVLPSQFACG
DFKRIGVVTG KSGKTYWLNL DNLGGYRNGK DTQDDVLQVF QNENSVYAGA GVYPLEGGYV
FINVIGSPTN VFKFSCNNGV PTFAKVAVSP TINAYILGVS HGTVTSLDGQ EGTGLLWVTD
VQGVGVKIYN AVPKDGKLVL LNSFNVPAIT KFSRPVFGNA MLYIGTTLGY VYGFGAPVNS
PLNCSSPVDF GSVDINNSSS ARPVTCQATI DLTVTSVGLG EEKHFGISDS IKLPLQLSQG
QSFSVNATFV PTSVGFQSTD LLVNSTNSVA GFSTNTHARL TGTGRSVRAL LALSPSTITF
KGVVTGQDPG GVTESLVVSN RGSSPLKVTS ILYSADNSTG PFQPWDGQGH LVVGKFTLQN
IPTTVPANSG VTVSVAFSST ASGTFTGFVK FVTDGGNGTV SIAGSSGPAA IALLEFQTPD
GQDWVKYQSG TPFHFGNVTE NTSRSLKFRV TNKAPPGGVK LSLTVSKPPF GVDSIVRAAN
QVDLAEGSSF GPGESGTAFL TCAAPKSQWN MPVYNSTAQW TMNTNDPEFG KQFIQFFCNA
VAEQAPPLLP NGQGRYQYIG CFKDNTPGRQ LPNQLLDSVK LTNAQCIATC AKSNYVFCGT
QYLRECWAGN QIPLQKVDDT NCNYNCAGDI NQICGGNGAG DGAFMSLFAD SLQWNGNYTK
PPVSGPSVNP GVSGFASMGC FTEPANARAL PNGVGTEKKT VAACVNACKG AMYSYAGLEY
GGECWCGSQL ATGSVHAPDL ECGMNCNDNA TEYCGGPGRL NLYRLGAPPT SSSTPGSTTT
TTGPTSTSSS ASVPTPTAPA VKNTVGKYRF QGCWTEATNS RALVGSSYAD DKMTLEACAK
FCDAFTYFAT EYGRECYCGN SIQEGSVKAI NQNDCSFPCA GDGTGYCGAA NRLQLYKLSD
SATTMPPSAS LTGAS
//