ID G4U8N2_NEUT9 Unreviewed; 1110 AA.
AC G4U8N2;
DT 14-DEC-2011, integrated into UniProtKB/TrEMBL.
DT 14-DEC-2011, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE RecName: Full=GH18 domain-containing protein {ECO:0000259|PROSITE:PS51910};
GN ORFNames=NEUTE2DRAFT_81058 {ECO:0000313|EMBL:EGZ78457.1};
OS Neurospora tetrasperma (strain FGSC 2509 / P0656).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Sordariomycetidae; Sordariales; Sordariaceae; Neurospora.
OX NCBI_TaxID=510952 {ECO:0000313|EMBL:EGZ78457.1, ECO:0000313|Proteomes:UP000008513};
RN [1] {ECO:0000313|EMBL:EGZ78457.1, ECO:0000313|Proteomes:UP000008513}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=strain FGSC 2509 / P0656 {ECO:0000313|Proteomes:UP000008513};
RX PubMed=21750257; DOI=10.1534/genetics.111.130690;
RA Ellison C.E., Stajich J.E., Jacobson D.J., Natvig D.O., Lapidus A.,
RA Foster B., Aerts A., Riley R., Lindquist E.A., Grigoriev I.V., Taylor J.W.;
RT "Massive changes in genome architecture accompany the transition to self-
RT fertility in the filamentous fungus Neurospora tetrasperma.";
RL Genetics 189:55-69(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL890999; EGZ78457.1; -; Genomic_DNA.
DR AlphaFoldDB; G4U8N2; -.
DR STRING; 510952.G4U8N2; -.
DR eggNOG; KOG4701; Eukaryota.
DR HOGENOM; CLU_005114_0_0_1; -.
DR Proteomes; UP000008513; Unassembled WGS sequence.
DR GO; GO:0008061; F:chitin binding; IEA:UniProtKB-KW.
DR GO; GO:0004568; F:chitinase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR001223; Glyco_hydro18_cat.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR45708; ENDOCHITINASE; 1.
DR PANTHER; PTHR45708:SF49; ENDOCHITINASE; 1.
DR Pfam; PF00704; Glyco_hydro_18; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR PROSITE; PS51910; GH18_2; 1.
PE 4: Predicted;
KW Chitin-binding {ECO:0000256|ARBA:ARBA00022669};
KW Reference proteome {ECO:0000313|Proteomes:UP000008513};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1110
FT /note="GH18 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003469704"
FT DOMAIN 23..310
FT /note="GH18"
FT /evidence="ECO:0000259|PROSITE:PS51910"
FT REGION 491..534
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 573..592
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 618..641
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1004..1027
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1051..1085
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1051..1080
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1110 AA; 108678 MW; DEA302D4B11C4E08 CRC64;
MHKSLVAAAV LAATARAHAL TEAKVNVYLG QKGDARLRDH CDQANFDYVT IGAHCDATYY
TNGTTSGQMN GKCSIVASDI KHCQDKGKKV LLSLDGVEHM GSRFSLSSEA KAEEFASFLW
GAFGPYDAKW TGPRPFDYAG HRVSVDGFNL DGELKLNGAG EGAYAAMAKK LRELYNGNGE
LLLTAAPGCS LDDVKLKAIF DNAQFDALFI QFYNNPSCEA GSASGFNYLQ WEQAIAAGMS
KEAKLFIGLA GASDAAGSGY IEPLEAAALI NTYKTRSSFG GAMVLDAFRG QTLANGMTFL
DVINTVVSSA EAVDLSSEFC EDESALPKVP SVTEGSAGGF FTVADPSAIT SGPVVLPSGG
SSEDEVCEDE NVIPKVPSVT DGAHEVNPTI VDTSIITSVP VALPSGDRSE IIGGSPATED
DVCEGEIMTE DPLRSGVVPT GIMPTGALPS GVLPSGDRSE IIGAIPSGSV PLGSAPSGLV
PSGSIPLGSA PSGLVPSGTD PVRSGLVPSG AVPSGAVPSG AVPSGDRSDI IGAIPSGSIP
LGSAPSGLVP SGAVPSGDRS EIIGAIPSGS IPLGSVPSGA APSGDRSEII GAIPSGSIPL
GSAPSGAVPS GLVPSGIVPS GAVPSGDRSE IIGSPADEDD YCEGEITPED PVRSGLVPSG
IVPTGAVPTG LVPSGAVIGS TVPGGVIPSG VAPGGILPSG VVSSVGSKVT GPIDGDNVAL
PSVGLPSGAI ASGVVPSGVV PSGVVSSIGS KVTGPIDGDN VALPSVGLPS GDLPSGVVPS
GVVPSGVLSS VGSKVTGPIN GHNVAVPSAG VPSGVVPSGV LSNVGSKVTG PIDGDNVVPT
GILPSGSVPS GVVSSVVSMV TADPINGDKI ADPSAVTAPA EWTTSTIYAT TTSTITSCAP
EVTDCPAKIG QVTTVTVPIG VTVCPVTATE TAARAPTGIF TSVPVESIPA GFTTSTVYST
TTSTIASCAP EMTDCAGKIG QVATSSIVSV SSMPFTTITV AKPAASAPGA PGAPGAGVPA
ESAAVPAGGA PAVPSAVNAP VVPSGAVPTG TGVSVAPSSA PSTYSMPAPP AQTEPATGSE
PSEVPVTAGA GRNVVAMGVP ALMAALVLAL
//