ID K0SVQ0_THAOC Unreviewed; 659 AA.
AC K0SVQ0;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 24-JAN-2024, entry version 34.
DE RecName: Full=GH18 domain-containing protein {ECO:0000259|PROSITE:PS51910};
DE Flags: Fragment;
GN ORFNames=THAOC_09219 {ECO:0000313|EMBL:EJK69515.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK69515.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK69515.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK69515.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK69515.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01009960; EJK69515.1; -; Genomic_DNA.
DR AlphaFoldDB; K0SVQ0; -.
DR EnsemblProtists; EJK69515; EJK69515; THAOC_09219.
DR eggNOG; ENOG502RZQ1; Eukaryota.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR004302; Cellulose/chitin-bd_N.
DR InterPro; IPR001223; Glyco_hydro18_cat.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR Pfam; PF00704; Glyco_hydro_18; 1.
DR Pfam; PF03067; LPMO_10; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR PROSITE; PS51910; GH18_2; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT DOMAIN 275..593
FT /note="GH18"
FT /evidence="ECO:0000259|PROSITE:PS51910"
FT REGION 42..66
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 596..640
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 46..63
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 612..636
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 659
FT /evidence="ECO:0000313|EMBL:EJK69515.1"
SQ SEQUENCE 659 AA; 70931 MW; C1495AAAB6B702C2 CRC64;
MKFKHSTLSI IAAASIAPTQ FADAHGYLKS PKSRNYYANT NGKWYGGDRE RPRARELPPL
SEPRGHRRVP PVVQACYSPG SVVEFESVLT AHHMGHFEFR ACPVSPGEVP TQECFDSNPL
TFVEDPLYGA VPDPNHPERA YIPNAGVSPW TYKHRYQLPS NLEGELVIIQ WYYLTANSCN
PEGYGDYNFA PLGISGNSLA QCGYPLPSNG VPGKPEQFWN CAEVKISTDC GSTPFPTKQP
SVPITKSPVA VSTSRPTISN APTFKSVVAE SREDSRLIAY LANWQACPTD DMLDAYTHIV
IAFAVSYTWS AAKNNCDTSC SVSAPPTCGN QVRQDLIDKW RGQGKKVVLS FGGAGMGGSW
PGDSNNCWDY CFGKEDKVST QLVSIVQKQN LDGIDLDYGT QSGMCTAKDT SLFPTKASFD
TAAQNFLTGI TSNLRQKMDA LGDNYELTHA PMDSDLDPTS KYYQILKDQN ENLNYLMPQF
YNGYIKVVSD GFTGTGAGAY SAESVYSNLA MDLFPNRPDK ASSNRTVVFG FCVNGCGGTG
SNANGLQAVS VLQQVKEFDQ GQYSCNGGAF FWVTSADASG SWSDPVAAEL ALTAGCLDGT
TPPPPSPPTT TTNPTSGLTL QPTRPSTSQP TDAVTLKPTN APIEVGPSVA VFDAALGAP
//