Database: UniProt
Entry: K0SVQ0_THAOC
ID   K0SVQ0_THAOC            Unreviewed;       659 AA.
AC   K0SVQ0;
DT   28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT   28-NOV-2012, sequence version 1.
DT   24-JAN-2024, entry version 34.
DE   RecName: Full=GH18 domain-containing protein {ECO:0000259|PROSITE:PS51910};
DE   Flags: Fragment;
GN   ORFNames=THAOC_09219 {ECO:0000313|EMBL:EJK69515.1};
OS   Thalassiosira oceanica (Marine diatom).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK69515.1, ECO:0000313|Proteomes:UP000266841};
RN   [1] {ECO:0000313|EMBL:EJK69515.1, ECO:0000313|Proteomes:UP000266841}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK69515.1,
RC   ECO:0000313|Proteomes:UP000266841};
RX   PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA   Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA   Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA   Rosenstiel P., Hippler M., Laroche J.;
RT   "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT   limitation.";
RL   Genome Biol. 13:R66-R66(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EJK69515.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGNL01009960; EJK69515.1; -; Genomic_DNA.
DR   AlphaFoldDB; K0SVQ0; -.
DR   EnsemblProtists; EJK69515; EJK69515; THAOC_09219.
DR   eggNOG; ENOG502RZQ1; Eukaryota.
DR   Proteomes; UP000266841; Unassembled WGS sequence.
DR   GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR   Gene3D; 3.20.20.80; Glycosidases; 1.
DR   InterPro; IPR004302; Cellulose/chitin-bd_N.
DR   InterPro; IPR001223; Glyco_hydro18_cat.
DR   InterPro; IPR017853; Glycoside_hydrolase_SF.
DR   Pfam; PF00704; Glyco_hydro_18; 1.
DR   Pfam; PF03067; LPMO_10; 1.
DR   SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR   PROSITE; PS51910; GH18_2; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT   DOMAIN          275..593
FT                   /note="GH18"
FT                   /evidence="ECO:0000259|PROSITE:PS51910"
FT   REGION          42..66
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          596..640
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        46..63
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        612..636
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         659
FT                   /evidence="ECO:0000313|EMBL:EJK69515.1"
SQ   SEQUENCE   659 AA;  70931 MW;  C1495AAAB6B702C2 CRC64;
     MKFKHSTLSI IAAASIAPTQ FADAHGYLKS PKSRNYYANT NGKWYGGDRE RPRARELPPL
     SEPRGHRRVP PVVQACYSPG SVVEFESVLT AHHMGHFEFR ACPVSPGEVP TQECFDSNPL
     TFVEDPLYGA VPDPNHPERA YIPNAGVSPW TYKHRYQLPS NLEGELVIIQ WYYLTANSCN
     PEGYGDYNFA PLGISGNSLA QCGYPLPSNG VPGKPEQFWN CAEVKISTDC GSTPFPTKQP
     SVPITKSPVA VSTSRPTISN APTFKSVVAE SREDSRLIAY LANWQACPTD DMLDAYTHIV
     IAFAVSYTWS AAKNNCDTSC SVSAPPTCGN QVRQDLIDKW RGQGKKVVLS FGGAGMGGSW
     PGDSNNCWDY CFGKEDKVST QLVSIVQKQN LDGIDLDYGT QSGMCTAKDT SLFPTKASFD
     TAAQNFLTGI TSNLRQKMDA LGDNYELTHA PMDSDLDPTS KYYQILKDQN ENLNYLMPQF
     YNGYIKVVSD GFTGTGAGAY SAESVYSNLA MDLFPNRPDK ASSNRTVVFG FCVNGCGGTG
     SNANGLQAVS VLQQVKEFDQ GQYSCNGGAF FWVTSADASG SWSDPVAAEL ALTAGCLDGT
     TPPPPSPPTT TTNPTSGLTL QPTRPSTSQP TDAVTLKPTN APIEVGPSVA VFDAALGAP
//