ID K0SRX4_THAOC Unreviewed; 747 AA.
AC K0SRX4;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 22-FEB-2023, entry version 24.
DE RecName: Full=Chitin-binding type-2 domain-containing protein {ECO:0000259|PROSITE:PS50940};
GN ORFNames=THAOC_09698 {ECO:0000313|EMBL:EJK69078.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK69078.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK69078.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK69078.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK69078.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01010472; EJK69078.1; -; Genomic_DNA.
DR AlphaFoldDB; K0SRX4; -.
DR EnsemblProtists; EJK69078; EJK69078; THAOC_09698.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:InterPro.
DR GO; GO:0008061; F:chitin binding; IEA:InterPro.
DR Gene3D; 2.170.140.10; Chitin binding domain; 2.
DR InterPro; IPR002557; Chitin-bd_dom.
DR InterPro; IPR036508; Chitin-bd_dom_sf.
DR PANTHER; PTHR21113; AGAP001705-PA; 1.
DR PANTHER; PTHR21113:SF4; CHITIN-BINDING TYPE-4 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF01607; CBM_14; 2.
DR SMART; SM00494; ChtBD2; 2.
DR SUPFAM; SSF57625; Invertebrate chitin-binding proteins; 2.
DR PROSITE; PS50940; CHIT_BIND_II; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT DOMAIN 298..357
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT DOMAIN 431..489
FT /note="Chitin-binding type-2"
FT /evidence="ECO:0000259|PROSITE:PS50940"
FT REGION 1..38
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 54..80
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 132..241
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 265..291
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 389..417
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 494..519
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..22
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 133..150
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 501..517
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 747 AA; 79485 MW; 8E7EB160DE456804 CRC64;
MAVDDMRPRP PLSRKRGRAE GRATQLTEGS APRVRRGGRR ARTALSLALL SAAPSRGARL
SKQSMLNNEA GGPPPGRAEA GIYPGGTYGL EAQYDRVLRA NEARAAAYAS AVAVDVDVTG
AGWEDVFGEG RSLQTVHSKT SSQSASGRLG TFDRPPQPSK AEPTSGDGPR RQEDGSDGGP
ADGAPDKSAP RTTGELPARL LRGRGHLRPA AGPPSCRTLR LPRVRGGRGE TSAGGSAGGA
GKANYVTEIK QQSSYAGSFA SSFASSGNGG GSGADKPGAG TPQTSPGMVQ TQSSLELASI
CSSQKSFATL PLLTTYCADY VECSEGEVAR EMSCPTGLAF DSTIKGCNWI SQVECPPYVP
PKTNSDFSLG DLGFGEDDDF DGLVVEPDDE EEPVYTVPHG KGDGKGGGGN GSYSNNNSGG
VSNIATFSEE DSICDGAPDG TTPLLTTECR DYVNCVGGKV DEKLSCPEGL AFSAAFSGCQ
WASSVICPTA APTTPVPTTM GPTGKGPPPP TRRPTPMPTK RPVYTKAPTT EAPVAAPVDL
WEWIKSRRRR LNQEIFQSKT KDGVSYRSYW FQYQDFYRAL EIVSDEDASI TGKRKHVFYT
GTERGAEQRD YEYGLVNIAA FLSQAMTESI QHDACDEFNV EQNTDSDNGK AHYALSNSCG
QFGLNYQEFG CARPDEADMA CPVDRTMVLQ ATTSQIYPNA PPPLECRPRS TGESFTGFWD
VGMGSEQNYY PYENSFGRTD VEGCCWW
//