GenomeNet

Database: UniProt
Entry: K0R4S4_THAOC
LinkDB: K0R4S4_THAOC
Original site: K0R4S4_THAOC 
ID   K0R4S4_THAOC            Unreviewed;       353 AA.
AC   K0R4S4;
DT   28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT   28-NOV-2012, sequence version 1.
DT   27-MAR-2024, entry version 33.
DE   RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
DE   Flags: Fragment;
GN   ORFNames=THAOC_34577 {ECO:0000313|EMBL:EJK46739.1};
OS   Thalassiosira oceanica (Marine diatom).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK46739.1, ECO:0000313|Proteomes:UP000266841};
RN   [1] {ECO:0000313|EMBL:EJK46739.1, ECO:0000313|Proteomes:UP000266841}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK46739.1,
RC   ECO:0000313|Proteomes:UP000266841};
RX   PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA   Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA   Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA   Rosenstiel P., Hippler M., Laroche J.;
RT   "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT   limitation.";
RL   Genome Biol. 13:R66-R66(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EJK46739.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGNL01047566; EJK46739.1; -; Genomic_DNA.
DR   AlphaFoldDB; K0R4S4; -.
DR   EnsemblProtists; EJK46739; EJK46739; THAOC_34577.
DR   Proteomes; UP000266841; Unassembled WGS sequence.
DR   GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR001254; Trypsin_dom.
DR   InterPro; IPR018114; TRYPSIN_HIS.
DR   PANTHER; PTHR24260; -; 1.
DR   PANTHER; PTHR24260:SF136; PEPTIDASE S1 DOMAIN-CONTAINING PROTEIN; 1.
DR   Pfam; PF00089; Trypsin; 1.
DR   SMART; SM00020; Tryp_SPc; 1.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS50240; TRYPSIN_DOM; 1.
DR   PROSITE; PS00134; TRYPSIN_HIS; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000266841};
KW   Virulence {ECO:0000256|ARBA:ARBA00023026}.
FT   DOMAIN          63..288
FT                   /note="Peptidase S1"
FT                   /evidence="ECO:0000259|PROSITE:PS50240"
FT   REGION          1..34
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          293..353
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        311..353
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         1
FT                   /evidence="ECO:0000313|EMBL:EJK46739.1"
SQ   SEQUENCE   353 AA;  38395 MW;  7D678C9298747C23 CRC64;
     STITNDGGTP PKGPRVLSPS LRRGLPAGPR DTDDVPRYKT ISWAVHDDWT PSHDPGGDLA
     RVILSAWGTS AIVTKETPQQ AITSLDSAKF FVSLFDRGDC AGAVIGDNYV VTAAHCVCGL
     EEVNVIDYMN DEHYAVATYT NPDRPFNCNR DGPNSNDVAV VEFSGRPFRN HDARPVYSSS
     DEVGKTIWIL GMGIHGQPDD FPNARACRNG VQDSNLREGY NTVDQADGIL AYTMKPSTGG
     PALINVNGEW QLAGVNSGTD ENNSCDWGSV DQYCRLSEHA GWISRTMGDA VEQDDTVGQD
     DTVGQDDAVG QDDAVEQDDT VGQDDDDDGG DDDDDDDDGY YYYIDDDDDN YYY
//
DBGET integrated database retrieval system