ID K0R4S4_THAOC Unreviewed; 353 AA.
AC K0R4S4;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE RecName: Full=Peptidase S1 domain-containing protein {ECO:0000259|PROSITE:PS50240};
DE Flags: Fragment;
GN ORFNames=THAOC_34577 {ECO:0000313|EMBL:EJK46739.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK46739.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK46739.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK46739.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK46739.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01047566; EJK46739.1; -; Genomic_DNA.
DR AlphaFoldDB; K0R4S4; -.
DR EnsemblProtists; EJK46739; EJK46739; THAOC_34577.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR001254; Trypsin_dom.
DR InterPro; IPR018114; TRYPSIN_HIS.
DR PANTHER; PTHR24260; -; 1.
DR PANTHER; PTHR24260:SF136; PEPTIDASE S1 DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00089; Trypsin; 1.
DR SMART; SM00020; Tryp_SPc; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS50240; TRYPSIN_DOM; 1.
DR PROSITE; PS00134; TRYPSIN_HIS; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000266841};
KW Virulence {ECO:0000256|ARBA:ARBA00023026}.
FT DOMAIN 63..288
FT /note="Peptidase S1"
FT /evidence="ECO:0000259|PROSITE:PS50240"
FT REGION 1..34
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 293..353
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 311..353
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:EJK46739.1"
SQ SEQUENCE 353 AA; 38395 MW; 7D678C9298747C23 CRC64;
STITNDGGTP PKGPRVLSPS LRRGLPAGPR DTDDVPRYKT ISWAVHDDWT PSHDPGGDLA
RVILSAWGTS AIVTKETPQQ AITSLDSAKF FVSLFDRGDC AGAVIGDNYV VTAAHCVCGL
EEVNVIDYMN DEHYAVATYT NPDRPFNCNR DGPNSNDVAV VEFSGRPFRN HDARPVYSSS
DEVGKTIWIL GMGIHGQPDD FPNARACRNG VQDSNLREGY NTVDQADGIL AYTMKPSTGG
PALINVNGEW QLAGVNSGTD ENNSCDWGSV DQYCRLSEHA GWISRTMGDA VEQDDTVGQD
DTVGQDDAVG QDDAVEQDDT VGQDDDDDGG DDDDDDDDGY YYYIDDDDDN YYY
//