ID E4U8U9_OCEP5 Unreviewed; 336 AA.
AC E4U8U9;
DT 08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT 08-FEB-2011, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE SubName: Full=Peptidase S41 {ECO:0000313|EMBL:ADR36779.1};
DE Flags: Precursor;
GN OrderedLocusNames=Ocepr_1322 {ECO:0000313|EMBL:ADR36779.1};
OS Oceanithermus profundus (strain DSM 14977 / NBRC 100410 / VKM B-2274 /
OS 506).
OC Bacteria; Deinococcota; Deinococci; Thermales; Thermaceae; Oceanithermus.
OX NCBI_TaxID=670487 {ECO:0000313|EMBL:ADR36779.1, ECO:0000313|Proteomes:UP000008722};
RN [1] {ECO:0000313|Proteomes:UP000008722}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 14977 / NBRC 100410 / VKM B-2274 / 506
RC {ECO:0000313|Proteomes:UP000008722};
RG US DOE Joint Genome Institute (JGI-PGF);
RA Lucas S., Copeland A., Lapidus A., Bruce D., Goodwin L., Pitluck S.,
RA Kyrpides N., Mavromatis K., Pagani I., Ivanova N., Zhang X., Brettin T.,
RA Detter J.C., Tapia R., Han C., Land M., Hauser L., Markowitz V.,
RA Cheng J.-F., Hugenholtz P., Woyke T., Wu D., Tindall B., Faehnrich R.,
RA Brambilla E., Klenk H.-P., Eisen J.A.;
RT "The complete sequence of chromosome of Oceanithermus profundus DSM
RT 14977.";
RL Submitted (NOV-2010) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:ADR36779.1, ECO:0000313|Proteomes:UP000008722}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 14977 / NBRC 100410 / VKM B-2274 / 506
RC {ECO:0000313|Proteomes:UP000008722};
RX PubMed=21677858; DOI=10.4056/sigs.1734292;
RA Pati A., Zhang X., Lapidus A., Nolan M., Lucas S., Del Rio T.G., Tice H.,
RA Cheng J.F., Tapia R., Han C., Goodwin L., Pitluck S., Liolios K.,
RA Pagani I., Ivanova N., Mavromatis K., Chen A., Palaniappan K., Hauser L.,
RA Jeffries C.D., Brambilla E.M., Rohl A., Mwirichia R., Rohde M.,
RA Tindall B.J., Sikorski J., Wirth R., Goker M., Woyke T., Detter J.C.,
RA Bristow J., Eisen J.A., Markowitz V., Hugenholtz P., Kyrpides N.C.,
RA Klenk H.P., Land M.;
RT "Complete genome sequence of Oceanithermus profundus type strain (506).";
RL Stand. Genomic Sci. 4:210-220(2011).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP002361; ADR36779.1; -; Genomic_DNA.
DR AlphaFoldDB; E4U8U9; -.
DR STRING; 670487.Ocepr_1322; -.
DR KEGG; opr:Ocepr_1322; -.
DR eggNOG; COG0793; Bacteria.
DR HOGENOM; CLU_048401_0_0_0; -.
DR Proteomes; UP000008722; Chromosome.
DR GO; GO:0008236; F:serine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd07562; Peptidase_S41_TRI; 1.
DR Gene3D; 3.30.750.44; -; 1.
DR InterPro; IPR029045; ClpP/crotonase-like_dom_sf.
DR InterPro; IPR005151; Tail-specific_protease.
DR InterPro; IPR028204; Tricorn_C1.
DR PANTHER; PTHR32060:SF22; CARBOXYL-TERMINAL-PROCESSING PEPTIDASE 2, CHLOROPLASTIC-RELATED; 1.
DR PANTHER; PTHR32060; TAIL-SPECIFIC PROTEASE; 1.
DR Pfam; PF03572; Peptidase_S41; 1.
DR Pfam; PF14684; Tricorn_C1; 1.
DR SMART; SM00245; TSPc; 1.
DR SUPFAM; SSF52096; ClpP/crotonase; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008722};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..18
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 19..336
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003188674"
FT DOMAIN 101..320
FT /note="Tail specific protease"
FT /evidence="ECO:0000259|SMART:SM00245"
FT REGION 107..144
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 107..123
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 336 AA; 35973 MW; 167DE47133064EC8 CRC64;
MMRSLLLTLV LASGLAAAAA PDYLTRFDQA WRLVDEFYWD EDHLGVDWRA IGERYRPQVA
AAASWDEVYR LLDRMYGELG DDHSRVLAPD EARAALRGAL CLPLPFPVER PQPPAPGPAA
PAPEGDGPGS AAPDAPGGGA EAGWDPFSYR RLEDGIAYLR VPQLVEPDVA RRLAEAVRGL
ESEGVQGYIL DLRDNPGGLA YVMAEVAGVF MRGLPWRIVT RTKGVTPQPT LPFWGRPLTD
KPLVVLINRN VNSAAEGLAG ALKRAGRARL VGETTAGNTE VVLPYCFPDG GVAMLASGVL
APVGAPTWEG RGVEPDVAVA DEGAQLEAAV KLLRTR
//