ID R7HSS2_9CLOT Unreviewed; 1410 AA.
AC R7HSS2;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE SubName: Full=Subtilase {ECO:0000313|EMBL:CDE42276.1};
GN ORFNames=BN648_01205 {ECO:0000313|EMBL:CDE42276.1};
OS Clostridium sp. CAG:411.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262802 {ECO:0000313|EMBL:CDE42276.1, ECO:0000313|Proteomes:UP000018022};
RN [1] {ECO:0000313|EMBL:CDE42276.1, ECO:0000313|Proteomes:UP000018022}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:411 {ECO:0000313|Proteomes:UP000018022};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase S8 family. {ECO:0000256|PROSITE-
CC ProRule:PRU01240, ECO:0000256|RuleBase:RU003355}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDE42276.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBIY010000022; CDE42276.1; -; Genomic_DNA.
DR STRING; 1262802.BN648_01205; -.
DR MEROPS; S08.026; -.
DR Proteomes; UP000018022; Unassembled WGS sequence.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.1080; -; 2.
DR Gene3D; 2.120.10.80; Kelch-type beta propeller; 1.
DR Gene3D; 3.40.50.200; Peptidase S8/S53 domain; 2.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR015915; Kelch-typ_b-propeller.
DR InterPro; IPR006652; Kelch_1.
DR InterPro; IPR000209; Peptidase_S8/S53_dom.
DR InterPro; IPR036852; Peptidase_S8/S53_dom_sf.
DR InterPro; IPR023827; Peptidase_S8_Asp-AS.
DR InterPro; IPR022398; Peptidase_S8_His-AS.
DR InterPro; IPR023828; Peptidase_S8_Ser-AS.
DR InterPro; IPR015500; Peptidase_S8_subtilisin-rel.
DR PANTHER; PTHR42884:SF14; NEUROENDOCRINE CONVERTASE 1; 1.
DR PANTHER; PTHR42884; PROPROTEIN CONVERTASE SUBTILISIN/KEXIN-RELATED; 1.
DR Pfam; PF02368; Big_2; 2.
DR Pfam; PF01344; Kelch_1; 1.
DR Pfam; PF00082; Peptidase_S8; 2.
DR PRINTS; PR00723; SUBTILISIN.
DR SMART; SM00635; BID_2; 2.
DR SMART; SM00612; Kelch; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 2.
DR SUPFAM; SSF117281; Kelch motif; 1.
DR SUPFAM; SSF52743; Subtilisin-like; 1.
DR PROSITE; PS51892; SUBTILASE; 1.
DR PROSITE; PS00136; SUBTILASE_ASP; 1.
DR PROSITE; PS00137; SUBTILASE_HIS; 1.
DR PROSITE; PS00138; SUBTILASE_SER; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|PROSITE-
KW ProRule:PRU01240};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|PROSITE-
KW ProRule:PRU01240}; Reference proteome {ECO:0000313|Proteomes:UP000018022};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825, ECO:0000256|PROSITE-
KW ProRule:PRU01240}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1410
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5038696701"
FT DOMAIN 1227..1306
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 1325..1405
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT REGION 38..57
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT ACT_SITE 206
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR615500-1,
FT ECO:0000256|PROSITE-ProRule:PRU01240"
FT ACT_SITE 250
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR615500-1,
FT ECO:0000256|PROSITE-ProRule:PRU01240"
FT ACT_SITE 636
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR615500-1,
FT ECO:0000256|PROSITE-ProRule:PRU01240"
SQ SEQUENCE 1410 AA; 152454 MW; 3BC9F3779ED99E75 CRC64;
MKRFTKSTLS LAMAVALGVS GMPTLGADMV SQAAQKTEKQ NAKEKDGHNL LGEDLTKQKE
SKYEDGQAIV MYRNTTANVK KFAKGAAFGD GIKVESSCTF ANNTEQKNGK VSTKKLAGKG
GYTVSLVSST RYSTNQLIEK LKKDSEVLYA QPNYVCKADN TADFTSYQWA LDNQGQNNGT
KGVDVGIKGV DTSKYTSDEK VIAIIDSGVD YTHDDLKNVI WNNPYTSELK GKHGYDFANG
DADPLDDNGH GSHCAGIIAG DSTDQSGISG VADGNVKIMA LKFLDADGYG DTYSAISAYN
YIYTAQSLGT NVVAVNNSWG GEIDYTYGDS ILEEVINLVG KNGAVSVCAA SNDGTDNDEN
QNISPACLNS DYIISVAAAN ERGELASFSN YGAKSVDIAA PGADILSAVS YNNFEPALYQ
EANNYCSYYL DFSQKLQECK KEDLETVDVS DNSIIHYAVE NAGDANVSVE VSNQEYVSAA
NGNKNSLKWS IKNAKAGDVY SLYIPYEREE SSTPYHQTFH MRSDVPVMDQ DKMINDWSYM
PSTMNIYDSK VTTGGAITDE NLGYILLNGD ANYWSQVEYT REDKITKRTA GRYAYEFSVE
VSDNGDFTFY IDEVGFSKPN VKEESFGKYA YYNGTSMAAP YVTGSVAVAK SLYPKDNAWE
TKERIVNSAK QVDALKGKVA SNGMVDLSNL DNPRPAVSGA SVTADGVATV EGKFFGSNPS
VTVNGEQVAL VSKSDDKVSF KVKKNSLLNV KISTDKGSVE KVLFFTEGKE SDKVAYAVNR
GSAVEAVSDG DNLYLVGEDG SLIKYAVGEK RSYIDTSEDS GITLPMLAEN LCVEQFEVTE
IFGEDKKTAV NYELSTASQP VSCGKEIYTI AELDMGFAMD KALVHYSEEK GTWEKVAAIP
KELKSVKGSA LAVYKNNLYL IGGYDVDKEQ TLSNVYCYDL SKKEWNKVAN LPEGKFNAQA
VQTNGKLLVT LGGTGDKATK GSNKTFIFDG TNWTQAAELP AVYGEDSEQA QIEYADTLVN
DENTIVSTGE KTAYRVLNYY KAAVGVTDAG VIFAGLKAEG LGNTFTYDVS ANTYTSANIC
YTSLDTEDDV TGATVGDKFY VVSGQTWELD LDDLFSKLQT SNVEKKAQEQ TTQPIVDLGD
SEQYIYLVSG IDVVNTPIKV KQEKTYEEGY ISGAGNYTLG ENATVKAIPY EGSFVKALYV
DGQKVENGYT FQVTEKGAVV KAEFGKYVAA VMLNEEAEVS AGGTLKMMPY IMPMDADDLT
LVWKSSDESI ATVDKNGVVK AAEDAAGKTV EITATAADQN KVTATCLVTI TKKQGNKPSD
TKKVAVKKIK LSATKKTVKA GKSLKIKATI TPVKATNKKL TWKSSKKKYA TVNSKGVVKA
KKAGKGHTVK ITATSVSNPK VKGTIKIKIK
//