ID R6HBQ6_9FIRM Unreviewed; 2786 AA.
AC R6HBQ6;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Subtilisin-like serine protease {ECO:0000313|EMBL:CDB31364.1};
GN ORFNames=BN490_00303 {ECO:0000313|EMBL:CDB31364.1};
OS Firmicutes bacterium CAG:137.
OC Bacteria; Bacillota.
OX NCBI_TaxID=1263004 {ECO:0000313|EMBL:CDB31364.1, ECO:0000313|Proteomes:UP000018011};
RN [1] {ECO:0000313|EMBL:CDB31364.1, ECO:0000313|Proteomes:UP000018011}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:137 {ECO:0000313|Proteomes:UP000018011};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase S8 family. {ECO:0000256|PROSITE-
CC ProRule:PRU01240, ECO:0000256|RuleBase:RU003355}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDB31364.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBDK010000635; CDB31364.1; -; Genomic_DNA.
DR Proteomes; UP000018011; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:InterPro.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:UniProtKB-UniRule.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd07475; Peptidases_S8_C5a_Peptidase; 1.
DR Gene3D; 2.60.40.1080; -; 1.
DR Gene3D; 3.40.50.12480; -; 1.
DR Gene3D; 3.50.30.30; -; 1.
DR Gene3D; 3.40.50.200; Peptidase S8/S53 domain; 1.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 6.
DR Gene3D; 2.60.40.1710; Subtilisin-like superfamily; 1.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR034216; C5a_Peptidase.
DR InterPro; IPR010435; Fn3_5.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR026906; LRR_5.
DR InterPro; IPR032675; LRR_dom_sf.
DR InterPro; IPR046450; PA_dom_sf.
DR InterPro; IPR003137; PA_domain.
DR InterPro; IPR000209; Peptidase_S8/S53_dom.
DR InterPro; IPR036852; Peptidase_S8/S53_dom_sf.
DR InterPro; IPR023827; Peptidase_S8_Asp-AS.
DR InterPro; IPR023828; Peptidase_S8_Ser-AS.
DR InterPro; IPR015500; Peptidase_S8_subtilisin-rel.
DR InterPro; IPR001119; SLH_dom.
DR PANTHER; PTHR45661:SF3; RICH REPEAT DOMAIN PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45661; SURFACE ANTIGEN; 1.
DR Pfam; PF02368; Big_2; 1.
DR Pfam; PF06280; fn3_5; 1.
DR Pfam; PF13306; LRR_5; 7.
DR Pfam; PF02225; PA; 1.
DR Pfam; PF00082; Peptidase_S8; 1.
DR Pfam; PF00395; SLH; 3.
DR PRINTS; PR00723; SUBTILISIN.
DR SMART; SM00635; BID_2; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 1.
DR SUPFAM; SSF52058; L domain-like; 3.
DR SUPFAM; SSF52025; PA domain; 1.
DR SUPFAM; SSF52743; Subtilisin-like; 1.
DR PROSITE; PS51272; SLH; 3.
DR PROSITE; PS51892; SUBTILASE; 1.
DR PROSITE; PS00136; SUBTILASE_ASP; 1.
DR PROSITE; PS00138; SUBTILASE_SER; 1.
PE 3: Inferred from homology;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|PROSITE-
KW ProRule:PRU01240};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|PROSITE-
KW ProRule:PRU01240}; Reference proteome {ECO:0000313|Proteomes:UP000018011};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Secreted {ECO:0000256|ARBA:ARBA00022512};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825, ECO:0000256|PROSITE-
KW ProRule:PRU01240}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..2786
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004408174"
FT DOMAIN 2603..2666
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 2667..2726
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 2730..2786
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT ACT_SITE 224
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR615500-1,
FT ECO:0000256|PROSITE-ProRule:PRU01240"
FT ACT_SITE 292
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR615500-1,
FT ECO:0000256|PROSITE-ProRule:PRU01240"
FT ACT_SITE 627
FT /note="Charge relay system"
FT /evidence="ECO:0000256|PIRSR:PIRSR615500-1,
FT ECO:0000256|PROSITE-ProRule:PRU01240"
SQ SEQUENCE 2786 AA; 299343 MW; AAC9FFCB8679299D CRC64;
MKKRLLACLL ILSLFVTGLG FVPAVQAAED GISAQWTTKE NLAPADQEVM LRPEKGDLSE
DSASEITGES NQYQDSDMVT VIVELEDAPL AEAAPTDVKS FAASTQGVAL ERELRSTQDA
IKAEIRSWGG AVSTQSVNGA SDLEYSYTTV LNGFSMKMPY GDIARARELD GVKRIFVAEQ
YSLPTTLGED EYTISMTSST GMVGANEANE LGYDGTGTIV AILDTGFDAD HEAFSVMPTG
GKYSKSDVAQ LLKGKLSCGV SNVDDVYINE KAPFGYDYAG GDPNAEAVGQ SHGVHVAGTV
AGNNGDDFKG VAPNAQLMIM KIFADNSGST GDDIILAGVD DAVKLGADSI NMSLGSPAGF
TEYGDENEEE TDSYLTYYGV YTRAQEAGVN VMVAAGNETS STYYNPAGTQ LTLAQYPDSA
IVASPSTLEA SVSVASVDNV GYFRNHFALG ETKIPYNGGV DYDSQVETNI LDTMEGQTLE
YVPVPGLGEE KDFEGLDLTG KVALVARGSI NFDVKAANAA AVGAVAIIIY NNTDEGLFYA
SLQSHSIPVI TIAKQDVQAM LDAPEKKITF SSDYYGKALN YNGYQVSSFS SIGPAPDLTI
KPEIAAPGGQ IYSSVIGGGY ETMSGTSMAT PHMAGEAAVL RQYLKETYPN LSDFELGELA
NSLLMSTAVP SLDNGSGTYF SVRRQGAGVA NVYYAIVSGA YLSVEGSNRP KAEVGSSENG
TYTYTATVHN LTDGAKTYSL DTAALVETIT ELNGSNYVAN SEKRLSASEV NVTYTGLTDN
KLTVAANGDA TFTVTIQLTE AGKKYLDDNF PNGSYVEGFT FLTAEDDEGV SLSVPFLGFY
GDWGGLTVFD GDPGEQQNML GTALADIDAA GSGYFVGVNN TSGAYNESKM AYGPQRGNRQ
LVARVSLLRN VYSVEETVTD EDGNLIYTTG DLGMARKTYG VVTQTGIQYT ALLYTPGWAG
RTMKDGVNDA GDWAESGKWY TYTIQATPVG STEPQTKEFK FYLDNTKPTL EDVQLYEEDG
NVYLTGVASD DFYIQRIRVI DSTQEYWYLA EAEAFDAITE TGAKTRFTFD VTELASDLAA
DGKNPGRIGL LLEDVAYNSN LTFVDLGPQS MTIESLNLEV GESKQANVSI KPARLADAKL
TWASQDESVA KVDANGMVTG VADGETMISA TAMSGLTAYA KVTVGKGTPV LLTYGEAPEL
NDRFQTEDGL YWKVIGPDSV QLLEDQNKAS SWAASYASIS GDLEIPATVE YSGKTFRVTS
IGYQAFYSNQ GITSVVIPEG VTDVGYNAFF MCMKLAKISL PDTLEKVDTY AFNTFIATEF
DKMPASIQWI GESAFQKAKI TTLDLPEGLT HIGDEAFFNA EVESLSLPES VTEYGDYIFY
GCQNLSYVEL PDNMTELPKG IFWNCKALKR ISLPSGLKKI GNAAFYGSGL EKITIPASVT
EIDDWAFAWI TNMKTIDIPD SVESVGFNAY IYAQGVKTIN IGSGVKTIGK DGFRTWNLEL
GEAPVMNVKT EETATALRRS GYGQEILLNG VPYTGYNGVS FTDGTFSYMP ISDTEVQVVG
FNSSAAAGEY TMPAEVYCEG DDRTYTVTSV KDRTFFQNQN IFKLTLPDTI EEMGERAFDQ
MFNVTEFNVP KNLKVVGYQA MGYLGWEAKS IGLTVNTDRT LEIPGTVEEW GDCGFAGNMQ
KSIVVGEGVE YIGTYGLSGN YNATSVTLPS TLKRINNFAF QGCSSLTTVD IPDGVTYIGD
GAFNGVPLES IQLPEGLTYI GRQALGAYVY NSDYTAQYWA GPTYVELNGA LKNLGYNAFR
PDAEIVAVLN SQRNLVVASS DLEKLPTVVW DGKTDIPFND GSYVPAGKTV TVTGDVVIDG
KLTIEGKLVV DPTATLIITE DAVIVGAENI EYKTCDGGGV TNEELQNHYG ETPYLNDRFQ
AGDFWYKVTG PDTVQLIRDP GSSWSSYPEL SGEIFIPAEV TYENTAFRVT SIGEHAFGNN
SGITAVTLPD ILETIGESAF SGCYALELTY LPDSIRRLDD MAFQYSSGVH CNLPANLEVM
GSKVFNGTGI TEAVIPESVI RLGDEAFAGC YSLTKVEFLG LPEELGEGLF TFDSALETVV
LPQGLTEIPD NMFQYCASLT DVQIPDTVTV IGESAFAYAG LTEIVIPEAC REIGISAFAY
NASRVIRIAN QVERVEEEAF AGCKNVETVI FGKSVQYIGA GVLNYVTPAE GLDEIHIEVV
SESAASALFC SGYHGVVYKD GLPYVVYTGL PFQSDGVIYQ PTSDRTAFVA GCYDHAGENL
VIPETVYSEP DDMTYTVTGI EAGGLGGAKG TVTLPETVTW ISHLASSGSE ITSIHLPQGL
TEIRERAFLG MGQAETWADE TLTVPGGVGT FGEAAFSETH YRRAVLSEGI TQVSRSAFEM
AGSLEEVCLP DTVTKIGVGA FRYCGALEAV LLPEGLERVE KEAFYSAPLD EEIVIPASVT
YIGAQAFTGS VWDEDYHEIP TGPKQVAIGG GLRDLGWNAF RKDAVITTVL NSQRNLCVTF
GDLEQIPAVI WDGKTAIPLG DGSCVPEGKT VTVTGDVTID GKFCIEGKLV VAPAANLTIT
ENAVIVGPEN IEYKTCDGGE DCFSKTFTDL NTNRWYHVYT DYVIARGLMN GMSSTQFAPE
ANLTRGQLVT TLYRLAGEPE VTEPATFADV AEGRYFTDAI AWAEDLGIAE GITETEFAPD
GAVTREQAVT FLYRYVVNYL GQEPAKGGDL SIFRDAGKIS DYAREAMAWA TAEGFLEGYG
DSTVGPRNPV TRAQMAKFLT ILSKAF
//