ID R5HXR7_9BACT Unreviewed; 1206 AA.
AC R5HXR7;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE SubName: Full=Papain family cysteine protease {ECO:0000313|EMBL:CCY34638.1};
GN ORFNames=BN796_00140 {ECO:0000313|EMBL:CCY34638.1};
OS Alistipes sp. CAG:831.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Rikenellaceae;
OC Alistipes.
OX NCBI_TaxID=1262698 {ECO:0000313|EMBL:CCY34638.1, ECO:0000313|Proteomes:UP000018094};
RN [1] {ECO:0000313|EMBL:CCY34638.1, ECO:0000313|Proteomes:UP000018094}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:831 {ECO:0000313|Proteomes:UP000018094};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCY34638.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAYB010000013; CCY34638.1; -; Genomic_DNA.
DR AlphaFoldDB; R5HXR7; -.
DR STRING; 1262698.BN796_00140; -.
DR Proteomes; UP000018094; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02619; Peptidase_C1; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR PANTHER; PTHR12411:SF741; CATHEPSIN K; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF00112; Peptidase_C1; 1.
DR SMART; SM00645; Pept_C1; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Hydrolase {ECO:0000313|EMBL:CCY34638.1};
KW Membrane {ECO:0000256|SAM:Phobius}; Protease {ECO:0000313|EMBL:CCY34638.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000018094};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 859..876
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 888..909
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 929..947
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 476..706
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|SMART:SM00645"
FT COILED 404..438
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 748..803
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 954..988
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 1206 AA; 137842 MW; 1C473B6F28C4F208 CRC64;
MEHFAHIFIG KEFAEIVSNI GRKVYKYGGE ELLSSVNQFV VDDGRVRQLL YPEGAIDIKD
MIQHSSLQLE WRELGRLTDN PEDDKDLFLN GIFNQILRVG NVGANASLYT MIHFPLYKSE
ALESAEYLYR IIKLAGRPVD LDFMGFCDDM ASIIEPDYVI TSPSKNQVEK YKAFRDAESI
GYSTHFIVMQ NASQNGISLG LNADSFATIL SHFSILCAGY YHEMFPNTVE YMDVVAFGFS
TLCLDKYLYA EYLLSKSMLN AMDAAHVNDR EVDVNNAYVV VNNMLKDKAT LLSTLFQQLD
GHRNSEADDE YQDIQKKFTD DVQEIVDRCK DTLKKQSAIT MQAAILAVCL AKTDCELFSE
TIFTQDIVTV DKLFDEPIEY YINNDHAHYY KLGGENPVNP IQKLKELDAR LINSETAIRN
LKNDLETLNK QIGDSEKVKE FYVEDGYIHF KNQKFRLLPS VEQVPLAETY TPHEVKVRSL
DMRSKFTSVK NQGQQGSCLS FALTSIFEYV MHLNAAQEFD LSEAFLYYNA RDLDGDGGVN
NDSGSRFKPS IDALIKYGIA LESVWPYNDQ VYSKKPSDEA YADAAGRKLV SAMNVNSKVS
DIKSALVDNC PVAASFTLTK SFFEFGRAGG YIPMPGDEEI ASVIEQESEE DLHSCHAMVI
VGFSDDLQMF IVRNSWGTDW GDNGYCYIPY SYVEHEGLLN YACIFTEIEK LVTCPDNMEV
MPLTIDSSDL HIQYAVKKNE LYIEEHTAEQ LRNDRKELRV YFESLKQMYC NPNQRDCFID
ANKALLTAEQ ESLKEKISAK EAECDANDEK MNKYKKDLLF RSASFLTGTV LLAVMYKQFV
DFLHVSSADP DFQFTFKPFL VWIIIFAILA IVSWLLRKKS FVKSFWMSIW GVVGVLMAKA
ISRLVSYFVG SGTLVSYFGP TDFFHKIDGG QFLWLLAVFA VAIIITYRNG HIVWRDWRDE
RDRINAEIDK LHNEINVKEK EKRFLKLKTF SAWTLITKLQ GLHTEFYSNY ANLISLINNF
RVWYNELKTK GDDISLHTVF PETSLLSAEL LDEFFDRSLK NDPELCVDLC ANISQYNITS
ESLSKYKETL MRQVIDSLLS RKEIREFDIS NHIANNTFAD IAMEVNRKLV TALDEQSGIF
LNVNSQQRGV IVPSTAVFAP SLGKYRDSIR KKLGKYSEPY YESAGKYCLT FLKTATVWFQ
ECVNFK
//