ID A0A1G9XFD9_9FIRM Unreviewed; 1281 AA.
AC A0A1G9XFD9;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 18.
DE SubName: Full=Cysteine protease, C1A family {ECO:0000313|EMBL:SDM95261.1};
GN ORFNames=SAMN05192585_10853 {ECO:0000313|EMBL:SDM95261.1};
OS Acetanaerobacterium elongatum.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Acetanaerobacterium.
OX NCBI_TaxID=258515 {ECO:0000313|EMBL:SDM95261.1, ECO:0000313|Proteomes:UP000199182};
RN [1] {ECO:0000313|EMBL:SDM95261.1, ECO:0000313|Proteomes:UP000199182}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CGMCC 1.5012 {ECO:0000313|EMBL:SDM95261.1,
RC ECO:0000313|Proteomes:UP000199182};
RA de Groot N.N.;
RL Submitted (OCT-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C1 family.
CC {ECO:0000256|ARBA:ARBA00008455}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FNID01000008; SDM95261.1; -; Genomic_DNA.
DR STRING; 258515.SAMN05192585_10853; -.
DR OrthoDB; 3648721at2; -.
DR Proteomes; UP000199182; Unassembled WGS sequence.
DR GO; GO:0008234; F:cysteine-type peptidase activity; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR CDD; cd02619; Peptidase_C1; 1.
DR Gene3D; 2.160.20.110; -; 1.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 2.
DR InterPro; IPR011493; GLUG.
DR InterPro; IPR040528; Lectin-like.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR000169; Pept_cys_AS.
DR InterPro; IPR025660; Pept_his_AS.
DR InterPro; IPR013128; Peptidase_C1A.
DR InterPro; IPR000668; Peptidase_C1A_C.
DR PANTHER; PTHR12411:SF741; CATHEPSIN K; 1.
DR PANTHER; PTHR12411; CYSTEINE PROTEASE FAMILY C1-RELATED; 1.
DR Pfam; PF07581; Glug; 2.
DR Pfam; PF18560; Lectin_like; 1.
DR Pfam; PF00112; Peptidase_C1; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 2.
DR PROSITE; PS00139; THIOL_PROTEASE_CYS; 1.
DR PROSITE; PS00639; THIOL_PROTEASE_HIS; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000313|EMBL:SDM95261.1};
KW Protease {ECO:0000313|EMBL:SDM95261.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000199182};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..1281
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5011793322"
FT DOMAIN 93..256
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|Pfam:PF00112"
FT DOMAIN 297..465
FT /note="Lectin-like"
FT /evidence="ECO:0000259|Pfam:PF18560"
FT DOMAIN 635..666
FT /note="Peptidase C1A papain C-terminal"
FT /evidence="ECO:0000259|Pfam:PF00112"
FT DOMAIN 877..899
FT /note="GLUG"
FT /evidence="ECO:0000259|Pfam:PF07581"
FT DOMAIN 975..996
FT /note="GLUG"
FT /evidence="ECO:0000259|Pfam:PF07581"
FT REGION 476..556
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 489..531
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1281 AA; 134760 MW; 33563214A94C1349 CRC64;
MKRCKKLVSL VLTVLMALSL LPASLAAEGT GADSAAPGDS ALPGQYITVE PQLEGASEPA
TGAADYIDLS EMHLFAGGTR PDQPFSYDLG LGSGGNAEMA TSFLTRWADP VNEADNPYPD
YAGTLSDSNI VINSSAQTAK RVQEIISIPK KPYNQPLAND AVKQAIMDHG AAYASLWWDG
RCENQTKGTY YFPKNAATNI TYGGGHAITI VGWDDDYLAS NFSGCPAKVT PPGKGAFIIK
NSWGTSAGQS GYYYVSYYDR FLVSAVEGYY SNDQVRYSSD SATFFTKVEN TNNYSGVFSY
DDYGAVYVHG VTTPNYVFYA NKFNTTTQKD IAAVSFYTYS YNDSYRIYVH PLSSGGGLSI
PTSTTGLPAP SAVGTAYYPG YHTINLLQNV TIPAGKDFEV IIAVQNPTAG VVGIPLEVKE
SGYNSKATIG AGQSYVFVNG VWRDTTSLLT SNGSYVPMNV CVKAFVAGAG MDGSSHNGNP
PVVGGSGTEP ATPGSSPVAG QSDVVTPGSS PAIGSSNIAG DPPTVGDSQP VPGQSEPPVP
GGSEQTAPGA SAVNEPVTGS VEFSTVIPEP RDAFKGALKQ LGEPPAVGGS GEAGRAFGEI
PAPYDFFNGR TLADMGEIST GGNFAAKYSL LPLGRVTSVK NQGNYGACWT FATMSSIESA
YLTENGLDNS KTDAKFSQSS RYLKNSTGSS ITLTAVAPAS LVSSYSWTIK SGDIAVTNTS
LTNKSVTFNY KAGAPETELK VEGKVAYTNG DTYVCTFRIP VTNYGSGDGT EAKPFEVSTP
MGLQLIDCFK GKPSQGLNFT QTNDITLTDY WHPIGDAANP FMGSFNGGGF KINGFNLITL
FSFEPAGLFG CVNGGTIENI ILYDADIEAS PSYFVGECYV GGIAGRAQNS TIQNCAVSGE
YGFTLSQDNA TEPSVGGLIG DAKDNTVVKH CSFSGSIHMT STKKITIGGL IGRATNATVS
NSFANLNAHG GKESSFGGLA GFADKANITD CYATGTFSTL GSAIGVCAGG ITREARNQCS
FTNCYTSVNY DKSFTESGSR KGAIAAWCSG TQKLSNCYYD KTAAGDGVVL GTFTSAPKGL
TAQQMAQQTS FTGFNFTTNW VMAPGPISYV PVLRNPIGEV DQFEFSSRVC KTLFSSQATP
LCGRFFPEYA WPRPLDITVS DSKVVTFDAG LLKAGKYGAA TVSVTGVPSS PHNMQIFTRI
GDVNFDGKCE YSQAVTALLN WLTGKTPTVT DPYVLNADHK DKDGKTTVSD NYNIVQEDLS
LTDLLKMQQS NAGVNPDDTA G
//