ID S0IVV7_9FIRM Unreviewed; 405 AA.
AC S0IVV7;
DT 18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2013, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE RecName: Full=NlpC/P60 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=C805_02921 {ECO:0000313|EMBL:EOT24709.1};
OS Eubacterium sp. 14-2.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Eubacteriaceae;
OC Eubacterium.
OX NCBI_TaxID=1235790 {ECO:0000313|EMBL:EOT24709.1, ECO:0000313|Proteomes:UP000014176};
RN [1] {ECO:0000313|EMBL:EOT24709.1, ECO:0000313|Proteomes:UP000014176}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=14-2 {ECO:0000313|EMBL:EOT24709.1,
RC ECO:0000313|Proteomes:UP000014176};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Eubacterium bacterium 14-2.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C40 family.
CC {ECO:0000256|ARBA:ARBA00007074}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOT24709.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASSS01000010; EOT24709.1; -; Genomic_DNA.
DR RefSeq; WP_016215947.1; NZ_KE159568.1.
DR AlphaFoldDB; S0IVV7; -.
DR STRING; 1235790.C805_02921; -.
DR PATRIC; fig|1235790.3.peg.3157; -.
DR eggNOG; COG0791; Bacteria.
DR eggNOG; COG3103; Bacteria.
DR HOGENOM; CLU_016043_13_0_9; -.
DR OrthoDB; 9808890at2; -.
DR Proteomes; UP000014176; Unassembled WGS sequence.
DR Gene3D; 3.90.1720.10; endopeptidase domain like (from Nostoc punctiforme); 1.
DR Gene3D; 2.30.30.40; SH3 Domains; 2.
DR InterPro; IPR000064; NLP_P60_dom.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR003646; SH3-like_bac-type.
DR PANTHER; PTHR34408; FAMILY PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR34408:SF1; GLYCOSYL HYDROLASE FAMILY 19 DOMAIN-CONTAINING PROTEIN HI_1415; 1.
DR Pfam; PF00877; NLPC_P60; 1.
DR Pfam; PF08239; SH3_3; 2.
DR SMART; SM00287; SH3b; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS51935; NLPC_P60; 1.
DR PROSITE; PS51781; SH3B; 2.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000014176};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 30..405
FT /note="NlpC/P60 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004488121"
FT DOMAIN 97..160
FT /note="SH3b"
FT /evidence="ECO:0000259|PROSITE:PS51781"
FT DOMAIN 169..231
FT /note="SH3b"
FT /evidence="ECO:0000259|PROSITE:PS51781"
FT DOMAIN 291..405
FT /note="NlpC/P60"
FT /evidence="ECO:0000259|PROSITE:PS51935"
FT REGION 238..292
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 238..261
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 262..292
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 405 AA; 42531 MW; F52E401DE5AC34BB CRC64;
MNRNSIRKAT TCCLTAAVVF TGSSFVSHAA VSAGAGNLIS AAQETQIDNG QDTKKDTSAT
AGVTEIFANS LAPRQEAIDT NVDVVQNETP VTRTEYDDIA IAQVDDYVNV RSIAGEDGEV
LGKLYNNSAC TVLGTEGDWY KIHSGNVEGY IKAEFLVLGN AELAKSVGYR VADVTTDNLN
VRADSSTESD VLGQVPQGEK LSVVEEKDGW VKVAIEEGDG WVSSEFVECS TNYVVAESKE
EEEARLKKEE EERKAAEEAA RRATRSSSSS SSSGSSSSSS GESRSYNPPS GGSGQAVADY
ACQFVGNPYV YGGTSLTNGA DCSGFVMSVY AAFGVSLPHS SSALAGVGYG VSTDAMQPGD
IVCYSGHVGI YIGGDTIVHA STEATGIKYT SPAAYRTIVA VRRIF
//