ID F7K6V0_9FIRM Unreviewed; 648 AA.
AC F7K6V0;
DT 21-SEP-2011, integrated into UniProtKB/TrEMBL.
DT 21-SEP-2011, sequence version 1.
DT 24-JAN-2024, entry version 41.
DE RecName: Full=NlpC/P60 domain-containing protein {ECO:0000259|PROSITE:PS51935};
GN ORFNames=HMPREF0994_01617 {ECO:0000313|EMBL:EGN42091.1};
OS Lachnospiraceae bacterium 3_1_57FAA_CT1.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=658086 {ECO:0000313|EMBL:EGN42091.1, ECO:0000313|Proteomes:UP000003336};
RN [1] {ECO:0000313|Proteomes:UP000003336}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3_1_57FAA_CT1 {ECO:0000313|Proteomes:UP000003336};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Strauss J.,
RA Allen-Vercoe E., Young S.K., Zeng Q., Gargeya S., Fitzgerald M., Haas B.,
RA Abouelleil A., Alvarado L., Arachchi H.M., Berlin A., Brown A.,
RA Chapman S.B., Chen Z., Dunbar C., Freedman E., Gearin G., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heiman D., Howarth C., Larson L., Lui A.,
RA MacDonald P.J.P., Mehta T., Montmayeur A., Murphy C., Neiman D.,
RA Pearson M., Priest M., Roberts A., Saif S., Shea T., Shenoy N., Sisk P.,
RA Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 2_1_58FAA.";
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EGN42091.1, ECO:0000313|Proteomes:UP000003336}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3_1_57FAA_CT1 {ECO:0000313|EMBL:EGN42091.1,
RC ECO:0000313|Proteomes:UP000003336};
RG The Broad Institute Genomics Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Strauss J.,
RA Allen-Vercoe E., Walker B., Young S., Zeng Q., Gargeya S., Fitzgerald M.,
RA Haas B., Abouelleil A., Allen A.W., Alvarado L., Arachchi H.M.,
RA Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J., Griggs A.,
RA Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A., Larimer J.,
RA McCowan C., Murphy C., Pearson M., Poon T.W., Priest M., Roberts A.,
RA Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 3-1-57FAA CT1.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the peptidase C40 family.
CC {ECO:0000256|ARBA:ARBA00007074}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGN42091.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACTP02000005; EGN42091.1; -; Genomic_DNA.
DR AlphaFoldDB; F7K6V0; -.
DR STRING; 658086.HMPREF0994_01617; -.
DR PATRIC; fig|658086.3.peg.1765; -.
DR eggNOG; COG0791; Bacteria.
DR HOGENOM; CLU_012396_2_0_9; -.
DR Proteomes; UP000003336; Unassembled WGS sequence.
DR Gene3D; 3.90.1720.10; endopeptidase domain like (from Nostoc punctiforme); 1.
DR InterPro; IPR000064; NLP_P60_dom.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR PANTHER; PTHR47053; MUREIN DD-ENDOPEPTIDASE MEPH-RELATED; 1.
DR PANTHER; PTHR47053:SF1; MUREIN DD-ENDOPEPTIDASE MEPH-RELATED; 1.
DR Pfam; PF00877; NLPC_P60; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR PROSITE; PS51935; NLPC_P60; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000003336}.
FT DOMAIN 525..648
FT /note="NlpC/P60"
FT /evidence="ECO:0000259|PROSITE:PS51935"
FT REGION 1..92
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 16..36
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 39..73
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 648 AA; 71844 MW; E571F97BDB046297 CRC64;
MKPLKPRDKV TQRMTRAGLT LDNQTTGESA GISSRETEPE YTAKPDGTVE KALERAVDIR
DRHKAKRTAR NGERMARQAG APASRLQFTA GERASPELAR YIKKAEKRAD KLEAAKEALP
KKRVLTKETV YNEAKGKAKS KLHFEKVEKA PPKLKPNPAS RPVQEARLYL HGKIHEVEQE
NVGVEAGHKA EELAERQAGK VLKSAIRRHK LKPYRAAAKA ERKSMAANAE FVYRKSLRDN
PELAQAVKNP VSRLCQKQHI KREYAKAARA AGRSASGSAK TTASAARKAA EKGKQVASLV
ARHWKGALLI GGVGLLLMLV MGGLQSCTAM FGSAGTGLAA TSYLSEDSDM LGAEATYAGM
EADLQHELDN YESLHPGYDE YRFDLDDISH DPYVLTSILS ALHGGAFTLD EVQGNLAMLF
EQQYTLTERV EMEIRYRTVT HTDDDGNEYE EEEPYQYFIC YVTLENADLS HLPVYLMDEN
QLSLYAAYMQ TLGNRPDLFP SGSYPHASTV KEPTYYEIPP EALEDETFAA MIAEADKYVG
YPYVWGGSSP STSFDCSGFI SWVVNHSGWN VGRQTAQGLY NLCTPVSPEQ AKPGDLVFFV
GTYDTSGMSH VGLYVGNSVM LHCGNPISYT NLNSSYWQEH FYCYGRLP
//