ID G1WG92_9ACTN Unreviewed; 1399 AA.
AC G1WG92;
DT 16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT 16-NOV-2011, sequence version 1.
DT 24-JAN-2024, entry version 49.
DE RecName: Full=NlpC/P60 domain-containing protein {ECO:0000259|PROSITE:PS51935};
GN ORFNames=HMPREF9452_00355 {ECO:0000313|EMBL:EGX67343.1};
OS Collinsella tanakaei YIT 12063.
OC Bacteria; Actinomycetota; Coriobacteriia; Coriobacteriales;
OC Coriobacteriaceae; Collinsella.
OX NCBI_TaxID=742742 {ECO:0000313|EMBL:EGX67343.1, ECO:0000313|Proteomes:UP000004830};
RN [1] {ECO:0000313|EMBL:EGX67343.1, ECO:0000313|Proteomes:UP000004830}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=YIT 12063 {ECO:0000313|EMBL:EGX67343.1,
RC ECO:0000313|Proteomes:UP000004830};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Morotomi M., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Mehta T.,
RA Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M., Roberts A.,
RA Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Wortman J.,
RA Nusbaum C., Birren B.;
RT "The Genome Sequence of Collinsella tanakaei YIT 12063.";
RL Submitted (JUN-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 25 family.
CC {ECO:0000256|ARBA:ARBA00010646}.
CC -!- SIMILARITY: Belongs to the peptidase C40 family.
CC {ECO:0000256|ARBA:ARBA00007074}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGX67343.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADLS01000006; EGX67343.1; -; Genomic_DNA.
DR RefSeq; WP_009140391.1; NZ_JH126467.1.
DR STRING; 742742.HMPREF9452_00355; -.
DR GeneID; 62758147; -.
DR PATRIC; fig|742742.3.peg.345; -.
DR eggNOG; COG0791; Bacteria.
DR eggNOG; COG3533; Bacteria.
DR eggNOG; COG3693; Bacteria.
DR eggNOG; COG3757; Bacteria.
DR HOGENOM; CLU_254398_0_0_11; -.
DR Proteomes; UP000004830; Unassembled WGS sequence.
DR GO; GO:0003796; F:lysozyme activity; IEA:InterPro.
DR GO; GO:0016998; P:cell wall macromolecule catabolic process; IEA:InterPro.
DR GO; GO:0009253; P:peptidoglycan catabolic process; IEA:InterPro.
DR CDD; cd06414; GH25_LytC-like; 1.
DR CDD; cd00161; RICIN; 5.
DR Gene3D; 2.80.10.50; -; 11.
DR Gene3D; 3.90.1720.10; endopeptidase domain like (from Nostoc punctiforme); 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR002053; Glyco_hydro_25.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR000064; NLP_P60_dom.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR035992; Ricin_B-like_lectins.
DR InterPro; IPR000772; Ricin_B_lectin.
DR PANTHER; PTHR34135; LYSOZYME; 1.
DR PANTHER; PTHR34135:SF2; LYSOZYME; 1.
DR Pfam; PF01183; Glyco_hydro_25; 1.
DR Pfam; PF00877; NLPC_P60; 1.
DR Pfam; PF14200; RicinB_lectin_2; 7.
DR SMART; SM00458; RICIN; 6.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR SUPFAM; SSF50370; Ricin B-like lectins; 6.
DR PROSITE; PS51904; GLYCOSYL_HYDROL_F25_2; 1.
DR PROSITE; PS51935; NLPC_P60; 1.
DR PROSITE; PS50231; RICIN_B_LECTIN; 5.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000004830}.
FT DOMAIN 1269..1399
FT /note="NlpC/P60"
FT /evidence="ECO:0000259|PROSITE:PS51935"
SQ SEQUENCE 1399 AA; 148813 MW; F23DA4ED6DE7CCDB CRC64;
MSRRARGLLL STFAVLIALV CGPVFIAAGE DVVDGAVIAS TDEPSALAVE GEGSAEATED
SAEASNAVSS SEDVAYSWRY EAGELEIQAG DSDVMLLSRP TLPEGAAEWG VDVSFAQGDI
DWAKAKADGV DFAILRLGYG AGGSDRRFVA NVQGCKANGI KFGVYLYSYA WNASTATSEA
EWTLTVLRNA GVSPSDLGLP VYYDLENQNP ATGRPAGVDD KNQYHEIEGG SATFAAMGKA
FCSKIAAAGY TPGVYANLRW WNNYLTDSVF DNWDRWVAQY NSTCDYEGDY TLWQYSSSGS
VDGISGRVDV NYLYDPEGTP QYMDKLAAAN KGLFQDGSYV VALAAGNRQV LDVSGGSLYA
GANVQSYDAN ASVAQTWVVQ TVDGGYLKIS NLASGKALSV TPSSSQVGAN VQQEDWSDTR
SQKWVATKQG DGIVLRSALG KHLVLDIPQG AHNGANAVVG EDASSSSQTF VLYSADGVSS
QSRTLPDGTY SFTLNGLSLD IAGASTANGA ALQLYTSNGT AAQLFSVSFH EISGGKGYYT
IRPAHSQKLL DADNGACYPG ASVAQWGDTA GAKQRYWVIE KNSDGTVSII NAANGYALAA
SSRAAGAQVV TLPKTDSGAL AFTYKRSILP ISHDDIDALA KEHIDELPEG TYAFGSGTSS
RLVFDVSGGS TGSCANIQIY SSNRSAAQKW SVERVDKSNG YVRIVNVGSG KVLDIQSGSN
TPGANVQQYS WNGSRAQLWL PVKQADGSYV FYSAVANALV LDVSVAGAYN GANIDVYTSN
GTIAQSFTAY NLNPNVPSQG RVVDDGVYTI VSSSNSNSAV ALGEPLDANG TLLGIADANG
SSTSQQFMLT YGDDGYYRIR SVSSQKGLDL KDGDFLAGAK IQQWDYSSAN KNQKWVVSKN
SDGTYSVVSA STGLAWDLSG SSLVCNPVSS SSSQRWSLNK YVPVIEAGAY IFKSGVGDNV
LDVASGSIQG GANVRMWTNN GSLAQRWYVR KVSDGVYRFQ NVLSGKYLSA DSSDNVVQSS
LTSRCDWTTE ASLFGVVLKN VATGRVLDVS GGSSKAGANV QVYSSNGSKA QAWMLQAKEL
VSEGFYQIAS SINSSFVLDV PQGSSANGAN VQIYTNNSTS SQKWMVKSAG NGWYSIIAAC
SAKALDVAGG SASPETNVDQ YDQNGTAAQK WTFRMGENGV EIVSMLGTVL DVKGGEAYNG
SNVQTYTSNH SRSQQWHLTS IEAPGKIGYQ NPSQFFQVSS KNVRLVDAAY STPYCYVTPS
RIDVDATREE CVEAFIARAF EYINAPYVWN YSLSPQKGVD CAGLVMQAAY ACGMDLGEYN
PYAHWYDPWH SHDANNMSVD PRFMHVKLAD RKRGDLVFYP GHVAIYLGND QIIEATPPRV
RTASVFWLGA PTAVARPFV
//