GenomeNet

Database: UniProt
Entry: G1WG92_9ACTN
LinkDB: G1WG92_9ACTN
Original site: G1WG92_9ACTN 
ID   G1WG92_9ACTN            Unreviewed;      1399 AA.
AC   G1WG92;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   16-NOV-2011, sequence version 1.
DT   24-JAN-2024, entry version 49.
DE   RecName: Full=NlpC/P60 domain-containing protein {ECO:0000259|PROSITE:PS51935};
GN   ORFNames=HMPREF9452_00355 {ECO:0000313|EMBL:EGX67343.1};
OS   Collinsella tanakaei YIT 12063.
OC   Bacteria; Actinomycetota; Coriobacteriia; Coriobacteriales;
OC   Coriobacteriaceae; Collinsella.
OX   NCBI_TaxID=742742 {ECO:0000313|EMBL:EGX67343.1, ECO:0000313|Proteomes:UP000004830};
RN   [1] {ECO:0000313|EMBL:EGX67343.1, ECO:0000313|Proteomes:UP000004830}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=YIT 12063 {ECO:0000313|EMBL:EGX67343.1,
RC   ECO:0000313|Proteomes:UP000004830};
RG   The Broad Institute Genome Sequencing Platform;
RA   Earl A., Ward D., Feldgarden M., Gevers D., Morotomi M., Young S.K.,
RA   Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA   Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA   Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA   Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P., Mehta T.,
RA   Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M., Roberts A.,
RA   Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S., Wortman J.,
RA   Nusbaum C., Birren B.;
RT   "The Genome Sequence of Collinsella tanakaei YIT 12063.";
RL   Submitted (JUN-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- SIMILARITY: Belongs to the glycosyl hydrolase 25 family.
CC       {ECO:0000256|ARBA:ARBA00010646}.
CC   -!- SIMILARITY: Belongs to the peptidase C40 family.
CC       {ECO:0000256|ARBA:ARBA00007074}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EGX67343.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ADLS01000006; EGX67343.1; -; Genomic_DNA.
DR   RefSeq; WP_009140391.1; NZ_JH126467.1.
DR   STRING; 742742.HMPREF9452_00355; -.
DR   GeneID; 62758147; -.
DR   PATRIC; fig|742742.3.peg.345; -.
DR   eggNOG; COG0791; Bacteria.
DR   eggNOG; COG3533; Bacteria.
DR   eggNOG; COG3693; Bacteria.
DR   eggNOG; COG3757; Bacteria.
DR   HOGENOM; CLU_254398_0_0_11; -.
DR   Proteomes; UP000004830; Unassembled WGS sequence.
DR   GO; GO:0003796; F:lysozyme activity; IEA:InterPro.
DR   GO; GO:0016998; P:cell wall macromolecule catabolic process; IEA:InterPro.
DR   GO; GO:0009253; P:peptidoglycan catabolic process; IEA:InterPro.
DR   CDD; cd06414; GH25_LytC-like; 1.
DR   CDD; cd00161; RICIN; 5.
DR   Gene3D; 2.80.10.50; -; 11.
DR   Gene3D; 3.90.1720.10; endopeptidase domain like (from Nostoc punctiforme); 1.
DR   Gene3D; 3.20.20.80; Glycosidases; 1.
DR   InterPro; IPR002053; Glyco_hydro_25.
DR   InterPro; IPR017853; Glycoside_hydrolase_SF.
DR   InterPro; IPR000064; NLP_P60_dom.
DR   InterPro; IPR038765; Papain-like_cys_pep_sf.
DR   InterPro; IPR035992; Ricin_B-like_lectins.
DR   InterPro; IPR000772; Ricin_B_lectin.
DR   PANTHER; PTHR34135; LYSOZYME; 1.
DR   PANTHER; PTHR34135:SF2; LYSOZYME; 1.
DR   Pfam; PF01183; Glyco_hydro_25; 1.
DR   Pfam; PF00877; NLPC_P60; 1.
DR   Pfam; PF14200; RicinB_lectin_2; 7.
DR   SMART; SM00458; RICIN; 6.
DR   SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR   SUPFAM; SSF54001; Cysteine proteinases; 1.
DR   SUPFAM; SSF50370; Ricin B-like lectins; 6.
DR   PROSITE; PS51904; GLYCOSYL_HYDROL_F25_2; 1.
DR   PROSITE; PS51935; NLPC_P60; 1.
DR   PROSITE; PS50231; RICIN_B_LECTIN; 5.
PE   3: Inferred from homology;
KW   Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Reference proteome {ECO:0000313|Proteomes:UP000004830}.
FT   DOMAIN          1269..1399
FT                   /note="NlpC/P60"
FT                   /evidence="ECO:0000259|PROSITE:PS51935"
SQ   SEQUENCE   1399 AA;  148813 MW;  F23DA4ED6DE7CCDB CRC64;
     MSRRARGLLL STFAVLIALV CGPVFIAAGE DVVDGAVIAS TDEPSALAVE GEGSAEATED
     SAEASNAVSS SEDVAYSWRY EAGELEIQAG DSDVMLLSRP TLPEGAAEWG VDVSFAQGDI
     DWAKAKADGV DFAILRLGYG AGGSDRRFVA NVQGCKANGI KFGVYLYSYA WNASTATSEA
     EWTLTVLRNA GVSPSDLGLP VYYDLENQNP ATGRPAGVDD KNQYHEIEGG SATFAAMGKA
     FCSKIAAAGY TPGVYANLRW WNNYLTDSVF DNWDRWVAQY NSTCDYEGDY TLWQYSSSGS
     VDGISGRVDV NYLYDPEGTP QYMDKLAAAN KGLFQDGSYV VALAAGNRQV LDVSGGSLYA
     GANVQSYDAN ASVAQTWVVQ TVDGGYLKIS NLASGKALSV TPSSSQVGAN VQQEDWSDTR
     SQKWVATKQG DGIVLRSALG KHLVLDIPQG AHNGANAVVG EDASSSSQTF VLYSADGVSS
     QSRTLPDGTY SFTLNGLSLD IAGASTANGA ALQLYTSNGT AAQLFSVSFH EISGGKGYYT
     IRPAHSQKLL DADNGACYPG ASVAQWGDTA GAKQRYWVIE KNSDGTVSII NAANGYALAA
     SSRAAGAQVV TLPKTDSGAL AFTYKRSILP ISHDDIDALA KEHIDELPEG TYAFGSGTSS
     RLVFDVSGGS TGSCANIQIY SSNRSAAQKW SVERVDKSNG YVRIVNVGSG KVLDIQSGSN
     TPGANVQQYS WNGSRAQLWL PVKQADGSYV FYSAVANALV LDVSVAGAYN GANIDVYTSN
     GTIAQSFTAY NLNPNVPSQG RVVDDGVYTI VSSSNSNSAV ALGEPLDANG TLLGIADANG
     SSTSQQFMLT YGDDGYYRIR SVSSQKGLDL KDGDFLAGAK IQQWDYSSAN KNQKWVVSKN
     SDGTYSVVSA STGLAWDLSG SSLVCNPVSS SSSQRWSLNK YVPVIEAGAY IFKSGVGDNV
     LDVASGSIQG GANVRMWTNN GSLAQRWYVR KVSDGVYRFQ NVLSGKYLSA DSSDNVVQSS
     LTSRCDWTTE ASLFGVVLKN VATGRVLDVS GGSSKAGANV QVYSSNGSKA QAWMLQAKEL
     VSEGFYQIAS SINSSFVLDV PQGSSANGAN VQIYTNNSTS SQKWMVKSAG NGWYSIIAAC
     SAKALDVAGG SASPETNVDQ YDQNGTAAQK WTFRMGENGV EIVSMLGTVL DVKGGEAYNG
     SNVQTYTSNH SRSQQWHLTS IEAPGKIGYQ NPSQFFQVSS KNVRLVDAAY STPYCYVTPS
     RIDVDATREE CVEAFIARAF EYINAPYVWN YSLSPQKGVD CAGLVMQAAY ACGMDLGEYN
     PYAHWYDPWH SHDANNMSVD PRFMHVKLAD RKRGDLVFYP GHVAIYLGND QIIEATPPRV
     RTASVFWLGA PTAVARPFV
//
DBGET integrated database retrieval system