GenomeNet

Database: UniProt
Entry: C2L1D6_9FIRM
ID   C2L1D6_9FIRM            Unreviewed;       670 AA.
AC   C2L1D6;
DT   16-JUN-2009, integrated into UniProtKB/TrEMBL.
DT   16-JUN-2009, sequence version 1.
DT   24-JAN-2024, entry version 40.
DE   RecName: Full=Endo-alpha-N-acetylgalactosaminidase domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=HMPREF6123_2555 {ECO:0000313|EMBL:EEJ50163.1};
OS   Oribacterium sinus F0268.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC   Oribacterium.
OX   NCBI_TaxID=585501 {ECO:0000313|EMBL:EEJ50163.1, ECO:0000313|Proteomes:UP000004121};
RN   [1] {ECO:0000313|EMBL:EEJ50163.1, ECO:0000313|Proteomes:UP000004121}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=F0268 {ECO:0000313|EMBL:EEJ50163.1,
RC   ECO:0000313|Proteomes:UP000004121};
RA   Qin X., Bachman B., Battles P., Bell A., Bess C., Bickham C., Chaboub L.,
RA   Chen D., Coyle M., Deiros D.R., Dinh H., Forbes L., Fowler G.,
RA   Francisco L., Fu Q., Gubbala S., Hale W., Han Y., Hemphill L.,
RA   Highlander S.K., Hirani K., Hogues M., Jackson L., Jakkamsetti A.,
RA   Javaid M., Jiang H., Korchina V., Kovar C., Lara F., Lee S., Mata R.,
RA   Mathew T., Moen C., Morales K., Munidasa M., Nazareth L., Ngo R.,
RA   Nguyen L., Okwuonu G., Ongeri F., Patil S., Petrosino J., Pham C., Pham P.,
RA   Pu L.-L., Puazo M., Raj R., Reid J., Rouhana J., Saada N., Shang Y.,
RA   Simmons D., Thornton R., Warren J., Weissenberger G., Zhang J., Zhang L.,
RA   Zhou C., Zhu D., Muzny D., Worley K., Gibbs R.;
RL   Submitted (APR-2009) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EEJ50163.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ACKX01000241; EEJ50163.1; -; Genomic_DNA.
DR   RefSeq; WP_007157969.1; NZ_GG668536.1.
DR   AlphaFoldDB; C2L1D6; -.
DR   STRING; 585501.HMPREF6123_2555; -.
DR   eggNOG; ENOG502Z91S; Bacteria.
DR   HOGENOM; CLU_441306_0_0_9; -.
DR   InParanoid; C2L1D6; -.
DR   OrthoDB; 2496946at2; -.
DR   Proteomes; UP000004121; Unassembled WGS sequence.
DR   GO; GO:0033926; F:glycopeptide alpha-N-acetylgalactosaminidase activity; IEA:InterPro.
DR   CDD; cd14244; GH_101_like; 1.
DR   Gene3D; 3.20.20.80; Glycosidases; 1.
DR   InterPro; IPR043751; DUF5696.
DR   InterPro; IPR025706; Endoa_GalNAc.
DR   Pfam; PF18952; DUF5696; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000004121}.
SQ   SEQUENCE   670 AA;  77398 MW;  9F81A5EB5623FFAC CRC64;
     MNPSKILLSF EDILLEFDPD NLRYSISTGK TKWETRADFS PYIVLQEKKD NLSNGERQDS
     KIDKAEKEKE ENETIKALLY FKDAKIKSHE LIRTGVGEGI RSIFKDFILP SSYSNKKEEL
     LSFSFETYVW IEYSTKQVYF EWIPLTEAPI DCIDKILFPG PFAFQKKSSS WYSIIPKEQG
     ILIPNTWDVS FHQDGFKGRF GTASAYLPIF GQVKDGEGYL AESLTPWNMG YEAVHEAGSM
     ETSIEFRIEP SLGQMEYRRI IRYLFLSDCD YNSLLKTYRK LAREEGKLKT LKEKEVANPS
     VRKLLGASFV HKGIKTCVQP DSEFFDKDAP DKNNHLTTFR KRAEEMRALK ADGIEKLYFH
     LDGWGDAGYD NKHPDVGPAC KEAGGWEGMR ELSDTMKELG FLFGIHDQYR DFYKKAESYH
     DDLACKSPDG SIFTHARWAG GPQAYLCTSQ APYYVRRNFE RLLDEGINLD GAYLDVFTCN
     EGDECDNPRH RMRRKDSYEY RCQCFRYLLS KDILPSSEEV NDWAVPWLVF CHYAPYDFML
     REPGSPKYGI PIPMFNLVYH DCLVIPWMME KLPEEDYMLY ALLNGGAPYL IRDPAYLGID
     GAFTLEEEMP WEKHLERVGI VSDFHKNVGD AELVKHELLD EKGYRQKSTF ANGYAVEVDL
     QTGSYSISKN
//