ID C2L1D6_9FIRM Unreviewed; 670 AA.
AC C2L1D6;
DT 16-JUN-2009, integrated into UniProtKB/TrEMBL.
DT 16-JUN-2009, sequence version 1.
DT 24-JAN-2024, entry version 40.
DE RecName: Full=Endo-alpha-N-acetylgalactosaminidase domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=HMPREF6123_2555 {ECO:0000313|EMBL:EEJ50163.1};
OS Oribacterium sinus F0268.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Oribacterium.
OX NCBI_TaxID=585501 {ECO:0000313|EMBL:EEJ50163.1, ECO:0000313|Proteomes:UP000004121};
RN [1] {ECO:0000313|EMBL:EEJ50163.1, ECO:0000313|Proteomes:UP000004121}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F0268 {ECO:0000313|EMBL:EEJ50163.1,
RC ECO:0000313|Proteomes:UP000004121};
RA Qin X., Bachman B., Battles P., Bell A., Bess C., Bickham C., Chaboub L.,
RA Chen D., Coyle M., Deiros D.R., Dinh H., Forbes L., Fowler G.,
RA Francisco L., Fu Q., Gubbala S., Hale W., Han Y., Hemphill L.,
RA Highlander S.K., Hirani K., Hogues M., Jackson L., Jakkamsetti A.,
RA Javaid M., Jiang H., Korchina V., Kovar C., Lara F., Lee S., Mata R.,
RA Mathew T., Moen C., Morales K., Munidasa M., Nazareth L., Ngo R.,
RA Nguyen L., Okwuonu G., Ongeri F., Patil S., Petrosino J., Pham C., Pham P.,
RA Pu L.-L., Puazo M., Raj R., Reid J., Rouhana J., Saada N., Shang Y.,
RA Simmons D., Thornton R., Warren J., Weissenberger G., Zhang J., Zhang L.,
RA Zhou C., Zhu D., Muzny D., Worley K., Gibbs R.;
RL Submitted (APR-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EEJ50163.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACKX01000241; EEJ50163.1; -; Genomic_DNA.
DR RefSeq; WP_007157969.1; NZ_GG668536.1.
DR AlphaFoldDB; C2L1D6; -.
DR STRING; 585501.HMPREF6123_2555; -.
DR eggNOG; ENOG502Z91S; Bacteria.
DR HOGENOM; CLU_441306_0_0_9; -.
DR InParanoid; C2L1D6; -.
DR OrthoDB; 2496946at2; -.
DR Proteomes; UP000004121; Unassembled WGS sequence.
DR GO; GO:0033926; F:glycopeptide alpha-N-acetylgalactosaminidase activity; IEA:InterPro.
DR CDD; cd14244; GH_101_like; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR043751; DUF5696.
DR InterPro; IPR025706; Endoa_GalNAc.
DR Pfam; PF18952; DUF5696; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000004121}.
SQ SEQUENCE 670 AA; 77398 MW; 9F81A5EB5623FFAC CRC64;
MNPSKILLSF EDILLEFDPD NLRYSISTGK TKWETRADFS PYIVLQEKKD NLSNGERQDS
KIDKAEKEKE ENETIKALLY FKDAKIKSHE LIRTGVGEGI RSIFKDFILP SSYSNKKEEL
LSFSFETYVW IEYSTKQVYF EWIPLTEAPI DCIDKILFPG PFAFQKKSSS WYSIIPKEQG
ILIPNTWDVS FHQDGFKGRF GTASAYLPIF GQVKDGEGYL AESLTPWNMG YEAVHEAGSM
ETSIEFRIEP SLGQMEYRRI IRYLFLSDCD YNSLLKTYRK LAREEGKLKT LKEKEVANPS
VRKLLGASFV HKGIKTCVQP DSEFFDKDAP DKNNHLTTFR KRAEEMRALK ADGIEKLYFH
LDGWGDAGYD NKHPDVGPAC KEAGGWEGMR ELSDTMKELG FLFGIHDQYR DFYKKAESYH
DDLACKSPDG SIFTHARWAG GPQAYLCTSQ APYYVRRNFE RLLDEGINLD GAYLDVFTCN
EGDECDNPRH RMRRKDSYEY RCQCFRYLLS KDILPSSEEV NDWAVPWLVF CHYAPYDFML
REPGSPKYGI PIPMFNLVYH DCLVIPWMME KLPEEDYMLY ALLNGGAPYL IRDPAYLGID
GAFTLEEEMP WEKHLERVGI VSDFHKNVGD AELVKHELLD EKGYRQKSTF ANGYAVEVDL
QTGSYSISKN
//