Database: UniProt
Entry: R5KL36_9CLOT
ID   R5KL36_9CLOT            Unreviewed;       780 AA.
AC   R5KL36;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   24-JAN-2024, entry version 21.
DE   SubName: Full=Alpha-N-acetylgalactosaminidase {ECO:0000313|EMBL:CCY67510.1};
GN   ORFNames=BN753_01180 {ECO:0000313|EMBL:CCY67510.1};
OS   Clostridium sp. CAG:678.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC   Clostridium.
OX   NCBI_TaxID=1262831 {ECO:0000313|EMBL:CCY67510.1, ECO:0000313|Proteomes:UP000017959};
RN   [1] {ECO:0000313|EMBL:CCY67510.1, ECO:0000313|Proteomes:UP000017959}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=MGS:678 {ECO:0000313|Proteomes:UP000017959};
RA   Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA   Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA   Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA   Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA   Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA   Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA   Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA   Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA   Wang J., Brunak S., Ehrlich S.D.;
RT   "Dependencies among metagenomic species, viruses, plasmids and units of
RT   genetic variation.";
RL   Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:CCY67510.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CAYP010000051; CCY67510.1; -; Genomic_DNA.
DR   AlphaFoldDB; R5KL36; -.
DR   STRING; 1262831.BN753_01180; -.
DR   Proteomes; UP000017959; Unassembled WGS sequence.
DR   Gene3D; 3.20.20.70; Aldolase class I; 1.
DR   InterPro; IPR013785; Aldolase_TIM.
DR   InterPro; IPR017853; Glycoside_hydrolase_SF.
DR   SUPFAM; SSF51445; (Trans)glycosidases; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000017959}.
SQ   SEQUENCE   780 AA;  88980 MW;  AF89459996980EC7 CRC64;
     MFKLKSKKLE REFNVTDGFL YASQIKNTYS GMDLIPDGSG SEFEIRFKDG DTLSSKSLAV
     GEAVERDGKL FFRFKEEMHT TVTMSYRIGS DGETLEKQIA IEQSEPKTID YVMLENIGIV
     NSKTSYTTNG GATETDEFYS NLGQPFYIDS LFFGCVFPGT KNGVFHGRGE IVYFIGKSAE
     RKIVCPTTVM GAAKSGMMVD LKKAFFEYID RIAVKSPFRV QYNTWYDRMM DIDADNTQAA
     FYKVESKLSS NGVPPLDGYV IDDGWNNYKA GFWSFNQKRF PNGVLDLSYL TKCLGSSFGL
     WLGPRGGYNF NSKFAKRIER AGNGYYNAES DDICVCSKTY LNKLKDFLVQ TTRENDIAYW
     KLDGFALKPC KNPKHDHITG GDHDMYYITE LWRRWIKIFK ALRETRSAQG KDLWINFTCY
     VNPSPWWLQY VNSIWLQNSK DIGFAENYPE GEQSQADAEM TYRDSVYYDF IVNRGLQFPM
     GNIYNHEPIY GREAHLDYTD AEFEKAFFWN ACRGAALNEL YISHSMMNDE KWRILARVLN
     WQRANHPILK HAMMLGGDPA DNNVYCYAAW TQDGEGIIGL RNPTHEAAPL TLTLNKLMGC
     PESLKDVRRF NVLNKNGAES SDLYNYNDKI NLTLAPFEVK IFQFGKTDKR YTGADEGNDF
     TIVFDYDGND GVICQNDDIL IRVDKGMITV HMGTLTLRSE NSIRSSSHTV TVVREKNRMV
     KLYVDSSLDC SAYEERGKAV LSCELTSGAE GFKVIHSATP YNDIVEIGGI LKKSRKRRRN
//