ID R5KL36_9CLOT Unreviewed; 780 AA.
AC R5KL36;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE SubName: Full=Alpha-N-acetylgalactosaminidase {ECO:0000313|EMBL:CCY67510.1};
GN ORFNames=BN753_01180 {ECO:0000313|EMBL:CCY67510.1};
OS Clostridium sp. CAG:678.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262831 {ECO:0000313|EMBL:CCY67510.1, ECO:0000313|Proteomes:UP000017959};
RN [1] {ECO:0000313|EMBL:CCY67510.1, ECO:0000313|Proteomes:UP000017959}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:678 {ECO:0000313|Proteomes:UP000017959};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCY67510.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAYP010000051; CCY67510.1; -; Genomic_DNA.
DR AlphaFoldDB; R5KL36; -.
DR STRING; 1262831.BN753_01180; -.
DR Proteomes; UP000017959; Unassembled WGS sequence.
DR Gene3D; 3.20.20.70; Aldolase class I; 1.
DR InterPro; IPR013785; Aldolase_TIM.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017959}.
SQ SEQUENCE 780 AA; 88980 MW; AF89459996980EC7 CRC64;
MFKLKSKKLE REFNVTDGFL YASQIKNTYS GMDLIPDGSG SEFEIRFKDG DTLSSKSLAV
GEAVERDGKL FFRFKEEMHT TVTMSYRIGS DGETLEKQIA IEQSEPKTID YVMLENIGIV
NSKTSYTTNG GATETDEFYS NLGQPFYIDS LFFGCVFPGT KNGVFHGRGE IVYFIGKSAE
RKIVCPTTVM GAAKSGMMVD LKKAFFEYID RIAVKSPFRV QYNTWYDRMM DIDADNTQAA
FYKVESKLSS NGVPPLDGYV IDDGWNNYKA GFWSFNQKRF PNGVLDLSYL TKCLGSSFGL
WLGPRGGYNF NSKFAKRIER AGNGYYNAES DDICVCSKTY LNKLKDFLVQ TTRENDIAYW
KLDGFALKPC KNPKHDHITG GDHDMYYITE LWRRWIKIFK ALRETRSAQG KDLWINFTCY
VNPSPWWLQY VNSIWLQNSK DIGFAENYPE GEQSQADAEM TYRDSVYYDF IVNRGLQFPM
GNIYNHEPIY GREAHLDYTD AEFEKAFFWN ACRGAALNEL YISHSMMNDE KWRILARVLN
WQRANHPILK HAMMLGGDPA DNNVYCYAAW TQDGEGIIGL RNPTHEAAPL TLTLNKLMGC
PESLKDVRRF NVLNKNGAES SDLYNYNDKI NLTLAPFEVK IFQFGKTDKR YTGADEGNDF
TIVFDYDGND GVICQNDDIL IRVDKGMITV HMGTLTLRSE NSIRSSSHTV TVVREKNRMV
KLYVDSSLDC SAYEERGKAV LSCELTSGAE GFKVIHSATP YNDIVEIGGI LKKSRKRRRN
//