ID A0A1I0I1N4_9FIRM Unreviewed; 1570 AA.
AC A0A1I0I1N4;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE SubName: Full=Uncharacterized Sugar-binding Domain {ECO:0000313|EMBL:SET89608.1};
DE Flags: Fragment;
GN ORFNames=SAMN04489758_1701 {ECO:0000313|EMBL:SET89608.1};
OS Thomasclavelia cocleata.
OC Bacteria; Bacillota; Erysipelotrichia; Erysipelotrichales;
OC Coprobacillaceae; Thomasclavelia.
OX NCBI_TaxID=69824 {ECO:0000313|EMBL:SET89608.1, ECO:0000313|Proteomes:UP000198558};
RN [1] {ECO:0000313|Proteomes:UP000198558}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 1551 {ECO:0000313|Proteomes:UP000198558};
RA Varghese N., Submissions S.;
RL Submitted (OCT-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FOIN01000070; SET89608.1; -; Genomic_DNA.
DR Proteomes; UP000198558; Unassembled WGS sequence.
DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProt.
DR Gene3D; 3.30.379.10; Chitobiase/beta-hexosaminidase domain 2-like; 1.
DR Gene3D; 1.20.1270.70; Designed single chain three-helix bundle; 2.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 4.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 1.20.120.670; N-acetyl-b-d-glucoasminidase; 1.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR029018; Hex-like_dom2.
DR InterPro; IPR007781; NAGLU.
DR InterPro; IPR024732; NAGLU_C.
DR InterPro; IPR024240; NAGLU_N.
DR InterPro; IPR024733; NAGLU_tim-barrel.
DR PANTHER; PTHR12872; ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR PANTHER; PTHR12872:SF1; ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR Pfam; PF00754; F5_F8_type_C; 4.
DR Pfam; PF07554; FIVAR; 2.
DR Pfam; PF05089; NAGLU; 1.
DR Pfam; PF12972; NAGLU_C; 1.
DR Pfam; PF12971; NAGLU_N; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 4.
DR PROSITE; PS50022; FA58C_3; 4.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000198558};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..1570
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5011629196"
FT DOMAIN 29..151
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 175..318
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 323..464
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 1194..1348
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT NON_TER 1570
FT /evidence="ECO:0000313|EMBL:SET89608.1"
SQ SEQUENCE 1570 AA; 179213 MW; 958847C2802DF067 CRC64;
MNLKKKLKIS LILSLVLVLL FPTMTIFASE TKSRAARENI ALNKPVTSSK IENSNIPEYA
VDGKENTFWA SVNPSELEVD LQGFYKISEI NLMAYFDVPA NAERYYDYEI YASMDQEKYD
LVAKKSDTSW NTPQGDTYVF NDTFTAHYIK VKILKTHAEN QPNNNTGHIK ELRVYGELDP
EYENPVIKSN VALNKTVTAS GQEGSSNPGK AVDNNVNSFW AVAGEGWLEV DLEGYFDVDE
INVLPYYSDG RYYNYEVYVS VNGLDYIKAG EKKDNTPQNK DGDTYTFDTK TIRFVKVKML
SNSANPSMHI NELRVYGIEN TEYKPPIEDI DTTSIAYQKP ARSYSNSASY PVSNINDGML
STSWQALYYP AYIDIDLEGN YNINKIQVVP DFKDLQAYYQ YSIFTSIDGV TFDKVAKKDD
DTVVTKEGNL FEINGKEARI VRVYLEYCSV GNTGSFAEVR VYGEESENPV IERADINISD
FEDTEYVLPI TEDETINEVK GVLSRVIGEK YLDWFEFVLE ENSESDKDYY EISNHNGKIK
IKGNEGLSLT TGLNYYLKYY CNVSITQQAR QVKMPANVVP VTETIRKETP YEVRYAYNYC
THSYTMAFWG ETEWQNEMDW LALNGVNAIL DITGQEEVWR RFMGNLGYSL DEIKDWLVGP
GYYGWQYMAN MENVNGPIPD NWFAQRTELA RMNQRKMRAL GMTPILQGYS GMVPNSITEK
DSKAQVIPQG LWNGMQRPAM LKTNTETYKE YAKMFYQAQE EVYGKVSNYY ATDPFHEGGQ
TGGIGRDIVG RKVLDEMKVY DDDAIWVIQS WSFQPALLSE ITTEEKQNNI LLLDLNASKS
AKYSSTNEFA GSNWVYCMLE NYGGRSGVHG NLEKLTKIPS QVKDKTSHMV GMGIAPEGTN
NNPVRFDLFF EMMWEENDVD LNEWITHYVE RRYGCENENA QKAWKLLLET VYRPATHADP
PESIINVRPQ FNAKQSAPNG NMSKNYDFKR FEKALDYLMK DFEQLKGSEG YLYDVTDFLR
QAVANSAERT YSEFTKAFND GNVELFKEKS QEFLKMVELQ DQVLNSNKNF MVGTWLNASK
NASEGQDDFI KMIFQLNGKA LISTWAPYYC WGVYDYANRE YGGLTKDYYL QRWELWIQRL
TDKIEGKNVS NYNEISIKES HQMAWQWARS EKDYSSEATG DTKELYKEFV NNYCLNNGSV
DELQVDKMSI SISCDKPDYN GYPISNAIDG NAGSFWTTQT NIKGPYTVTF KFDQAETINS
FSILPRNYNS NATGNGDILG IELWVSDDGN EYRKIAQGEY EQNGSERICN FEAVTTKYVK
LVITKTLIWN NVATNASASA AEFSFYKPYA QLRSGIYTIE NNVIGNVYEN TTVKQFLAGF
DISQNGKVIV TRNGNELTEN DVIEQDDIVE YYYKNEKVKT YQIDEFAQVS DFTKLNELIL
ECEKLNQNDY TEESWNQFSE ILTLAVGVQV NPEASQEEID NAVKNLKEAY SNLVSVTVVS
KTALQIAVEM AGNVTEEQLD KVVPAVVTEF NAALEEARTI LANDNATQEE VDASFARLSV
AMHMLEFLKG
//