ID G6AUR2_9BACT Unreviewed; 1199 AA.
AC G6AUR2;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 24-JAN-2024, entry version 48.
DE SubName: Full=F5/8 type C domain protein {ECO:0000313|EMBL:EHJ41833.1};
GN ORFNames=HMPREF0673_00347 {ECO:0000313|EMBL:EHJ41833.1};
OS Leyella stercorea DSM 18206.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Leyella.
OX NCBI_TaxID=1002367 {ECO:0000313|EMBL:EHJ41833.1, ECO:0000313|Proteomes:UP000004407};
RN [1] {ECO:0000313|EMBL:EHJ41833.1, ECO:0000313|Proteomes:UP000004407}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 18206 {ECO:0000313|EMBL:EHJ41833.1,
RC ECO:0000313|Proteomes:UP000004407};
RA Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA Hall O., Minx P., Tomlinson C., Mitreva M., Hou S., Chen J., Wollam A.,
RA Pepin K.H., Johnson M., Bhonagiri V., Zhang X., Suruliraj S., Warren W.,
RA Chinwalla A., Mardis E.R., Wilson R.K.;
RL Submitted (AUG-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytosol
CC {ECO:0000256|ARBA:ARBA00004514}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHJ41833.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AFZZ01000044; EHJ41833.1; -; Genomic_DNA.
DR RefSeq; WP_007897195.1; NZ_JH379366.1.
DR AlphaFoldDB; G6AUR2; -.
DR GeneID; 78336218; -.
DR PATRIC; fig|1002367.3.peg.279; -.
DR eggNOG; COG3291; Bacteria.
DR eggNOG; COG4724; Bacteria.
DR HOGENOM; CLU_251334_0_0_10; -.
DR Proteomes; UP000004407; Unassembled WGS sequence.
DR GO; GO:0005829; C:cytosol; IEA:UniProtKB-SubCell.
DR GO; GO:0033925; F:mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0008152; P:metabolic process; IEA:UniProtKB-KW.
DR CDD; cd00146; PKD; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR032979; ENGase.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR005201; Glyco_hydro_85.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR000601; PKD_dom.
DR InterPro; IPR035986; PKD_dom_sf.
DR PANTHER; PTHR13246:SF1; CYTOSOLIC ENDO-BETA-N-ACETYLGLUCOSAMINIDASE; 1.
DR PANTHER; PTHR13246; ENDO BETA N-ACETYLGLUCOSAMINIDASE; 1.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF03644; Glyco_hydro_85; 1.
DR Pfam; PF00801; PKD; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF49299; PKD domain; 1.
DR PROSITE; PS50022; FA58C_3; 1.
DR PROSITE; PS50093; PKD; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..1199
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003485017"
FT DOMAIN 658..738
FT /note="PKD"
FT /evidence="ECO:0000259|PROSITE:PS50093"
FT DOMAIN 743..888
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
SQ SEQUENCE 1199 AA; 133753 MW; 274D6547EEA9A673 CRC64;
MKLNRLILAA VAALFVQSSY AQQPYGGCWH PDDIKNWSPE TDKDAKFNRS RVPLAKRFKE
PTLMKANSQQ YYEGQICNAP ILFPTCSMCP SQGAYNFLGY QPTYWQYMDK LVYWAGSASE
GIIIPPPAGS IDAAHQSGVK VLGQVFFPPY AFGGNQAWVR QMLTKENGVY IYAKKLYEIA
KYIGFDGWFI NEETGGGSES EWVGFIKEFN KIADANGDTH MEIQWYNAEY SPNVTILKSH
KNTSQFLEYG SPGDYRRYAS QLGCTEAETF SKIYGGVQVA SSGHTGFERD LQKAMPTSGH
VGSLDLFCPE EKTWKDNVRN LLGKNDTGPD AYYAIKETFK NEMQMWTNYA GDPTVTTDEW
SGISGHVLER SVISSMPFTT SFCVGVGKHR FVEGEKVATQ DWYHSGVQSI MPTWRYWMEN
KDDASFEINW EDAWNFGSSL KLKGGSAFSG DHLWRLYKTQ LAVNGGGTLR LVYKSSNTTN
ASVEVKLSTT SSVNPDVTLS APKTTTKNGW TVAEYDLSSL NGKTIYMIGL NVKAKFTVYD
YELSLGELSV LPANYAPAPV EVKNLATTST LGNVQGDARL TWDFDYTADF DHFDIYKENE
DGTRTMVGQT RDEAFYVPTF QRKDNEAYVK FIVTPVMKDM RQQKGKSIVL DYPQATAPVV
SFTISNSYLK VGESATITAN GTGNPTAFKW TLPEGLKLAN GFSLTDQTIT VVAEKVGKQR
VTVDVTNAIG TSTTSSNILD VMTEEEYKKI YNVVYQKKVL GYSGSTNYKE VPDKIIDGET
NPTSASDKWC NVSADNWVTF DLQSVYRIYS FKIFDGNAGP ESGVDQIDSY QILLSEDNEH
WTTVVDTYNR QKESIKTDYI APMRARYVKL VPHVNGILRI WEFEVYGKKD NSMKLSTLTN
NVTMNAGESY NINVTYDLGE GATKADKFEC VATSKNGNVT IGTIKENVKK NTFVIPVTAK
QRMGDDVVTI RVVNGDDYEE IAVQVEVDAT TQPNVLKGKT AEVRHYENDY SFEAAFKKYD
VSGLTDGNKA DEALTDIEAP STHRDDVWVI FTAPEGKQWD LAKVIVNIPN ENYAENDNNE
MNYVNKDVKI AVGDDLTRMN TAKTFSGLKK VSELSYIFPE SVKTKYLAVI CNLYVYSLPS
MAEVSAYEQF ADPTAINNAT VQNAVKGIYT VNGTKLNALQ KGLNIVKFAD GTVQKVLVK
//