ID A0A1Y4D3W3_9BACT Unreviewed; 1333 AA.
AC A0A1Y4D3W3;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE RecName: Full=Glycosyl hydrolase family 98 putative carbohydrate-binding module domain-containing protein {ECO:0000259|SMART:SM00776};
GN ORFNames=B5F77_05305 {ECO:0000313|EMBL:OUO53746.1};
OS Parabacteroides sp. An277.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Tannerellaceae;
OC Parabacteroides.
OX NCBI_TaxID=1965619 {ECO:0000313|EMBL:OUO53746.1, ECO:0000313|Proteomes:UP000196154};
RN [1] {ECO:0000313|Proteomes:UP000196154}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=An277 {ECO:0000313|Proteomes:UP000196154};
RA Medvecky M., Cejkova D., Polansky O., Karasova D., Kubasova T., Cizek A.,
RA Rychlik I.;
RT "Function of individual gut microbiota members based on whole genome
RT sequencing of pure cultures obtained from chicken caecum.";
RL Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OUO53746.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NFJB01000009; OUO53746.1; -; Genomic_DNA.
DR OrthoDB; 9768004at2; -.
DR Proteomes; UP000196154; Unassembled WGS sequence.
DR Gene3D; 2.60.120.1060; NPCBM/NEW2 domain; 2.
DR Gene3D; 3.90.1580.10; paralog of FGE (formylglycine-generating enzyme); 1.
DR Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR013222; Glyco_hyd_98_carb-bd.
DR InterPro; IPR040698; HZS_alpha_mid.
DR InterPro; IPR036280; Multihaem_cyt_sf.
DR InterPro; IPR038637; NPCBM_sf.
DR InterPro; IPR005532; SUMF_dom.
DR InterPro; IPR042095; SUMF_sf.
DR PANTHER; PTHR23150:SF37; FGE-SULFATASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR23150; SULFATASE MODIFYING FACTOR 1, 2; 1.
DR Pfam; PF03781; FGE-sulfatase; 1.
DR Pfam; PF18582; HZS_alpha; 1.
DR Pfam; PF08305; NPCBM; 2.
DR SMART; SM00776; NPCBM; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF82171; DPP6 N-terminal domain-like; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF48695; Multiheme cytochromes; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000196154};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1333
FT /note="Glycosyl hydrolase family 98 putative carbohydrate-
FT binding module domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012327994"
FT DOMAIN 102..226
FT /note="Glycosyl hydrolase family 98 putative carbohydrate-
FT binding module"
FT /evidence="ECO:0000259|SMART:SM00776"
SQ SEQUENCE 1333 AA; 150311 MW; EB21EBF178F586ED CRC64;
MNRFQILASV LLSTSTVLWA APKQDTESWI DASVKAKSEL LAKLKEAGMP VISQWVKHKQ
KAQPFSVDVK GLEKLVLVTA GGPDGTDYDQ AVWANARLIK ADGTEVWLDE IPYEYGVAGW
AKPKMNTNAY DHEIIIDGKE YKHGVFCHAN GTLVYPLNGE YVRFEAEVGI DDSSSGGSVF
FQALNVMPGK VGEALNAKYP NEISMLGSVL DGLDSWLITA DASVEKQAVE KALSHLKDKT
YFAGLAKQIE SETDVNTQIH KYLELLEQIQ NVATIQDELA WLNVDAIQKA YDDMKKRKGY
DTAKYGPMLD ELLALNKKGF DGIYKGDEQA MADARKALAN KRAILMGNTL LDKDKIVATR
FKLGVKARQA MAPDLGTQAN NWSNQESARR GGFDAEIVEL SNLRGDTVQM RQVYKAKYGS
SIADLKLHWD GDRVMFTQMM PDKRWNIHEV KLDGTGYHPM MEMKEPDLEF YDGTYLPDGR
IIAISNIGYQ GVPCVSGDDP VGNMVLYNPK DQSMRRITFD QDANWNPTIM ANGKVMYTRW
EYTDLTHYYS RIVMHMNPDG TEQKALYGSG SMFPNSTFDV QPLPGHPSAF VGIISGHHGV
ARSGRLILFD PSKARKGAAG MTQEIPYHDR PIVEEIKDQL VDGVWPQFVK PMPLNDKYYL
VSAKLSPDDL WGLYLVDVFD NVTCIYKAEG EGYISPILVR KTTTPPAIPD RVKLNDKEAT
VFIQDIYTGE GLQGVPRGTV KELRIHAYEY AYVKTQSDHN WHGIQSGWDI KRLLGTVPVE
EDGSVIFKIP ANTPISIQPI DKDGAAIQWM RSWLTGQPGE VVSCVGCHED QNEIAIPKRV
IASQKAATPL KAPEGGTRSF TFDLEIQPIL DRACIACHNG EGKAFDLRGG KKDKLGYGTS
YLNLHPYVHR QGGEGDMLVL YPYEYYQNTS ELVRLLKKGH HNVKLTEEEW KTLYNWIDYN
APDKGYFNAN VLGKEIIPYQ GFDQIQRRIE LNNKYAGGSG VDWKKEISDY AAYLEKKGPI
TPVMPEPEKP VKVKNLKVKG WPFDAEAIKE KLADEKETRM EIELAPGVKM TFVRIPAGQF
VMGSYHGESD ARPTAKVKID KSFWMGELEV TNQQYNVFFP KHDSRHMDQQ WKDHVHQGYV
ANDPDQPVIR VSYNDAMEFC KQLSEKTGLH VTLPTEAQWE WACRAGSDED FWFGNLNADF
GKMENLADET TNLLAVSGID PQPMPKTSFW YKYYTFLPKV ESVNDGSLIQ IAGKKYEANP
FGLYNMHGNV AEWTRSDYLP YPYKENSKET SEYKVVRGGS WVERPKFSTA YSRKAYYPWQ
PVFNVGFRVI IED
//