GenomeNet

Database: UniProt
Entry: A0A1Y4D3W3_9BACT
LinkDB: A0A1Y4D3W3_9BACT
Original site: A0A1Y4D3W3_9BACT 
ID   A0A1Y4D3W3_9BACT        Unreviewed;      1333 AA.
AC   A0A1Y4D3W3;
DT   30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT   30-AUG-2017, sequence version 1.
DT   27-MAR-2024, entry version 24.
DE   RecName: Full=Glycosyl hydrolase family 98 putative carbohydrate-binding module domain-containing protein {ECO:0000259|SMART:SM00776};
GN   ORFNames=B5F77_05305 {ECO:0000313|EMBL:OUO53746.1};
OS   Parabacteroides sp. An277.
OC   Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Tannerellaceae;
OC   Parabacteroides.
OX   NCBI_TaxID=1965619 {ECO:0000313|EMBL:OUO53746.1, ECO:0000313|Proteomes:UP000196154};
RN   [1] {ECO:0000313|Proteomes:UP000196154}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=An277 {ECO:0000313|Proteomes:UP000196154};
RA   Medvecky M., Cejkova D., Polansky O., Karasova D., Kubasova T., Cizek A.,
RA   Rychlik I.;
RT   "Function of individual gut microbiota members based on whole genome
RT   sequencing of pure cultures obtained from chicken caecum.";
RL   Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:OUO53746.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; NFJB01000009; OUO53746.1; -; Genomic_DNA.
DR   OrthoDB; 9768004at2; -.
DR   Proteomes; UP000196154; Unassembled WGS sequence.
DR   Gene3D; 2.60.120.1060; NPCBM/NEW2 domain; 2.
DR   Gene3D; 3.90.1580.10; paralog of FGE (formylglycine-generating enzyme); 1.
DR   Gene3D; 2.120.10.30; TolB, C-terminal domain; 1.
DR   InterPro; IPR011042; 6-blade_b-propeller_TolB-like.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   InterPro; IPR013222; Glyco_hyd_98_carb-bd.
DR   InterPro; IPR040698; HZS_alpha_mid.
DR   InterPro; IPR036280; Multihaem_cyt_sf.
DR   InterPro; IPR038637; NPCBM_sf.
DR   InterPro; IPR005532; SUMF_dom.
DR   InterPro; IPR042095; SUMF_sf.
DR   PANTHER; PTHR23150:SF37; FGE-SULFATASE DOMAIN-CONTAINING PROTEIN; 1.
DR   PANTHER; PTHR23150; SULFATASE MODIFYING FACTOR 1, 2; 1.
DR   Pfam; PF03781; FGE-sulfatase; 1.
DR   Pfam; PF18582; HZS_alpha; 1.
DR   Pfam; PF08305; NPCBM; 2.
DR   SMART; SM00776; NPCBM; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF82171; DPP6 N-terminal domain-like; 1.
DR   SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR   SUPFAM; SSF48695; Multiheme cytochromes; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000196154};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..1333
FT                   /note="Glycosyl hydrolase family 98 putative carbohydrate-
FT                   binding module domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012327994"
FT   DOMAIN          102..226
FT                   /note="Glycosyl hydrolase family 98 putative carbohydrate-
FT                   binding module"
FT                   /evidence="ECO:0000259|SMART:SM00776"
SQ   SEQUENCE   1333 AA;  150311 MW;  EB21EBF178F586ED CRC64;
     MNRFQILASV LLSTSTVLWA APKQDTESWI DASVKAKSEL LAKLKEAGMP VISQWVKHKQ
     KAQPFSVDVK GLEKLVLVTA GGPDGTDYDQ AVWANARLIK ADGTEVWLDE IPYEYGVAGW
     AKPKMNTNAY DHEIIIDGKE YKHGVFCHAN GTLVYPLNGE YVRFEAEVGI DDSSSGGSVF
     FQALNVMPGK VGEALNAKYP NEISMLGSVL DGLDSWLITA DASVEKQAVE KALSHLKDKT
     YFAGLAKQIE SETDVNTQIH KYLELLEQIQ NVATIQDELA WLNVDAIQKA YDDMKKRKGY
     DTAKYGPMLD ELLALNKKGF DGIYKGDEQA MADARKALAN KRAILMGNTL LDKDKIVATR
     FKLGVKARQA MAPDLGTQAN NWSNQESARR GGFDAEIVEL SNLRGDTVQM RQVYKAKYGS
     SIADLKLHWD GDRVMFTQMM PDKRWNIHEV KLDGTGYHPM MEMKEPDLEF YDGTYLPDGR
     IIAISNIGYQ GVPCVSGDDP VGNMVLYNPK DQSMRRITFD QDANWNPTIM ANGKVMYTRW
     EYTDLTHYYS RIVMHMNPDG TEQKALYGSG SMFPNSTFDV QPLPGHPSAF VGIISGHHGV
     ARSGRLILFD PSKARKGAAG MTQEIPYHDR PIVEEIKDQL VDGVWPQFVK PMPLNDKYYL
     VSAKLSPDDL WGLYLVDVFD NVTCIYKAEG EGYISPILVR KTTTPPAIPD RVKLNDKEAT
     VFIQDIYTGE GLQGVPRGTV KELRIHAYEY AYVKTQSDHN WHGIQSGWDI KRLLGTVPVE
     EDGSVIFKIP ANTPISIQPI DKDGAAIQWM RSWLTGQPGE VVSCVGCHED QNEIAIPKRV
     IASQKAATPL KAPEGGTRSF TFDLEIQPIL DRACIACHNG EGKAFDLRGG KKDKLGYGTS
     YLNLHPYVHR QGGEGDMLVL YPYEYYQNTS ELVRLLKKGH HNVKLTEEEW KTLYNWIDYN
     APDKGYFNAN VLGKEIIPYQ GFDQIQRRIE LNNKYAGGSG VDWKKEISDY AAYLEKKGPI
     TPVMPEPEKP VKVKNLKVKG WPFDAEAIKE KLADEKETRM EIELAPGVKM TFVRIPAGQF
     VMGSYHGESD ARPTAKVKID KSFWMGELEV TNQQYNVFFP KHDSRHMDQQ WKDHVHQGYV
     ANDPDQPVIR VSYNDAMEFC KQLSEKTGLH VTLPTEAQWE WACRAGSDED FWFGNLNADF
     GKMENLADET TNLLAVSGID PQPMPKTSFW YKYYTFLPKV ESVNDGSLIQ IAGKKYEANP
     FGLYNMHGNV AEWTRSDYLP YPYKENSKET SEYKVVRGGS WVERPKFSTA YSRKAYYPWQ
     PVFNVGFRVI IED
//
DBGET integrated database retrieval system