ID A0A239RMC9_9BACT Unreviewed; 1482 AA.
AC A0A239RMC9;
DT 25-OCT-2017, integrated into UniProtKB/TrEMBL.
DT 25-OCT-2017, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE SubName: Full=F5/8 type C domain-containing protein {ECO:0000313|EMBL:SNU11968.1};
GN ORFNames=SAMN06298210_1167 {ECO:0000313|EMBL:SNU11968.1};
OS Prevotellaceae bacterium KH2P17.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae.
OX NCBI_TaxID=1945886 {ECO:0000313|EMBL:SNU11968.1, ECO:0000313|Proteomes:UP000214740};
RN [1] {ECO:0000313|EMBL:SNU11968.1, ECO:0000313|Proteomes:UP000214740}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=KH2P17 {ECO:0000313|EMBL:SNU11968.1,
RC ECO:0000313|Proteomes:UP000214740};
RA Sun Z.S., Albrecht U., Echele G., Lee C.C.;
RL Submitted (JUL-2017) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FZRE01000016; SNU11968.1; -; Genomic_DNA.
DR Proteomes; UP000214740; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:UniProt.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProt.
DR CDD; cd00146; PKD; 1.
DR Gene3D; 2.60.120.200; -; 2.
DR Gene3D; 3.90.70.10; Cysteine proteinases; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 4.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR006558; LamG-like.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR022409; PKD/Chitinase_dom.
DR InterPro; IPR000601; PKD_dom.
DR InterPro; IPR035986; PKD_dom_sf.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF13385; Laminin_G_3; 2.
DR SMART; SM00560; LamGL; 2.
DR SMART; SM00089; PKD; 3.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 2.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF49299; PKD domain; 2.
DR PROSITE; PS50022; FA58C_3; 1.
DR PROSITE; PS50093; PKD; 2.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000214740};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..19
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 20..1482
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5011228198"
FT DOMAIN 895..981
FT /note="PKD"
FT /evidence="ECO:0000259|PROSITE:PS50093"
FT DOMAIN 979..1055
FT /note="PKD"
FT /evidence="ECO:0000259|PROSITE:PS50093"
FT DOMAIN 1286..1434
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
SQ SEQUENCE 1482 AA; 160565 MW; C86C39004F9CC70E CRC64;
MKNFTLIALT LLLSATAQAQ LYKDLPKGDY YPGYNPVVKP DASLSLRRQR MSRAPGAEAA
YPDHWNTGAQ RYFPPIFSQG GYGSCGVSSH VGYMMTSELN AWQNTDASLP ENQLTPMFEY
PFTYNGPGKD DMALYVGFPT ADVYGGRYES SIYMPSEHKG SNWSWMQGYK SFYNAMLHRI
SYASNFPEST NTEEGRAAVK AYLYNHNGDE TFGGRGGTAC VGVGIASSGL ATVPASDINR
ANGFVGQRYL EHWNIGSGNY DHAMCMVGYD DRIQFDLDGN GQVGEITNTL GQNENGAWIM
AQSYGAGWGN AGFVYCPYAL DGGVSQEVTT PAGKKAYKQY GGWQPYVYHY RTDYTPLRTM
KVLMDYSKRS EISVVAGVAQ DTAAAKPEKT FQFAYINYTG DGDGDGKDAA TPLLGTWNDG
ISHTEPMEFG IDLTDLTDGL DSSRPLKYFL IVNSKSTADG NGHIHAASII DYEFNKLGVE
FPLDSKNVEI QTAGKQTIIS VVVKGEPLHA PTNLTLSGNQ LSWTAPTGTP YIPTSYYIYR
DDNKVGESTS TNFTTDGTAG SYCVKAAYMI NGTEHLSAAS NLVGASSALT AEQAYDNNAL
AFESGKFFVP DVVNEAHDSY TVEMWLNPNK LINWADFMFH NLWGAKYLMH TSADGSISAG
WNNNTDRIDT PANTLVVGKW THIALVIEGN NHRIYADGNL VAEGQGSVSG FPAYWAGRLY
FGDKNSLNGT IDELRLWTCA RSAADIKENM HQPIADPTHT PNLQAYFKMD TYDKDGKTYI
KDWVGGHDAE ILGGTRTAAQ TNGATISRDS QPVAIIQAPA KAVMGEPISI QAATSVATTA
YTWTATNAVP ATSTLRVPTF TFNKTGIQTL KLKTTDLSGA TAEAEATIEI EQIMPTADFV
LSSEGTTGND RISFIAQNKA PGCTYLWSMP GADNETASTA NASASYATMG TKKVTLTVTG
PDGATYTSTQ SFEVRKSTPA TAYQITPQVV VKGSPVQLTD QSGYAPTSWK WSFRSSNSYF
AFVGKEGEIT PEVPGIYTLT YTVGNEIGNS SITANRALIV CNAASETGLN FYEGGSQRMD
ATLSAAPFGA WTIDYWLNPL QLSNSSLGIQ TADSNGAAGM KIVSDASGNV KLSFGEASAS
ADGFYIPGEW HHYAITHSGT TVSFYRDGTQ VGTTIDIGVA DFSNAFTQLT LGGTAAPMKG
SIDELRVWNR ALSQNNLREV AVAPISDVNA AMQNRGLLAY WQFNQPDGNA VDACGHAEGK
LTGFANTVNY YTESKGVFAL NFDPAANEPV VGELLAKSNF SFVSVSDYDS SENATGNQAI
DADEKTFWHS KWEGGETVYP HSITLKRTDC DTLRTLQFCY ARADRYRAAN VTVEQSADNV
NWQVLDSDHV LFSFAQQNMV LAQPATAPYV RITFNSGITG GNLLCLNEIN FYGTKHNTSN
GISVLQGTET GPQVIYDLQG RRVNRPLLPG IYIINGKKQI VR
//