ID W4B339_9BACL Unreviewed; 1741 AA.
AC W4B339;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE SubName: Full=S-layer protein {ECO:0000313|EMBL:ETT37521.1};
GN ORFNames=C161_13458 {ECO:0000313|EMBL:ETT37521.1};
OS Paenibacillus sp. FSL R5-192.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX NCBI_TaxID=1226754 {ECO:0000313|EMBL:ETT37521.1, ECO:0000313|Proteomes:UP000019041};
RN [1] {ECO:0000313|EMBL:ETT37521.1, ECO:0000313|Proteomes:UP000019041}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=FSL R5-192 {ECO:0000313|EMBL:ETT37521.1,
RC ECO:0000313|Proteomes:UP000019041};
RX PubMed=24422886; DOI=10.1186/1471-2164-15-26;
RA Moreno Switt A.I., Andrus A.D., Ranieri M.L., Orsi R.H., Ivy R.,
RA den Bakker H.C., Martin N.H., Wiedmann M., Boor K.J.;
RT "Genomic comparison of sporeforming bacilli isolated from milk.";
RL BMC Genomics 15:26-26(2014).
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 43 family.
CC {ECO:0000256|ARBA:ARBA00009865}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ETT37521.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASPR01000014; ETT37521.1; -; Genomic_DNA.
DR RefSeq; WP_036670678.1; NZ_ASPR01000014.1.
DR PATRIC; fig|1226754.4.peg.2739; -.
DR Proteomes; UP000019041; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd00063; FN3; 4.
DR CDD; cd18825; GH43_CtGH43-like; 1.
DR Gene3D; 2.60.120.430; Galactose-binding lectin; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 4.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR006710; Glyco_hydro_43.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR001119; SLH_dom.
DR PANTHER; PTHR22925:SF3; ARABINANASE_LEVANSUCRASE_INVERTASE; 1.
DR PANTHER; PTHR22925; GLYCOSYL HYDROLASE 43 FAMILY MEMBER; 1.
DR Pfam; PF00041; fn3; 1.
DR Pfam; PF04616; Glyco_hydro_43; 1.
DR Pfam; PF00395; SLH; 2.
DR SMART; SM00060; FN3; 4.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 3.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS50853; FN3; 3.
DR PROSITE; PS51272; SLH; 3.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..1741
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004838400"
FT DOMAIN 313..402
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 403..492
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1119..1212
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 1551..1612
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1613..1676
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1682..1741
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT REGION 1297..1368
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1306..1345
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1741 AA; 188019 MW; A30AD92B3B94174B CRC64;
MDKLRKPMIF MLSATLALSS GPLMLPSKAY ADLSSIPATL LQDDFSDGNY TESPAWNVSS
GNWEVIADPT DASNFTLFQS DTNEGIISTG DSMSDMTVSM RFYTGAGQGY PGILPRFQDK
SNYYYFQMQV PNNKLVFSKK VNNGDTTLKT VDYAFAKDTW YTLKIVLSGA TIRGYIAENG
SDRLVFDLND SSIGSGVVGI RNKWQSVHMD DVIIAEQPPV NDALLAIAEQ TASSVSLQWS
EIVGASAYRL YRSSTPEGGY SLVTSTGSLE HTDVGLSGDT VYYYKLAYEY GGLTESLWTA
PLEVRTIAAA PQAPGELKAE APNATSVKLS WSAVDKSTGY RVVRAEAGSE QYEQIYEGKG
LTFTDNALEP GTSYSYRVTA YNAAGESAFT VAEATTYSID SPAEFAATAV TDTSISLGWN
VLPGSDVTYT VSRATSATGT YQQVYSGKEN TYNDSGLTMG TGYFYIIQAT VDGVTSPASA
PRGVATIRTS ITPGQLWPDL DGKPIDAHGA GFFYDEQTET YYWYGEYHKG GWPAVGVRVY
SSKDLMNWKD EGMALTTLQS MDDFDNDPLI SKLYAGREDR VDIWADIRKG RIIERPKVIY
NDKTKKYVMW AHMDGDKDPY NDNANYGKAR AGYAISDSPT GPFVYQKSYR MDRAPEGEKD
FFPTDKGMAR DMTLFKDDDG TGYLIYSSEE NLTLYISKLN EDYSDVTGWH KEGRTDDKGN
PVRDSTYQAE YGVDYVRVFP GGQREAPAMF KYQGKYYILT SGASGWAPNE NKVTVADNIF
GPWSTQTNPF VRTLPSDPDP GKAFGTQTTS VIPVDPEKGK FIYVGDTWNG GNFSNDAAKY
VFLPIEFGIG SDIAIKWYNS WTPDLLNSMG KVDIADPLPE AVALGKVPSL PTTLNVRDGG
ALVSTPAVWT IDNRTMTAED FAKPGPLTLQ VTTPEYNNKK QAVRVNVIPE NTLYFVNSGG
YETADYSLMG AYMKGTLANP GTADQMYAPA EGHNWGYVSA DALASGSNGG DIFSTVRYLN
GGNVSNSPKG TDLTYRFDVP NGTYDVYAGF NDPWTNTSRR ANFIINGANT GAVTFTPASV
RANKGISVSD NKLELTVRNT ASQDPMISWI MIVKPDAAPP ANDSAGLIAD AIDSTSATLR
WDAHLGAASY KLYRSDREQG EYKVVYSGNG REYTDSELNP GTEYYYKVEA FDATGQSLRG
VSSAYQVHTA QQSAADVATG ITALEQPSAG AKKLKLPSVP QGFTVKIASS SVPSVIQTDG
TIVPPSKETT VTLELEITRT SDDSRALTIP LTVKVPAYVR SPGGTDPGSG GNSGGNSGGN
PGNGSGSSSN GGQSGSGNSS AENAVPQPKP EKDRSVLELQ GQSDQKGVVQ SNVDVSTIKD
AFKVAPSTDA GQRLVELRLK PVSGATAYEL SLPASVLIDQ GESHVFNIVT ELGMLELPAT
LLMKDIVGDG IASIRLVRTE LPKTVADQLG TQYGVQLELQ LDGQPWPSES GLNLRLPFQS
SQNAQQDRIV AFAIGANGVA TPLPQSYYDQ KSGQLVLSVT SFTGNYAVVS VEQTFTDLAE
VLWAKKAMEA LAVRGVIDAE AKGDSTQLHP KQEMTRGQYM QWLMTALGLN ASSGNAFSDV
NEKASYYEAV TAARSLGITS GTGDGRFLPE STITRQEMMT LTVRALAVAG LVDSETAATD
NLTRFRDASE IRSYARDSVA LLVDLGIAHG YNGEVNPLAE ATRAESATLL YAMMDKLVWN
K
//