ID A0A0Q4QAU7_9BACL Unreviewed; 1056 AA.
AC A0A0Q4QAU7;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE RecName: Full=Cellulosome anchor protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=ASF12_24165 {ECO:0000313|EMBL:KQN97153.1};
OS Paenibacillus sp. Leaf72.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX NCBI_TaxID=1736234 {ECO:0000313|EMBL:KQN97153.1, ECO:0000313|Proteomes:UP000051722};
RN [1] {ECO:0000313|EMBL:KQN97153.1, ECO:0000313|Proteomes:UP000051722}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Leaf72 {ECO:0000313|EMBL:KQN97153.1,
RC ECO:0000313|Proteomes:UP000051722};
RA Gilbert D.G.;
RL Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KQN97153.1, ECO:0000313|Proteomes:UP000051722}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Leaf72 {ECO:0000313|EMBL:KQN97153.1,
RC ECO:0000313|Proteomes:UP000051722};
RA Schulze-Lefert P.;
RT "Functional overlap of the Arabidopsis leaf and root microbiotas.";
RL Submitted (NOV-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted, cell wall, S-layer
CC {ECO:0000256|ARBA:ARBA00004237}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KQN97153.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LMLV01000034; KQN97153.1; -; Genomic_DNA.
DR RefSeq; WP_056042872.1; NZ_LMLV01000034.1.
DR AlphaFoldDB; A0A0Q4QAU7; -.
DR STRING; 1736234.ASF12_24165; -.
DR Proteomes; UP000051722; Unassembled WGS sequence.
DR GO; GO:0030115; C:S-layer; IEA:UniProtKB-SubCell.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:InterPro.
DR CDD; cd00063; FN3; 3.
DR CDD; cd08548; Type_I_cohesin_like; 1.
DR Gene3D; 2.60.40.680; -; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR002102; Cohesin_dom.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR001119; SLH_dom.
DR PANTHER; PTHR43308:SF4; OUTER MEMBRANE PROTEIN ALPHA; 1.
DR PANTHER; PTHR43308; OUTER MEMBRANE PROTEIN ALPHA-RELATED; 1.
DR Pfam; PF00963; Cohesin; 2.
DR Pfam; PF00041; fn3; 1.
DR Pfam; PF00395; SLH; 3.
DR SMART; SM00060; FN3; 3.
DR SUPFAM; SSF49384; Carbohydrate-binding domain; 2.
DR SUPFAM; SSF49265; Fibronectin type III; 2.
DR PROSITE; PS50853; FN3; 2.
DR PROSITE; PS51272; SLH; 2.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022601};
KW Reference proteome {ECO:0000313|Proteomes:UP000051722};
KW S-layer {ECO:0000256|ARBA:ARBA00022601};
KW Secreted {ECO:0000256|ARBA:ARBA00022601}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..37
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 38..1056
FT /note="Cellulosome anchor protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5006230655"
FT DOMAIN 325..424
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 517..610
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 896..955
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 956..1019
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
SQ SEQUENCE 1056 AA; 110772 MW; EF59812F23C817E6 CRC64;
MNYFYANRSG LRGRFSAFLA LLLVLALLLP TLPKASAAPS NTPSVRVGTV AGAPGAYIDV
PVFFDSNSSD LYVYRSDTTI NYDPNVLEPV AGEEAAISGL VKAIDSNALL LVDKATTGSL
KLKLTLTGFI EDSSELFTLR FKIKEGVPAG DSALTVVQGQ LYEDVNPVSA DLVSGRVTVS
PSGNAAIEIG SASIKAGEDD IVSVPVKAKL DKEVASYGIR IKFDPDAISY DGIVEQGFMS
NYNNTDGWLI VSWLDLTGGD SPIPASQDAQ TLFTIKFAVE DDAPIGDHAL TIETPNDVRE
LTFTDVHAVE MNKTVSNGKV SVVVPPAAPA LVTATPGDGK IRLTWPAVPM AETYKVYLYE
GEAGPHDVTD WVEIASGVST NAYTAEGLAN GTSYVFAVKS VNPSGASGLS PASEAVAPVS
VPGAPTNFNA TPGSGTAVIS FVKPINAVGE PILKYRIEAL LDNGIVSAKE VPESGEGLYS
ASFDGLFNGK TYTIRIYAVN VAGDSAPLIG TVRPVAKPLT PAILSVVPGN GQVQVTLGAP
ANVNEAPVDR YIVTSNPGDI SVTVGGEQLS VVISGLTNGT SYTFNAVAQN AAGMSDPSAA
SQAAIPTAPV SGGGGGASTS TEQIKVEVSD GSGKGGVVAS TTIARTTLAD GSKKDEVTLT
WAQAEQALKQ LKEAGSSTAR IVIPDKKDEV SDLIIKLPAE AAKQFAAQGV DLQIWTNNGG
ILIPSRSLQG LQEEVYFHIV PIKAAVERKK VEERLNEEPT LLQLATADGL QLIGRPMTIE
TNMSSREVTL ELPVGGDLPK AELEKLGVFI EHSDGTKEWV KGEQVGYNGT DKPGIRFTIT
KFSTFSIVKL SVDSSSHTAY IQGYSDGTFR PNQAVRRVEM AALLSRAMPG TTAGAADAAY
SDLPSAVWAK QAIAQATAQG YMQGSDGRFM PDKLITRAEM AVIVDRLLKN GMAEGDAKAS
FKDIDSHWAA EAIGRVSAAG IMSGSGNGLF RPNDSLTRAE AVVILNRVLE RGPLTGEKAA
WKDVPASHWA FGHIQEASTD HRYTRSSDGG EQSLAN
//