GenomeNet

Database: UniProt
Entry: A0A0Q4QAU7_9BACL
LinkDB: A0A0Q4QAU7_9BACL
Original site: A0A0Q4QAU7_9BACL 
ID   A0A0Q4QAU7_9BACL        Unreviewed;      1056 AA.
AC   A0A0Q4QAU7;
DT   20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT   20-JAN-2016, sequence version 1.
DT   27-MAR-2024, entry version 32.
DE   RecName: Full=Cellulosome anchor protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=ASF12_24165 {ECO:0000313|EMBL:KQN97153.1};
OS   Paenibacillus sp. Leaf72.
OC   Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX   NCBI_TaxID=1736234 {ECO:0000313|EMBL:KQN97153.1, ECO:0000313|Proteomes:UP000051722};
RN   [1] {ECO:0000313|EMBL:KQN97153.1, ECO:0000313|Proteomes:UP000051722}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Leaf72 {ECO:0000313|EMBL:KQN97153.1,
RC   ECO:0000313|Proteomes:UP000051722};
RA   Gilbert D.G.;
RL   Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|EMBL:KQN97153.1, ECO:0000313|Proteomes:UP000051722}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Leaf72 {ECO:0000313|EMBL:KQN97153.1,
RC   ECO:0000313|Proteomes:UP000051722};
RA   Schulze-Lefert P.;
RT   "Functional overlap of the Arabidopsis leaf and root microbiotas.";
RL   Submitted (NOV-2015) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Secreted, cell wall, S-layer
CC       {ECO:0000256|ARBA:ARBA00004237}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:KQN97153.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LMLV01000034; KQN97153.1; -; Genomic_DNA.
DR   RefSeq; WP_056042872.1; NZ_LMLV01000034.1.
DR   AlphaFoldDB; A0A0Q4QAU7; -.
DR   STRING; 1736234.ASF12_24165; -.
DR   Proteomes; UP000051722; Unassembled WGS sequence.
DR   GO; GO:0030115; C:S-layer; IEA:UniProtKB-SubCell.
DR   GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR   GO; GO:0000272; P:polysaccharide catabolic process; IEA:InterPro.
DR   CDD; cd00063; FN3; 3.
DR   CDD; cd08548; Type_I_cohesin_like; 1.
DR   Gene3D; 2.60.40.680; -; 2.
DR   Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR   InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR   InterPro; IPR002102; Cohesin_dom.
DR   InterPro; IPR003961; FN3_dom.
DR   InterPro; IPR036116; FN3_sf.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR001119; SLH_dom.
DR   PANTHER; PTHR43308:SF4; OUTER MEMBRANE PROTEIN ALPHA; 1.
DR   PANTHER; PTHR43308; OUTER MEMBRANE PROTEIN ALPHA-RELATED; 1.
DR   Pfam; PF00963; Cohesin; 2.
DR   Pfam; PF00041; fn3; 1.
DR   Pfam; PF00395; SLH; 3.
DR   SMART; SM00060; FN3; 3.
DR   SUPFAM; SSF49384; Carbohydrate-binding domain; 2.
DR   SUPFAM; SSF49265; Fibronectin type III; 2.
DR   PROSITE; PS50853; FN3; 2.
DR   PROSITE; PS51272; SLH; 2.
PE   4: Predicted;
KW   Cell wall {ECO:0000256|ARBA:ARBA00022601};
KW   Reference proteome {ECO:0000313|Proteomes:UP000051722};
KW   S-layer {ECO:0000256|ARBA:ARBA00022601};
KW   Secreted {ECO:0000256|ARBA:ARBA00022601}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..37
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           38..1056
FT                   /note="Cellulosome anchor protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5006230655"
FT   DOMAIN          325..424
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          517..610
FT                   /note="Fibronectin type-III"
FT                   /evidence="ECO:0000259|PROSITE:PS50853"
FT   DOMAIN          896..955
FT                   /note="SLH"
FT                   /evidence="ECO:0000259|PROSITE:PS51272"
FT   DOMAIN          956..1019
FT                   /note="SLH"
FT                   /evidence="ECO:0000259|PROSITE:PS51272"
SQ   SEQUENCE   1056 AA;  110772 MW;  EF59812F23C817E6 CRC64;
     MNYFYANRSG LRGRFSAFLA LLLVLALLLP TLPKASAAPS NTPSVRVGTV AGAPGAYIDV
     PVFFDSNSSD LYVYRSDTTI NYDPNVLEPV AGEEAAISGL VKAIDSNALL LVDKATTGSL
     KLKLTLTGFI EDSSELFTLR FKIKEGVPAG DSALTVVQGQ LYEDVNPVSA DLVSGRVTVS
     PSGNAAIEIG SASIKAGEDD IVSVPVKAKL DKEVASYGIR IKFDPDAISY DGIVEQGFMS
     NYNNTDGWLI VSWLDLTGGD SPIPASQDAQ TLFTIKFAVE DDAPIGDHAL TIETPNDVRE
     LTFTDVHAVE MNKTVSNGKV SVVVPPAAPA LVTATPGDGK IRLTWPAVPM AETYKVYLYE
     GEAGPHDVTD WVEIASGVST NAYTAEGLAN GTSYVFAVKS VNPSGASGLS PASEAVAPVS
     VPGAPTNFNA TPGSGTAVIS FVKPINAVGE PILKYRIEAL LDNGIVSAKE VPESGEGLYS
     ASFDGLFNGK TYTIRIYAVN VAGDSAPLIG TVRPVAKPLT PAILSVVPGN GQVQVTLGAP
     ANVNEAPVDR YIVTSNPGDI SVTVGGEQLS VVISGLTNGT SYTFNAVAQN AAGMSDPSAA
     SQAAIPTAPV SGGGGGASTS TEQIKVEVSD GSGKGGVVAS TTIARTTLAD GSKKDEVTLT
     WAQAEQALKQ LKEAGSSTAR IVIPDKKDEV SDLIIKLPAE AAKQFAAQGV DLQIWTNNGG
     ILIPSRSLQG LQEEVYFHIV PIKAAVERKK VEERLNEEPT LLQLATADGL QLIGRPMTIE
     TNMSSREVTL ELPVGGDLPK AELEKLGVFI EHSDGTKEWV KGEQVGYNGT DKPGIRFTIT
     KFSTFSIVKL SVDSSSHTAY IQGYSDGTFR PNQAVRRVEM AALLSRAMPG TTAGAADAAY
     SDLPSAVWAK QAIAQATAQG YMQGSDGRFM PDKLITRAEM AVIVDRLLKN GMAEGDAKAS
     FKDIDSHWAA EAIGRVSAAG IMSGSGNGLF RPNDSLTRAE AVVILNRVLE RGPLTGEKAA
     WKDVPASHWA FGHIQEASTD HRYTRSSDGG EQSLAN
//
DBGET integrated database retrieval system