ID A0A0Q9LBZ0_9BACL Unreviewed; 1103 AA.
AC A0A0Q9LBZ0;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE RecName: Full=Bacterial Ig-like domain-containing protein {ECO:0000259|Pfam:PF07532};
GN ORFNames=ASG81_09370 {ECO:0000313|EMBL:KRE47074.1};
OS Paenibacillus sp. Soil522.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX NCBI_TaxID=1736388 {ECO:0000313|EMBL:KRE47074.1, ECO:0000313|Proteomes:UP000051180};
RN [1] {ECO:0000313|Proteomes:UP000051180}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Soil522 {ECO:0000313|Proteomes:UP000051180};
RA Garrido-Oter R., Bai Y.;
RL Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KRE47074.1, ECO:0000313|Proteomes:UP000051180}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Soil522 {ECO:0000313|EMBL:KRE47074.1,
RC ECO:0000313|Proteomes:UP000051180};
RA Schulze-Lefert P.;
RT "Functional overlap of the Arabidopsis leaf and root microbiotas.";
RL Submitted (NOV-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 43 family.
CC {ECO:0000256|ARBA:ARBA00009865}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRE47074.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LMRV01000033; KRE47074.1; -; Genomic_DNA.
DR RefSeq; WP_056632386.1; NZ_LMRV01000033.1.
DR AlphaFoldDB; A0A0Q9LBZ0; -.
DR STRING; 1736388.ASG81_09370; -.
DR OrthoDB; 273314at2; -.
DR Proteomes; UP000051180; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd18825; GH43_CtGH43-like; 1.
DR Gene3D; 2.60.120.430; Galactose-binding lectin; 2.
DR InterPro; IPR011081; Big_4.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR006710; Glyco_hydro_43.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR PANTHER; PTHR22925:SF3; ARABINANASE_LEVANSUCRASE_INVERTASE; 1.
DR PANTHER; PTHR22925; GLYCOSYL HYDROLASE 43 FAMILY MEMBER; 1.
DR Pfam; PF07532; Big_4; 1.
DR Pfam; PF04616; Glyco_hydro_43; 1.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
PE 3: Inferred from homology;
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..31
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 32..1103
FT /note="Bacterial Ig-like domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5006377636"
FT DOMAIN 682..731
FT /note="Bacterial Ig-like"
FT /evidence="ECO:0000259|Pfam:PF07532"
SQ SEQUENCE 1103 AA; 119838 MW; 5D764A68D848E063 CRC64;
MLFPLSKKWV SAALGCMLAL SAPLSIPLAH ASDAEVNAIN PNLPFTNEEL KNNDYILYFV
NAGDSTPATV EGTDKFGLLS SVTEQVYGID PVSGKYWGLN NPAASKTSVS DSSSKSGSLR
YYSGTQVRDK ALKYSFELPE GDYDVTFGYK NPWSGRSVNM FAEGTNLSGD YAIGSYSAET
EVTYNKIHVS DGQLNVAIQG PATAALTNHN DPLINYLIIR QNVTIPLSDL EDKLAEALVY
SADATYTKYS VNFLNTVIDA AQYVARTLSA SGTDISSESN QKQIRSSIAS LNEAIASLVV
FKVNTSFSPG DVWTDTNGAP IQAHGGGIIY DEKTSKYYWY GEDKTDGYLP ARGVHVYSST
DLYNWTDEGL ALRAIASMEA FETDPQFSQL YAGRDDKAEI LNDIGTNRII ERPKVIYNET
TGKYVMWMHT DGPTATSTAN YAKAEAGYAL SDSPTGPFVY GESFRMDRAP KDATYNGQPN
QPGMARDMTL FKDDDGTAYL IYSSEENLTM YISKLNDTYT DVVGWHKDGN LERDTEYKAV
YGEDYVRVFP GAQREAPQVF KYEGKYYMVS SGATGWDPNA AKYTVADDIF GEWKALRYFA
PSSSTTFGSQ GTAIIPVDAE EGKFIYMGDR WKSSDLADSR YIWLPIEFGN DDEIVLNWYD
EWELSELDRM GKITVNTELP SQTILGEQPQ FPSTVNVTKS NGEVINSPVV WNITASTFAK
PGVVNVTGTL SNLADKVINT TVYIVPDTYS YFVHAGGAAT SDYLTMTSYM QDVLLNPGTI
DQAYDPAKGQ TWGYVGTGTN SSGSAGDDLY SALRYLKSNS GDDLTYQFDL ENGVYHVYTG
LYDPWYQYTN GSRKANIVIN GETKTSNYVF TSAKDTLGYM NVKVTDGKLT VTVHRVAGAP
EPQISWIMVS NAEKTAGQAA NTVTNVDAPA QDATALILPA VEEGFEIAIK SSDSEIITAD
GTIAPPKADT TVTLVFTVTR ASDGSVADTR EIQVVVPART VTAADVAETI TSIAEPERKA
AQLALPAVPE GFAIVIKSSD SAVVTTDGVI DPPKLDTVVH FVLEVTRLLD GTTSAVSIAV
TIPSQNNGNH NGMVKGNSNE NSI
//