ID A0A0Q9U3W0_9BACL Unreviewed; 855 AA.
AC A0A0Q9U3W0;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE RecName: Full=Solute-binding protein family 5 domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=ASG93_01230 {ECO:0000313|EMBL:KRF43574.1};
OS Paenibacillus sp. Soil787.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX NCBI_TaxID=1736411 {ECO:0000313|EMBL:KRF43574.1, ECO:0000313|Proteomes:UP000051948};
RN [1] {ECO:0000313|EMBL:KRF43574.1, ECO:0000313|Proteomes:UP000051948}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Soil787 {ECO:0000313|EMBL:KRF43574.1,
RC ECO:0000313|Proteomes:UP000051948};
RA Gilbert D.G.;
RL Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KRF43574.1, ECO:0000313|Proteomes:UP000051948}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Soil787 {ECO:0000313|EMBL:KRF43574.1,
RC ECO:0000313|Proteomes:UP000051948};
RA Schulze-Lefert P.;
RT "Functional overlap of the Arabidopsis leaf and root microbiotas.";
RL Submitted (NOV-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the bacterial solute-binding protein 5 family.
CC {ECO:0000256|ARBA:ARBA00005695}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KRF43574.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LMSP01000001; KRF43574.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0Q9U3W0; -.
DR STRING; 1736411.ASG93_01230; -.
DR Proteomes; UP000051948; Unassembled WGS sequence.
DR CDD; cd08504; PBP2_OppA; 1.
DR Gene3D; 3.30.457.10; Copper amine oxidase-like, N-terminal domain; 1.
DR Gene3D; 2.20.110.10; Histone H3 K4-specific methyltransferase SET7/9 N-terminal domain; 2.
DR Gene3D; 3.40.190.10; Periplasmic binding protein-like II; 1.
DR InterPro; IPR012854; Cu_amine_oxidase-like_N.
DR InterPro; IPR036582; Mao_N_sf.
DR InterPro; IPR003409; MORN.
DR InterPro; IPR039424; SBP_5.
DR InterPro; IPR000914; SBP_5_dom.
DR PANTHER; PTHR30290; PERIPLASMIC BINDING COMPONENT OF ABC TRANSPORTER; 1.
DR PANTHER; PTHR30290:SF10; PERIPLASMIC OLIGOPEPTIDE-BINDING PROTEIN-RELATED; 1.
DR Pfam; PF07833; Cu_amine_oxidN1; 1.
DR Pfam; PF02493; MORN; 5.
DR Pfam; PF00496; SBP_bac_5; 1.
DR SMART; SM00698; MORN; 3.
DR SUPFAM; SSF55383; Copper amine oxidase, domain N; 1.
DR SUPFAM; SSF82185; Histone H3 K4-specific methyltransferase SET7/9 N-terminal domain; 2.
DR SUPFAM; SSF53850; Periplasmic binding protein-like II; 1.
PE 3: Inferred from homology;
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..30
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 31..855
FT /note="Solute-binding protein family 5 domain-containing
FT protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5006384979"
FT DOMAIN 52..144
FT /note="Copper amine oxidase-like N-terminal"
FT /evidence="ECO:0000259|Pfam:PF07833"
FT DOMAIN 387..776
FT /note="Solute-binding protein family 5"
FT /evidence="ECO:0000259|Pfam:PF00496"
SQ SEQUENCE 855 AA; 94886 MW; F2E7A5134DAA2277 CRC64;
MEGVVFVRKK LIAFLLSLAL LTSIPSLSQA DDSIPVLLNG EAIHFEVQPF IEEGTTMVPF
RSIFEKLGLA VSWDGEAQTI TGYSSDLEIR LQIGSATAIV DGKPQELTIA PEIKDGSTFV
PLRFIGESSG KKVIWDARNH TVLIRDGLSN YLENTLYITS KNLTYVGDQK DGHVSIYSDG
KLVFEGEFKD YKINGTGTLY WQGGQKYYEG QWFNGLMNGI GKLYNEDGTL WYDQIKMSDN
IIGGNGSFYL SNGWVYKGEL VNGTATGIGK YYDPSGALFF EGESKNFKFE GQGKIYFTDG
KVLFVGEFHN SMANGEGIQY NEDGSIKFKG LFKDGNPVID KVATQEIGIN FSAEPPVLDS
SIATANAAFT MINAFNEGLY RLDKDGKVQP GLAKEMPKIT NNGLTYTIAL RDAKWSDGTS
VKAADFVNAW KRTLDPATKA QYSFLLERIK GGEDVTKAKT PNAVKKAKDS LGLKAIDDNT
LEINLERPVA YFPSLLAFPV FFPQKMDFVA ALGNQYGTDA DKVIGAGPFK LVKWNHGQTL
EFVKNDNYWD AMNVKLNKIT VYIVKDGSTG LNLYASNVAD VSELGADFLN LYQGKPDIVF
KPELTTSYLM FQEKKFPAFA NAKVRQALTL AIDRYAFIKK VLNNGSAPST GFVPNGTLDG
NNQSFRAVAG DLTEPKFDTV KAKQLFAEGL QELGMTSLPS FKLTSDDTVT AKKTLEFIQA
QWKTNLGIDM TPDPILHMNR VEKQSKHDFD AVVALWGADY NDPMTFLDMW GTGGEFNEVD
WSNAQYDELV SSARNETDPE TRSKLLVDAE KILMQEMPVG PLYFRNRIFV RKPNVEGIFF
PSFGVEWELK WASVK
//