ID A0A3D9JN31_9BACL Unreviewed; 959 AA.
AC A0A3D9JN31;
DT 16-JAN-2019, integrated into UniProtKB/TrEMBL.
DT 16-JAN-2019, sequence version 1.
DT 27-MAR-2024, entry version 14.
DE SubName: Full=Putative repeat protein (TIGR02543 family) {ECO:0000313|EMBL:RED75209.1};
GN ORFNames=DFP98_11611 {ECO:0000313|EMBL:RED75209.1};
OS Cohnella phaseoli.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Cohnella.
OX NCBI_TaxID=456490 {ECO:0000313|EMBL:RED75209.1, ECO:0000313|Proteomes:UP000256977};
RN [1] {ECO:0000313|EMBL:RED75209.1, ECO:0000313|Proteomes:UP000256977}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CECT 7287 {ECO:0000313|EMBL:RED75209.1,
RC ECO:0000313|Proteomes:UP000256977};
RA Whitman W.;
RT "Genomic Encyclopedia of Type Strains, Phase III (KMG-III): the genomes of
RT soil and plant-associated and newly described type strains.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Secreted, cell wall, S-layer
CC {ECO:0000256|ARBA:ARBA00004237}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RED75209.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QRDZ01000016; RED75209.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3D9JN31; -.
DR OrthoDB; 663332at2; -.
DR Proteomes; UP000256977; Unassembled WGS sequence.
DR GO; GO:0030115; C:S-layer; IEA:UniProtKB-SubCell.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:InterPro.
DR CDD; cd08548; Type_I_cohesin_like; 1.
DR Gene3D; 2.60.40.2700; -; 1.
DR Gene3D; 2.60.40.680; -; 1.
DR Gene3D; 2.60.40.4270; Listeria-Bacteroides repeat domain; 2.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR002102; Cohesin_dom.
DR InterPro; IPR013378; InlB-like_B-rpt.
DR InterPro; IPR042229; Listeria/Bacterioides_rpt_sf.
DR InterPro; IPR001119; SLH_dom.
DR NCBIfam; TIGR02543; List_Bact_rpt; 1.
DR PANTHER; PTHR43308:SF4; OUTER MEMBRANE PROTEIN ALPHA; 1.
DR PANTHER; PTHR43308; OUTER MEMBRANE PROTEIN ALPHA-RELATED; 1.
DR Pfam; PF00963; Cohesin; 1.
DR Pfam; PF09479; Flg_new; 2.
DR Pfam; PF00395; SLH; 3.
DR SUPFAM; SSF49384; Carbohydrate-binding domain; 1.
DR PROSITE; PS51272; SLH; 3.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022601};
KW S-layer {ECO:0000256|ARBA:ARBA00022601};
KW Secreted {ECO:0000256|ARBA:ARBA00022601}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..27
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 28..959
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017694708"
FT DOMAIN 770..829
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 830..893
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 900..959
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
SQ SEQUENCE 959 AA; 101764 MW; A6F62CC21A4656CB CRC64;
MNNRMKTSLI AVLTFCLFLS LCSVAAAAPA KITFNLSNTA GEPGEEVAVN LTVDIPGDGF
YYTYLPFVDW ELPTSGYRFG SWVNGTSITI FLTIDENAPS GSYSVFMDRE NTEVNKGPTS
NTVNQPVNDY VDVAGTVTVH KQATVGTVSG TAGSLVNVPV TTNSTEPIGY YDLTIPYDPN
ALEVSVVTAT YGGNLDYQDE SGTLRVTWDG QADNTSIVNG QLFNVAFKIK TDSPLGDKAL
AIASGAVFKN TTGSVMTPFY TRPGSAKVTS NLAPVAENVS VTGTEVVGQT MTGTYSYSDH
EEDAEDASMF KWYRADTAGG ANEQLISGES EGDYTLQPAD KGKYIRFEVT PAASAGTSPG
VPVKSGFTGQ IAAPTYPVAY DGNGATFGSL PTTPQGYEEG DVVAVLGNVG SLENPHHSFA
GWTLDSLNSG SVYLPGDTLT IGTEPITLYA KWVVNRYTVS FHSNGGSAIE SVQIDYDTAF
AAPTRPSRSG YSFVGWYKDE QLTEAWSFDS DKIAGDTDLY AKWSRNSESS GGGGGPTVPA
AQEGDSFKVK INGKEGIVGT LTTAVDKGRT VSLVSLDRTK LAALLESGDE GFKLAIPIAG
AAASVTVELN GESVRTLESK QAILEFKTDK GSYVLPVSQL GIQEIARQLG ASADLQQIKL
RIEMADSSEA IANAFNGAAS KGGFTIVARP LEFHVTAVNG SRSAEISVFS TFVEREIALP
DGASADRVTT GVAVDPDGTA RHVPTKIVQE NGRQTARIHS LTNSSYAVIW NPIEFKDVAD
HWAKDAVNDM GSRLVIGGVG EERFSPDLDI TRAEFAAIVV RALGLKPVQG ISSYPDVDSK
AWYQGEVEAA KAYGLVNGYS DGAFRPAERI TREQALVILA QAMKLTGLKD KLQSSAQPSI
DGFADADQIS AWAKASVSES LQAGLVNGRK DNRLAPKSFI SRAEVAVLVQ RLLKESDLI
//