ID A0A1Y4LC50_9FIRM Unreviewed; 1252 AA.
AC A0A1Y4LC50;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 21.
DE RecName: Full=Bacterial repeat domain-containing protein {ECO:0000259|Pfam:PF18998};
GN ORFNames=B5F19_11610 {ECO:0000313|EMBL:OUP54307.1};
OS Pseudoflavonifractor sp. An184.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Pseudoflavonifractor.
OX NCBI_TaxID=1965576 {ECO:0000313|EMBL:OUP54307.1, ECO:0000313|Proteomes:UP000196588};
RN [1] {ECO:0000313|Proteomes:UP000196588}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=An184 {ECO:0000313|Proteomes:UP000196588};
RA Medvecky M., Cejkova D., Polansky O., Karasova D., Kubasova T., Cizek A.,
RA Rychlik I.;
RT "Function of individual gut microbiota members based on whole genome
RT sequencing of pure cultures obtained from chicken caecum.";
RL Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OUP54307.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NFKI01000035; OUP54307.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Y4LC50; -.
DR Proteomes; UP000196588; Unassembled WGS sequence.
DR Gene3D; 2.160.20.10; Single-stranded right-handed beta-helix, Pectin lyase-like; 1.
DR InterPro; IPR044060; Bacterial_rp_domain.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR006626; PbH1.
DR InterPro; IPR012334; Pectin_lyas_fold.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR Pfam; PF18998; Flg_new_2; 2.
DR SMART; SM00710; PbH1; 6.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
DR SUPFAM; SSF51126; Pectin lyase-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000196588};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..32
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 33..1252
FT /note="Bacterial repeat domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012892875"
FT DOMAIN 317..354
FT /note="Bacterial repeat"
FT /evidence="ECO:0000259|Pfam:PF18998"
FT DOMAIN 607..669
FT /note="Bacterial repeat"
FT /evidence="ECO:0000259|Pfam:PF18998"
FT REGION 44..82
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 63..82
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1252 AA; 135561 MW; D4A3C7E22C9AA85F CRC64;
MTRKKQKLRK RIGSLLLCMS LVSSIMGMPA YAAYTEDTED MSEAVVSEAP EDVTLPTDSE
TGAEETDAVE AEDVGTAEAE EETHVEEILE ADENNVANVS ASHIDTLDRV LIEENFDQGN
QGYWDVLNSN GTVSFTGGKA VFNGSGPCNR IGYNGDIIDA DDFLIQLNMT LNEGNTNSNA
KIAFKTADQY EGDRLQVRFN FVQGGIYVER TQNNSVINFT KVWSQGGDFT WETGKTYAIN
VLVEGDQITV YVDGTEIVSV TNADIQSMEK GYFAIAGQYP AQNFSIDDLC ISTDEELSGE
LYTVTLQTAT NGVIDSVPGS GGGTLTADKD SGYDGEKVTL SPVPAHNYVF DSYGFFLENG
TSADGLITVI NNTITLDSKF GNITVVAYFV SREPGAYELF YDDFAASEID DAYKTVGDLS
YIEQNSGELT IDAVANGSNY LLLDQSVFDE MTADGDYRIS TDVKKGNATN GTMQIMFKGE
DTSINGRYVL VLNGGAAVFR YIDPSNNTNI QLAATNFTFS NEYVHADVEV ISDTVTFYAD
DREILSYTTD DNWKNADNCV GLINMTGGAP VVFDELLVEQ IPSAVDITLT VKLEENGSQT
VDTDYVSGTV TADKTSAVSG ETVTLTVVEK AGYQLKSITV NGEEISNLTF TVPSTVTDSL
DIVAVFEPAQ LRTPRNFYID SENGDDSNSG TIDAPWKTLS QLEEYSQTYA LVPGDQVLLK
RGSVFENESL QFSGMGTEQN PIVISAYGDG EDLPRLDGNG VVENVISLFN QEYITISNLE
ITNTSPNYNS QFGLNTSTNT SLALRAINVS AKDFGVVSGI KIQDCYIHDI NGNINLKWNG
GIFFDVQADV IGGELSGIPT KYDDVLIEGC TFINVDRSGI KLVNSAWCNQ WLPNSPDIPL
NWYPSTNVVV RNNYMEKIGG DGITTRDTDG ALIEYNLAKD CRYQNTGYNV GIWPFEAANT
VIQYNEAYNT HGTQDGQGLD CDHASSYSLM QYNYSHDNEG GFMLIMGGYP HTAPTVRYNI
SQNDCDKTFE FAQGIPKGTM IYNNTIYSDQ IVSRGVLFLS NTAAGLGVND FYMFNNLFCY
PDGQTFYGGG NASNITDLTN KGKLYNNAYV GGISAPVADT GAVVVADIDT VLVNAGSGPD
SNPSTTPITG ASGELDGYQL VAGSPMIDAG ATMAESVTHF GGAMNEIVDG TAQSPNELYY
QAQSADSIDY IMGEYFPEIA GVDYEVDFFG NSLYAGNGPD IGAAEYLNNS QS
//