ID R9KM76_9FIRM Unreviewed; 2712 AA.
AC R9KM76;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 37.
DE RecName: Full=Bacterial repeat domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=C810_00624 {ECO:0000313|EMBL:EOS47540.1};
OS Lachnospiraceae bacterium A2.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=397290 {ECO:0000313|EMBL:EOS47540.1, ECO:0000313|Proteomes:UP000014150};
RN [1] {ECO:0000313|EMBL:EOS47540.1, ECO:0000313|Proteomes:UP000014150}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=A2 {ECO:0000313|EMBL:EOS47540.1,
RC ECO:0000313|Proteomes:UP000014150};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium A2.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOS47540.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASSX01000005; EOS47540.1; -; Genomic_DNA.
DR STRING; 397290.C810_00624; -.
DR PATRIC; fig|397290.3.peg.692; -.
DR eggNOG; COG3210; Bacteria.
DR eggNOG; COG5263; Bacteria.
DR eggNOG; COG5492; Bacteria.
DR HOGENOM; CLU_230299_0_0_9; -.
DR OrthoDB; 1864276at2; -.
DR Proteomes; UP000014150; Unassembled WGS sequence.
DR Gene3D; 2.10.270.10; Cholin Binding; 1.
DR InterPro; IPR044060; Bacterial_rp_domain.
DR InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR006626; PbH1.
DR InterPro; IPR001119; SLH_dom.
DR InterPro; IPR041248; YDG.
DR Pfam; PF01473; Choline_bind_1; 2.
DR Pfam; PF18998; Flg_new_2; 3.
DR Pfam; PF00395; SLH; 3.
DR Pfam; PF18657; YDG; 2.
DR SMART; SM00710; PbH1; 6.
DR SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR PROSITE; PS51170; CW; 2.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS51272; SLH; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000014150};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..2712
FT /note="Bacterial repeat domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039157128"
FT DOMAIN 1681..1775
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT DOMAIN 2445..2504
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 2506..2569
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 2572..2635
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT REPEAT 2654..2673
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 2675..2694
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REGION 83..109
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 134..157
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2184..2248
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 134..148
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2184..2217
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2225..2248
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2712 AA; 279097 MW; DA5AC64BAA14A301 CRC64;
MRNKLRRPLA FLLSAVMVVT MSGTPVHAVA DRGQPETGLC EHHTAHTDDC GYTEETPGTP
CGHEHTEDCY TEVTECVHEH TPECYPEETE DSVSDNEATP ANAEEREPEN CPHICDGESG
CITEKLDCRH EHKVNGGEAD REGGLGRDSE CGYTESTPGT PCTYVCEICN PQDSGEADEE
PETGIIKQEQ CSCLTLCTEG QINPDCLVCG AENAGLSDCK GKAEKEDTKQ PEDTGICKHH
QEHDDACGYQ PESEDSEGSP CTYECRICPI EDLIAALPDK VTEDNAEDVR AQIDKILALF
GELTEDEQEQ IDLSRCYELQ GALDGANDPD PITESVEYQE ASWDGSQVTY ESKTETCTLV
ENSAEAVTWT AGWYAVSGTV TIDQPITVNG EVHLILTNGC TLTAEKGIVV TSTNSLTIYA
QSENGGTLNA TGMTDDSGNA SAGIGGSTSS VDSGSITIHG GIINVTGGGQ SGRYGGAGIG
GGTSSSGNGG SSRGIVTIYG GTITANSGAG NVAGAGIGGG GGGNGRNGGD GGGITIYGGS
ITATSRGTDS GGAGIGGGAG NNLNGGAGNN IQINGGMVHA TGGNLGAGIG GGGGSESGDG
TVTISGGTVT AVGGNYAAGI GGGGGYQNTY GGCTGGTGSV TITGGIVDAS SPTNVYWEGY
EGAPIGNGGN ATATATVNKT TGIIFENGVG TVCGDVTLAG SYTVPAGYSL HIPAGASLSG
SGTLSGGETF TTDLSEDMVS VPTDLYYNGQ DRSNDIKTRL SDGLTQGIAI CGQTFAVSGW
TVAVSRTDDL HYTATYTNTD NSTTFQKTIT LQKSGTDLTS EGKVQTYKGD TLTKDFTASD
TITVKATPTA TGQAPAKAAA RLRGDPTAGQ MVVFVGDTQV CAPADVGADG SYTMNVSAAD
VLVAAGGPGT GITLTAKFVG NDNMADGAGT VDVSISAVAK IENGSTTTYV GNLDDAFKTE
NDGATITLLD DVTRATTLRI QINCTLDLGG HTITFTDSGN VWVQGSVTAM TIRGEGEIIS
EQSHALVVAG SVTLEGGTFT SNKGDYAGVY VNGGTLSVTG KNVVIQNTGS GYGLTVNNAQ
SVQLSGGTYS GTAGAISIVG GSLTLGGLLP QGGDTRYAYF DESGTTPFTG VLGNKSLTGT
VTVKKCNHTG EGVCEYTPNE GAETHAMTCL ACGYAGAAES CAYSDDYGHD ETNHWQTCTL
CGGKKTEAHG WVHQCTSATG IIRRSCDKCE IETVVGTVSI TPDFSVTYGK TGSATLVCTA
ELADGYSLEP ADSADNCWVL MALSDGKSWN LGRELEVKLP ADLPAGEYWY DAYPRLSYQG
NNTIVKRFNV IGKVTVTPAP LTVTGAAAKD RTYDGTNSVQ ITGVTLDGVL NSDNVSVDLT
GLTGTLSGSD AETYTSVTLP RLTLTGGAAS NYTLPQSMGA VPASVTISKA TLTATGATVA
SKTYDGNTAA SVSSVTFTGL VPGEALALGT DYTATGTFAD ANAGTDKSVS VAVALKDSAK
ANNYNLTNGS VNATGTIAKA SSSITTAPSA TGITYGQALS DSTLTGGTGS VPGAFAWTDS
TAKPNAGTAQ FEVTFTPTDT NYNSTTTNVS VTVAKATPTL TAPTATAIQY GQKLTDSALT
GGTATNPNGS AAIAGSWHWA SGNTQPTATG TFPVSFAPSD TANYNTPANV DTSVTVNPAA
PKISLTVPAY QVAGEDVIVT CTVENPHDAT FKEGIPENIT LTYQIGSGTP QTITNGKFSI
PAGTKKDTVI TVTASTDAVN GKYTAATKTA TVTVTDKIPV EISGISVTGR VYNGQPVNYT
GTPVVKKLDG TVVTDALVSY TWSSVTAPVN AGDYSLVVAV GGGKYIGSTT IPVVIEQAEI
RVTAPSKTIY VGETAPVFSA ADCNITGLVQ GENLKTPPTV AYAEAPDTSK TGSVTVTASG
AEVPEGGNYK DRIVYENGTL TITSKPLPPA KYTITVQAGT GGTASASPTS AEKGTKITLN
ATPDGGYHFK EWQVISGGVT ISNNSFTMPA TNVTVKAVFE KDSDPPQATR YSVTVQAGTG
GTASASPTSA EKGTKITLTA TPDSGYHFKE WQVISGGVTI SNNSFTMPDT NVTVKAVFEK
DGSTPPQPTR YSVTVQAGTG GTASASPTSA EKGTKITLTA TPDSGYHFKE WQVISGGVTI
SNNSFTMPDT NVTVKAVFVK NSNGGGNSGG SSSGGGSSSG GGSSSGDNGS SGGGSTIVAR
PDETKPDTPT TSQTKPATPD KNGNVAVDNG TVQSAINTAK NDAKKNGNTA NGVAVVIPVT
PKEGQNSFNV TINAQTLNTL VREKVKRLEI NIEGVVVGGM DTKLLKWLDT LSANGDVIFR
VKKTDPSGLS KEAKAAIGTR PVYDLSLVYL SGGKETPITD FDGHTIAVRM PYAPAKDEKT
GNLYAVYVDG KGKVEWLTKS SYDPDLGTVV FETGHFSIYG IGYKNPVPVF TDIKNHWAED
NIIFVASRGL LAGTGNNQFS PDTGMTRGMF VTALGRLADI DPNSYKTGRF TDVKADAYYA
PYVNWAAEKG IVNGTSATTF SPDTNITREQ MAVIMANYAK KLGYDLPVAH EAVTFADNAQ
ISGWAAKEVK AMQQAGILAG KGGNRFDPKG NATRAEVATV LRRFVEIVID PQTAQGWMQN
HSGSWQYMKN GKPVTGWLQD NKKWYWLDNN GWMFASGWKQ IDGKWYYFYP DGSMAVSTTI
DGYTIGPDGA RK
//