GenomeNet

Database: UniProt
Entry: R9KM76_9FIRM
LinkDB: R9KM76_9FIRM
Original site: R9KM76_9FIRM 
ID   R9KM76_9FIRM            Unreviewed;      2712 AA.
AC   R9KM76;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   24-JAN-2024, entry version 37.
DE   RecName: Full=Bacterial repeat domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=C810_00624 {ECO:0000313|EMBL:EOS47540.1};
OS   Lachnospiraceae bacterium A2.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX   NCBI_TaxID=397290 {ECO:0000313|EMBL:EOS47540.1, ECO:0000313|Proteomes:UP000014150};
RN   [1] {ECO:0000313|EMBL:EOS47540.1, ECO:0000313|Proteomes:UP000014150}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=A2 {ECO:0000313|EMBL:EOS47540.1,
RC   ECO:0000313|Proteomes:UP000014150};
RG   The Broad Institute Genomics Platform;
RG   The Broad Institute Genome Sequencing Center for Infectious Disease;
RA   Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA   Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA   Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA   Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA   Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA   Birren B.;
RT   "The Genome Sequence of Lachnospiraceae bacterium A2.";
RL   Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EOS47540.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ASSX01000005; EOS47540.1; -; Genomic_DNA.
DR   STRING; 397290.C810_00624; -.
DR   PATRIC; fig|397290.3.peg.692; -.
DR   eggNOG; COG3210; Bacteria.
DR   eggNOG; COG5263; Bacteria.
DR   eggNOG; COG5492; Bacteria.
DR   HOGENOM; CLU_230299_0_0_9; -.
DR   OrthoDB; 1864276at2; -.
DR   Proteomes; UP000014150; Unassembled WGS sequence.
DR   Gene3D; 2.10.270.10; Cholin Binding; 1.
DR   InterPro; IPR044060; Bacterial_rp_domain.
DR   InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR   InterPro; IPR007110; Ig-like_dom.
DR   InterPro; IPR006626; PbH1.
DR   InterPro; IPR001119; SLH_dom.
DR   InterPro; IPR041248; YDG.
DR   Pfam; PF01473; Choline_bind_1; 2.
DR   Pfam; PF18998; Flg_new_2; 3.
DR   Pfam; PF00395; SLH; 3.
DR   Pfam; PF18657; YDG; 2.
DR   SMART; SM00710; PbH1; 6.
DR   SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR   PROSITE; PS51170; CW; 2.
DR   PROSITE; PS50835; IG_LIKE; 1.
DR   PROSITE; PS51272; SLH; 3.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000014150};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..28
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           29..2712
FT                   /note="Bacterial repeat domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5039157128"
FT   DOMAIN          1681..1775
FT                   /note="Ig-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50835"
FT   DOMAIN          2445..2504
FT                   /note="SLH"
FT                   /evidence="ECO:0000259|PROSITE:PS51272"
FT   DOMAIN          2506..2569
FT                   /note="SLH"
FT                   /evidence="ECO:0000259|PROSITE:PS51272"
FT   DOMAIN          2572..2635
FT                   /note="SLH"
FT                   /evidence="ECO:0000259|PROSITE:PS51272"
FT   REPEAT          2654..2673
FT                   /note="Cell wall-binding"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT   REPEAT          2675..2694
FT                   /note="Cell wall-binding"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT   REGION          83..109
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          134..157
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2184..2248
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        134..148
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2184..2217
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2225..2248
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   2712 AA;  279097 MW;  DA5AC64BAA14A301 CRC64;
     MRNKLRRPLA FLLSAVMVVT MSGTPVHAVA DRGQPETGLC EHHTAHTDDC GYTEETPGTP
     CGHEHTEDCY TEVTECVHEH TPECYPEETE DSVSDNEATP ANAEEREPEN CPHICDGESG
     CITEKLDCRH EHKVNGGEAD REGGLGRDSE CGYTESTPGT PCTYVCEICN PQDSGEADEE
     PETGIIKQEQ CSCLTLCTEG QINPDCLVCG AENAGLSDCK GKAEKEDTKQ PEDTGICKHH
     QEHDDACGYQ PESEDSEGSP CTYECRICPI EDLIAALPDK VTEDNAEDVR AQIDKILALF
     GELTEDEQEQ IDLSRCYELQ GALDGANDPD PITESVEYQE ASWDGSQVTY ESKTETCTLV
     ENSAEAVTWT AGWYAVSGTV TIDQPITVNG EVHLILTNGC TLTAEKGIVV TSTNSLTIYA
     QSENGGTLNA TGMTDDSGNA SAGIGGSTSS VDSGSITIHG GIINVTGGGQ SGRYGGAGIG
     GGTSSSGNGG SSRGIVTIYG GTITANSGAG NVAGAGIGGG GGGNGRNGGD GGGITIYGGS
     ITATSRGTDS GGAGIGGGAG NNLNGGAGNN IQINGGMVHA TGGNLGAGIG GGGGSESGDG
     TVTISGGTVT AVGGNYAAGI GGGGGYQNTY GGCTGGTGSV TITGGIVDAS SPTNVYWEGY
     EGAPIGNGGN ATATATVNKT TGIIFENGVG TVCGDVTLAG SYTVPAGYSL HIPAGASLSG
     SGTLSGGETF TTDLSEDMVS VPTDLYYNGQ DRSNDIKTRL SDGLTQGIAI CGQTFAVSGW
     TVAVSRTDDL HYTATYTNTD NSTTFQKTIT LQKSGTDLTS EGKVQTYKGD TLTKDFTASD
     TITVKATPTA TGQAPAKAAA RLRGDPTAGQ MVVFVGDTQV CAPADVGADG SYTMNVSAAD
     VLVAAGGPGT GITLTAKFVG NDNMADGAGT VDVSISAVAK IENGSTTTYV GNLDDAFKTE
     NDGATITLLD DVTRATTLRI QINCTLDLGG HTITFTDSGN VWVQGSVTAM TIRGEGEIIS
     EQSHALVVAG SVTLEGGTFT SNKGDYAGVY VNGGTLSVTG KNVVIQNTGS GYGLTVNNAQ
     SVQLSGGTYS GTAGAISIVG GSLTLGGLLP QGGDTRYAYF DESGTTPFTG VLGNKSLTGT
     VTVKKCNHTG EGVCEYTPNE GAETHAMTCL ACGYAGAAES CAYSDDYGHD ETNHWQTCTL
     CGGKKTEAHG WVHQCTSATG IIRRSCDKCE IETVVGTVSI TPDFSVTYGK TGSATLVCTA
     ELADGYSLEP ADSADNCWVL MALSDGKSWN LGRELEVKLP ADLPAGEYWY DAYPRLSYQG
     NNTIVKRFNV IGKVTVTPAP LTVTGAAAKD RTYDGTNSVQ ITGVTLDGVL NSDNVSVDLT
     GLTGTLSGSD AETYTSVTLP RLTLTGGAAS NYTLPQSMGA VPASVTISKA TLTATGATVA
     SKTYDGNTAA SVSSVTFTGL VPGEALALGT DYTATGTFAD ANAGTDKSVS VAVALKDSAK
     ANNYNLTNGS VNATGTIAKA SSSITTAPSA TGITYGQALS DSTLTGGTGS VPGAFAWTDS
     TAKPNAGTAQ FEVTFTPTDT NYNSTTTNVS VTVAKATPTL TAPTATAIQY GQKLTDSALT
     GGTATNPNGS AAIAGSWHWA SGNTQPTATG TFPVSFAPSD TANYNTPANV DTSVTVNPAA
     PKISLTVPAY QVAGEDVIVT CTVENPHDAT FKEGIPENIT LTYQIGSGTP QTITNGKFSI
     PAGTKKDTVI TVTASTDAVN GKYTAATKTA TVTVTDKIPV EISGISVTGR VYNGQPVNYT
     GTPVVKKLDG TVVTDALVSY TWSSVTAPVN AGDYSLVVAV GGGKYIGSTT IPVVIEQAEI
     RVTAPSKTIY VGETAPVFSA ADCNITGLVQ GENLKTPPTV AYAEAPDTSK TGSVTVTASG
     AEVPEGGNYK DRIVYENGTL TITSKPLPPA KYTITVQAGT GGTASASPTS AEKGTKITLN
     ATPDGGYHFK EWQVISGGVT ISNNSFTMPA TNVTVKAVFE KDSDPPQATR YSVTVQAGTG
     GTASASPTSA EKGTKITLTA TPDSGYHFKE WQVISGGVTI SNNSFTMPDT NVTVKAVFEK
     DGSTPPQPTR YSVTVQAGTG GTASASPTSA EKGTKITLTA TPDSGYHFKE WQVISGGVTI
     SNNSFTMPDT NVTVKAVFVK NSNGGGNSGG SSSGGGSSSG GGSSSGDNGS SGGGSTIVAR
     PDETKPDTPT TSQTKPATPD KNGNVAVDNG TVQSAINTAK NDAKKNGNTA NGVAVVIPVT
     PKEGQNSFNV TINAQTLNTL VREKVKRLEI NIEGVVVGGM DTKLLKWLDT LSANGDVIFR
     VKKTDPSGLS KEAKAAIGTR PVYDLSLVYL SGGKETPITD FDGHTIAVRM PYAPAKDEKT
     GNLYAVYVDG KGKVEWLTKS SYDPDLGTVV FETGHFSIYG IGYKNPVPVF TDIKNHWAED
     NIIFVASRGL LAGTGNNQFS PDTGMTRGMF VTALGRLADI DPNSYKTGRF TDVKADAYYA
     PYVNWAAEKG IVNGTSATTF SPDTNITREQ MAVIMANYAK KLGYDLPVAH EAVTFADNAQ
     ISGWAAKEVK AMQQAGILAG KGGNRFDPKG NATRAEVATV LRRFVEIVID PQTAQGWMQN
     HSGSWQYMKN GKPVTGWLQD NKKWYWLDNN GWMFASGWKQ IDGKWYYFYP DGSMAVSTTI
     DGYTIGPDGA RK
//
DBGET integrated database retrieval system