ID R9N6S6_9FIRM Unreviewed; 2286 AA.
AC R9N6S6;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 38.
DE RecName: Full=SLH domain-containing protein {ECO:0000259|PROSITE:PS51272};
GN ORFNames=C817_03506 {ECO:0000313|EMBL:EOS78676.1};
OS Dorea sp. 5-2.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae; Dorea.
OX NCBI_TaxID=1235798 {ECO:0000313|EMBL:EOS78676.1, ECO:0000313|Proteomes:UP000014211};
RN [1] {ECO:0000313|EMBL:EOS78676.1, ECO:0000313|Proteomes:UP000014211}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=5-2 {ECO:0000313|EMBL:EOS78676.1,
RC ECO:0000313|Proteomes:UP000014211};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Dorea bacterium 5-2.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOS78676.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASTD01000064; EOS78676.1; -; Genomic_DNA.
DR RefSeq; WP_016220198.1; NZ_KE159716.1.
DR STRING; 1235798.C817_03506; -.
DR PATRIC; fig|1235798.3.peg.3691; -.
DR eggNOG; COG2755; Bacteria.
DR eggNOG; COG5263; Bacteria.
DR eggNOG; COG5492; Bacteria.
DR HOGENOM; CLU_230299_0_0_9; -.
DR OrthoDB; 1864276at2; -.
DR Proteomes; UP000014211; Unassembled WGS sequence.
DR Gene3D; 2.10.270.10; Cholin Binding; 1.
DR InterPro; IPR044060; Bacterial_rp_domain.
DR InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR InterPro; IPR001119; SLH_dom.
DR InterPro; IPR041248; YDG.
DR Pfam; PF19127; Choline_bind_3; 1.
DR Pfam; PF18998; Flg_new_2; 1.
DR Pfam; PF00395; SLH; 3.
DR Pfam; PF18657; YDG; 1.
DR SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR PROSITE; PS51170; CW; 2.
DR PROSITE; PS51272; SLH; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000014211};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..2286
FT /note="SLH domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039222345"
FT DOMAIN 2019..2079
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 2080..2143
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 2146..2209
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT REPEAT 2228..2247
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 2249..2268
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REGION 87..107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1757..1822
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1757..1791
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1799..1822
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2286 AA; 235400 MW; 9DBAB50862DD7D90 CRC64;
MRNKLRRPLA FLLSAVMIVT MSGTPVHAVA DRGQPETGLC EHHAAHTDDC GYTEETPGTP
CGHEHTEDCY TEVTECVHEH TSECYPEETE DSVSDNEATP ANAKEREPEN CPHICDGESG
CITEKLDCRH EHDSECGYTE STPGTPCTYV CEICNPQDSG EADEEPETGI IKQEQCSCLT
LCTEGQINPD CPVCGAEDAG LSDCKGKAEK EDTKQPEDTG ICKHHQEHDD ACGYQPESED
SEGSPCTYEC RICPIEDLIA ALPDKVTEDN AEDVRAQLDK ILALFGELTE DEQEQIDLSR
CYELQGALDG ANDPAPIPES VEYQEASWDG SQVTYESKTE TCTLVENSAE AVTWTTGWYA
VSGTVTIDQP ITVTGDVSLI LTDGCTLNAS QGIVVTAGNS LTIYAQSGGT GTLNATGTTD
SGNNASAGIG GSTETLDSGS ITIHGGVINA TSGASRWYSG AGIGGSTPSS GNGGNSGAIK
IYGGTITAES TGYTTGAGIG GGGSGGTGDG GAGTNIIIYG GSITATSHST GNGGAGIGGG
SGQQNGGTGD AIAIHGGVVR ATGGSLGAGI GGGGGSKKSG DGTVTISGGT VTAVGGSNAA
GIGGGGGYSG DYGMNITGGT GSVTITGGIV DATENGGGAS IGNGGNASGT ATVNKTTGIV
FENGVGAVCG DVTFDGSYEV PTGYTLNIPA GASLSGSGTL TGGGTFTADL SEDMVSVPTN
LYYDGTDRTA KLQSDLSAEL NKGIEICGQP FTVSGWTLAV AKAADSDLTY TATYTNNNDN
TNTFTKTITL QKSGTDLTSE GKVQTYKGDT LTKDFTASDT ITVKATPTAT GEAPAKAAAR
LRGDPTAGQM AVFVGTTQVS VSADKGEDGS YTMTVSAADV LLAAGGPGTE IPLTAKFVGN
DNMADGAGTA TVNITAVAKA ERGGTVIGYY GESGLNAALA DKNNEGAVFT LLDNMERETE
LNIKISCTLD LNGKTIHSTG EYASALGISS GATATIRGAG TVLSEHRHGL SVGGTATLEG
GTFISGAADY SGVYVRVGKL IVTGEAVVIE NTGGGTGLGV NSGTGAIAQL SAGTCEGGAA
AVKIYGSTGT LAGLLNQAGT SRVAYYKDNT TLVTEGLDGQ ALPGGSYTVK ACNHTGEGVC
EYTPNEGAET HAMTCLACGY AGAAEDCQYT YTPDSGGEHH ITACQFCGRA KTEAHTYGSW
TPEHNENAVT IRRSCENCGY TKTDGTFTAP AHGQTMQYGK PITLTCSSTL PGATFSWSLI
GSTAEGTGAS FTLPETLAVG SYLYHVTVKW PGSESNICTE MDIFEVTPAP LTATGATAEG
RAYDGTNSVE ITGVTLAGIL NGDDVSVDTT GLTGTLNGSN AGTYTNLTLP AMTLTGADAE
NYTVTRPAGA VPASVTIRKA APLTPKTGDL AVANKQEHTY TYGLGALRPD VPEGMSLGST
AVTYELGPVN LGSYYDSGAM IDGQTLTLPI KAVESDSETK IGTITVTIHT QNFEDMTATI
DVRSVNKQSV DISGVTLTGR TYNGSPIEYQ QTATASVDGK TVNVNGFVYT WDTPNHAAPV
NAGNYTLTVS VDPEDRNYTG STTIPVVIEQ AEIRVTAPSK TIYVGETAPV FSAADCNITG
LVQGENLKTP PTVAYAEAPD TSKTGSVTVT ASGAEVPEGG NYKDRIVYEN GTLTITSKPL
PPAKYTVTVQ AGKGGTASAS PSSAEKGTKI TLNATPDGGY HFKEWQVISG GVTISNNSFT
MPAANVTVKA VFERNSNGGG NSGGSGGGGG SSSGGGSSSG DNGSSGGGST IVARPDETKP
DTPTTSQTKP ATPDKNGNVA VDNGTVQSAI NTAKNDAKKN GNTANGVAVV IPVTPKEGQN
SFNVTINAQT LNTLVREKVK RLEINIEGVV VGGMDTKLLK WLDTLSANGD VIFRVKKTDP
SGLSKEAKAA IGTRPVYDLS LVYLSGGKET PITDFDGHTI AVRLPYAPAK DEKTGNLYAV
YVDGKGKVEW LTKSSYDPDL GTVVFETGHF SIYGIGYKNP VPVFTDIKNH WAEDNIIFVA
SRGLLAGTGN NQFSPDTGMT RGMFVTALGR LAGIDQADYK TGKFTDVKAD AYYAPYVNWA
AEKGIVNGTS ATTFSPDTNI TREQMAVIMA NYAKKLGYDL PVAHDAVTFA DNAQISGWAA
KEVKAMQQAG IMAGKGGNRF DPKGTATRAE VATVLRRFVE IVIDPQTAQG WMQNHSGSWQ
YLKNGKPVTG WLQDDKKWYW LDSNGWMFAG GFKQIDGKWY YFYADGSMSA NTTIDGYTIG
PDGARK
//