ID R6PLW4_9FIRM Unreviewed; 1182 AA.
AC R6PLW4;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Multidomain protein with s-layer homology region glug motif ig motif i-set domain pkd domain {ECO:0000313|EMBL:CDC19416.1};
GN ORFNames=BN582_00243 {ECO:0000313|EMBL:CDC19416.1};
OS Eubacterium sp. CAG:274.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Eubacteriaceae;
OC Eubacterium.
OX NCBI_TaxID=1262888 {ECO:0000313|EMBL:CDC19416.1, ECO:0000313|Proteomes:UP000017904};
RN [1] {ECO:0000313|EMBL:CDC19416.1, ECO:0000313|Proteomes:UP000017904}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:274 {ECO:0000313|Proteomes:UP000017904};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDC19416.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBEX010000041; CDC19416.1; -; Genomic_DNA.
DR AlphaFoldDB; R6PLW4; -.
DR STRING; 1262888.BN582_00243; -.
DR Proteomes; UP000017904; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR Gene3D; 2.60.40.680; -; 1.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR001119; SLH_dom.
DR PANTHER; PTHR21559:SF21; DYSTROGLYCAN 1; 1.
DR PANTHER; PTHR21559; DYSTROGLYCAN-RELATED; 1.
DR Pfam; PF00395; SLH; 3.
DR SUPFAM; SSF49384; Carbohydrate-binding domain; 2.
DR PROSITE; PS51272; SLH; 3.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000017904};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..32
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 33..1182
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004415434"
FT DOMAIN 951..1014
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1015..1073
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT DOMAIN 1078..1141
FT /note="SLH"
FT /evidence="ECO:0000259|PROSITE:PS51272"
FT REGION 618..644
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 819..914
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1147..1182
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 819..881
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 893..914
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1182 AA; 122169 MW; E3B19C9D7E234CA6 CRC64;
MKKSFKKIVS AVLAASMTLT SVMVVNVAPV TAAEAFSGKA TINAETVKTL FASGLTDGNE
LGSGVSIVNG GSAFTYKANN LKFTDGSAYS TEFQIGNGNE KLSQTLKAGD DLGGAVEKGL
KIEAAKAGTI KIYAGLGSSG VMEAPLCLVD ATTNKVVDAQ SLVNTGDAKS KVYNTPTFTV
PAAGTYYLVQ NGKISSVNLA VIELDLNGGS APTTETTTTV TTTETTTETT TVAPTTTTET
TTEVSTEAVT AKVVVDKATV KVGETVNANV ALVGNKGFNN YTAFVEFDPT IVEPVDAING
TITKEVAGVT YTASDAAALK GQFANVPAVG NTDFTGADGA KTQAQLGKVK YAYVVPQQMV
NGTALQEFNT DGNLFGISFK AIKAGDANIK VTFVGNQLST VSSDATIAKT ATVETVDAAV
KVEEGSTPAP VDGFKGQATI DSNAINASGV TSLKGGDSLL ADGGKNYVQI KLAAGASAFT
FKSNITNFTD GKSFDAEFKK FQVGKGNELI PADLTVGSDV TADVEKGISM KLGGSGTVTI
HAGLGSSDAM SVPVYLINTT TNKVVSISNI DRDAATANLF SVCTLSVPEA GEYVVVQPGK
GSSLNICTID LNIAGGSGSD DTTTTTEVTT TTETTTETTT TTETTETTTL GADGGLGFLA
DKTVKTGDTE VTLDFGAKSI NTAMSAFTGF IVYDASKVEM VSAEGNNTGI VNMINEQIAY
APSASNPDYP GANGTLTTAQ LGKMKVAYIY SDGSAISSIG TPDATLASVK FTIKNVNAGD
VIPVQFVSTE SDGSVAYTVD GKLATVGLNA NIIVAGGSTE STTETTTVST SESTSETTTV
STTETTTAAP VVPTTETTTA PTTTTTTEAS SETTTKRTSS GGGGGGGGGA SVKKTTAATT
EVTEETTSKV VEGSTEATTN QIVISDNGEV KIVTPDGTEV KVNVPKDVVN SDVNFSDLGN
YGWAKDYINK LAELGIVTGT EDGIYSPELG CKRGDFAILI NRTLGLQVTP TKNFSDNEEG
KYYYNDVRMG YTAGILSGYG DNTYKPEKYC NRDEMFVLVA KTLEYLGVDV TSTPTSVNNK
YNDVADVAWW SAPYLAFLTQ EGIVTGSSNG NVEPKKNINR AEMAVMMYKD YEFVKDYVDG
LVKDAATTTT EETTEETTVD ESASTTETTT AETTAETTTV AE
//