ID D1PS97_9FIRM Unreviewed; 959 AA.
AC D1PS97;
DT 09-FEB-2010, integrated into UniProtKB/TrEMBL.
DT 09-FEB-2010, sequence version 1.
DT 24-JAN-2024, entry version 39.
DE SubName: Full=Repeat protein {ECO:0000313|EMBL:EFB74382.1};
GN ORFNames=SUBVAR_07280 {ECO:0000313|EMBL:EFB74382.1};
OS Subdoligranulum variabile DSM 15176.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Subdoligranulum.
OX NCBI_TaxID=411471 {ECO:0000313|EMBL:EFB74382.1, ECO:0000313|Proteomes:UP000003438};
RN [1] {ECO:0000313|EMBL:EFB74382.1, ECO:0000313|Proteomes:UP000003438}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 15176 {ECO:0000313|EMBL:EFB74382.1,
RC ECO:0000313|Proteomes:UP000003438};
RA Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA Hall O., Minx P., Tomlinson C., Mitreva M., Nelson J., Hou S., Wollam A.,
RA Pepin K.H., Johnson M., Bhonagiri V., Nash W.E., Warren W., Chinwalla A.,
RA Mardis E.R., Wilson R.K.;
RL Submitted (DEC-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EFB74382.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACBY02000071; EFB74382.1; -; Genomic_DNA.
DR AlphaFoldDB; D1PS97; -.
DR STRING; 411471.SUBVAR_07280; -.
DR eggNOG; COG4733; Bacteria.
DR eggNOG; COG5492; Bacteria.
DR HOGENOM; CLU_307939_0_0_9; -.
DR Proteomes; UP000003438; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.1080; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 3.
DR Gene3D; 2.60.40.4270; Listeria-Bacteroides repeat domain; 1.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR036179; Ig-like_dom_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR003599; Ig_sub.
DR InterPro; IPR013378; InlB-like_B-rpt.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR InterPro; IPR042229; Listeria/Bacterioides_rpt_sf.
DR NCBIfam; TIGR02543; List_Bact_rpt; 1.
DR Pfam; PF02368; Big_2; 1.
DR Pfam; PF09479; Flg_new; 1.
DR SMART; SM00635; BID_2; 1.
DR SMART; SM00409; IG; 1.
DR SUPFAM; SSF48726; Immunoglobulin; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
PE 4: Predicted;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023001};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023001};
KW Reference proteome {ECO:0000313|Proteomes:UP000003438};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 934..954
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 398..485
FT /note="Ig-like"
FT /evidence="ECO:0000259|PROSITE:PS50835"
FT REGION 866..927
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 868..895
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 896..912
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 959 AA; 97196 MW; E8973DC1A793B13F CRC64;
MLEIPIKMGV MPPGAVGGAA DPARKFLPAE GGTRMKKTIV SLICTLALCL GLLPTAALAA
GEGAPDMLYV GNQSVRNGDN TTYWTTDDSG ALTQSNESAA WNVKYDPATA TLTLNGATIT
GNSEYASASK GAGIYALSRS GQPVSLTIEL IGENTITGIY GIYVNSELSE NSYGTDASLT
ITGESNGSLK VSGSYHGIYV KSGTGGASLN INDASVVASC SSSYDGYAGV CVQSSSDATS
SPKLSLAVDG GSLTTSASEG NDGIQFYVGS SQATGATTSL TISNNAIVRA QNGIKASRVD
EPTPSGTGIV FDGTEGTVYG NVTLDESLTI AQGETLTIPQ GSTLNVNSNL TNNGTVTIEN
GGTLTGGDAI NNTDGTINVE NGGILTGKPT TGTVVNAPAI TTQPESKTVT VGQTATFTVA
ADGTSPTYQW QQKTTDSGAT WTDISGATSA TYTTAATELD MSGYQYRCVV SNTAGTVTSA
PATLTVTQPV TGVKLDKDTL SLVVDGTATL QATVAPENAT NKDVTWTSDK PEIAKVENGK
VTALKQGTAT ITVTTNDGGK TATCTVNVTA KTYQIAVDKS TLDFGSMFAG NSVPGAQTVT
VKNTGNQTVT LNLPASTYYN VTAGTGFTNG NAVIEPEKTA TFTVQPKTGL AAGTWQETLT
VSGDNGVQAA LLCKLTVTAK TYSLSIDPGA IGFGNVQVGY NQPAVKEVVV KNTGNQTLTL
TQPTATNYQV GSLNQFTVAP GDTAVFTVQP KAGLLAGSYN ETVLVAANGG AGASINLTFT
VGSTASSAAT EKRTLHFDTN GGLAMADVVR GLGAPVELWP YTPVRAGYLF QGWYADQALT
KAVSSVVLTK DTTVYAKWAV DPAAAAAQSG SGSGSGSGSG NKGGSGTTVT VTPAPTATPT
PTPEPTATPT PEPTATPEAS AEPETDADTA SFPVVPVAAG VIVLVVLVGG IAIYRRFHD
//