ID C0EHK7_9FIRM Unreviewed; 1238 AA.
AC C0EHK7;
DT 05-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 05-MAY-2009, sequence version 1.
DT 28-MAR-2018, entry version 37.
DE SubName: Full=Cohesin domain protein {ECO:0000313|EMBL:EEG29025.1};
GN ORFNames=CLOSTMETH_03352 {ECO:0000313|EMBL:EEG29025.1};
OS [Clostridium] methylpentosum DSM 5476.
OC Bacteria; Firmicutes; Clostridia; Clostridiales; Ruminococcaceae;
OC Ruminiclostridium.
OX NCBI_TaxID=537013 {ECO:0000313|EMBL:EEG29025.1, ECO:0000313|Proteomes:UP000003340};
RN [1] {ECO:0000313|EMBL:EEG29025.1, ECO:0000313|Proteomes:UP000003340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 5476 {ECO:0000313|EMBL:EEG29025.1,
RC ECO:0000313|Proteomes:UP000003340};
RA Fulton L., Clifton S., Fulton B., Xu J., Minx P., Pepin K.H.,
RA Johnson M., Bhonagiri V., Nash W.E., Mardis E.R., Wilson R.K.;
RL Submitted (JAN-2009) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EEG29025.1, ECO:0000313|Proteomes:UP000003340}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 5476 {ECO:0000313|EMBL:EEG29025.1,
RC ECO:0000313|Proteomes:UP000003340};
RA Sudarsanam P., Ley R., Guruge J., Turnbaugh P.J., Mahowald M.,
RA Liep D., Gordon J.;
RT "Draft genome sequence of Clostridium methylpentosum (DSM 5476).";
RL Submitted (FEB-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an
CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC preliminary data. {ECO:0000313|EMBL:EEG29025.1}.
CC -----------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC -----------------------------------------------------------------------
DR EMBL; ACEC01000118; EEG29025.1; -; Genomic_DNA.
DR RefSeq; WP_006356123.1; NZ_EQ973344.1.
DR STRING; 537013.CLOSTMETH_03352; -.
DR EnsemblBacteria; EEG29025; EEG29025; CLOSTMETH_03352.
DR eggNOG; ENOG4105D2F; Bacteria.
DR eggNOG; COG5492; LUCA.
DR OrthoDB; POG091H01S9; -.
DR Proteomes; UP000003340; Unassembled WGS sequence.
DR GO; GO:0016021; C:integral component of membrane; IEA:UniProtKB-KW.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:InterPro.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR002102; Cohesin_dom.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR Pfam; PF02368; Big_2; 9.
DR Pfam; PF00963; Cohesin; 1.
DR SMART; SM00635; BID_2; 10.
DR SUPFAM; SSF49373; SSF49373; 10.
DR SUPFAM; SSF49384; SSF49384; 1.
PE 4: Predicted;
KW Complete proteome {ECO:0000313|Proteomes:UP000003340};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000003340};
KW Signal {ECO:0000256|SAM:SignalP};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT SIGNAL 1 31 {ECO:0000256|SAM:SignalP}.
FT CHAIN 32 1238 {ECO:0000256|SAM:SignalP}.
FT /FTId=PRO_5002897930.
FT TRANSMEM 1213 1232 Helical. {ECO:0000256|SAM:Phobius}.
FT DOMAIN 176 254 BID_2. {ECO:0000259|SMART:SM00635}.
FT DOMAIN 258 336 BID_2. {ECO:0000259|SMART:SM00635}.
FT DOMAIN 340 417 BID_2. {ECO:0000259|SMART:SM00635}.
FT DOMAIN 422 499 BID_2. {ECO:0000259|SMART:SM00635}.
FT DOMAIN 503 579 BID_2. {ECO:0000259|SMART:SM00635}.
FT DOMAIN 583 663 BID_2. {ECO:0000259|SMART:SM00635}.
FT DOMAIN 673 756 BID_2. {ECO:0000259|SMART:SM00635}.
FT DOMAIN 761 838 BID_2. {ECO:0000259|SMART:SM00635}.
FT DOMAIN 843 921 BID_2. {ECO:0000259|SMART:SM00635}.
FT DOMAIN 925 1000 BID_2. {ECO:0000259|SMART:SM00635}.
SQ SEQUENCE 1238 AA; 129758 MW; 666A72C8EA758A7E CRC64;
MRKTSIPKRV LALLLSAAIF ASIGVATIVD AAAKAANTVD LTVGTVEGTK GQEVQIPVTM
NAKDNEVGSF DATINYDPEQ LELVHNDRGE PKVIFGSTVN ANAISNSPSP GILMLSGGSL
YGITDETIFT VSFVVKEGAS GNCPISLSDV FVSDNTQQAN ELETKLISGG VNVSVPLNSI
SLSQTSLDLA KGESSKLDVI YNPDNTTDDK TVEWSTGDDT VATVSKDGTV QAVGKGTTTI
TAKVGEKTAT CNVTVSVPLQ SISLNQDKLT LDKGENSQLT VSALPEDTTD TNPYSWTSDN
QSVATVNQNG LVTAVGQGTA TITVSRGDKT ASCEVVVSAP LESISIQPTL ELLKKQTAAL
TITYNPENTT DDKTAVWSSS DDKIATVDQN GVVTAVAPGK ATITAKVGKH EAACTVTVKE
QPLNSIALNK QEMTLDKGKT ESLTVTYNPE DTTDDKTVVW STSDSNVATV KNGVVTAVGV
GKATITATVG EKQAKCEVTV TSPLDHITIP STAKVNKGET TSLSVSYFPE DTTDSKDVVW
TSSNSAVASV KDGVVTGHMA GTAVITANVG GKQASCTVTV EVPLQYVSIA GIEEYTTMSR
GESLQLGVNY YPQDTTANKA VRWTSSDESV VKIDENGKMT AVGGGEAVVN VVSDYAVGKN
PTVKEYRVKV IVPLESISLV PAELTLEVGD TSTTELNYFP VDFTVSDTMK YSIDIEDPNV
VSVARDGDKF TVKALSQGST SFTVTVDEKY QATCTVQVLQ PIQSISLNKT ELSLLKGGSE
TLTVNIDPPN ADGDKSVVWS SSDETIATVN NGVVTGLKAG TAKITAAVGK HTATCTVTVQ
EIKIESVTLD KTELSLDKGD TAELTATVNP ENTTDDKTVL WSTSDPNVAL VNSNGVVTAV
GGGTAKITAK AGDKSAECTV TVEIPLGSIA LSQTTASLIS GRDTTLVVRF NPEDTTADKT
VLWSTSDPSV VTVSNGVVTA VGKGKATVTA RVGSLSATCE FTVIEASDKI PAQTVIDQLK
DESVDEVVVN LTQPDVVTKE IFEAAKGIDK NITFNIVDES GSILYSWSFN GKDVKDVSKD
VDLAIEVGAE NDAITSLVGD ERALILSFAH KGELPVNMSV RVFVGDHFAD GQPLWYYTYN
PETSSLELVS SELTVEDHFI ELSLTSCSDL VFSAKEVKDP EPGESSTPSN SGSSSQGSGS
HGDTPKTGEV TTVALMAAAM ACALGAAFVM YTRKKHED
//