ID B5CMC1_9FIRM Unreviewed; 1535 AA.
AC B5CMC1;
DT 14-OCT-2008, integrated into UniProtKB/TrEMBL.
DT 14-OCT-2008, sequence version 1.
DT 24-JAN-2024, entry version 57.
DE SubName: Full=Carbohydrate binding domain protein {ECO:0000313|EMBL:EDY33586.1};
GN ORFNames=RUMLAC_00596 {ECO:0000313|EMBL:EDY33586.1};
OS [Ruminococcus] lactaris ATCC 29176.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Mediterraneibacter.
OX NCBI_TaxID=471875 {ECO:0000313|EMBL:EDY33586.1, ECO:0000313|Proteomes:UP000003254};
RN [1] {ECO:0000313|EMBL:EDY33586.1, ECO:0000313|Proteomes:UP000003254}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 29176 {ECO:0000313|EMBL:EDY33586.1,
RC ECO:0000313|Proteomes:UP000003254};
RA Sudarsanam P., Ley R., Guruge J., Turnbaugh P.J., Mahowald M., Liep D.,
RA Gordon J.;
RT "Draft genome sequence of Ruminococcus lactaris ATCC 29176.";
RL Submitted (AUG-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EDY33586.1, ECO:0000313|Proteomes:UP000003254}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 29176 {ECO:0000313|EMBL:EDY33586.1,
RC ECO:0000313|Proteomes:UP000003254};
RA Fulton L., Clifton S., Fulton B., Xu J., Minx P., Pepin K.H., Johnson M.,
RA Bhonagiri V., Nash W.E., Mardis E.R., Wilson R.K.;
RL Submitted (AUG-2008) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EDY33586.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ABOU02000019; EDY33586.1; -; Genomic_DNA.
DR RefSeq; WP_005610271.1; NZ_DS990183.1.
DR GeneID; 77334967; -.
DR eggNOG; COG0366; Bacteria.
DR eggNOG; COG5492; Bacteria.
DR HOGENOM; CLU_001559_0_0_9; -.
DR Proteomes; UP000003254; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0033926; F:glycopeptide alpha-N-acetylgalactosaminidase activity; IEA:InterPro.
DR Gene3D; 2.70.98.10; -; 1.
DR Gene3D; 1.20.1270.90; AF1782-like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.1180; Golgi alpha-mannosidase II; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR005102; Carbo-bd_X2.
DR InterPro; IPR025706; Endoa_GalNAc.
DR InterPro; IPR040633; Gal_mutarotas_3.
DR InterPro; IPR014718; GH-type_carb-bd.
DR InterPro; IPR049314; GH101_dom-5.
DR InterPro; IPR040502; GH101_dom-6.
DR InterPro; IPR035364; Glyco_hyd_101_beta.
DR InterPro; IPR013780; Glyco_hydro_b.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR InterPro; IPR001202; WW_dom.
DR Pfam; PF03442; CBM_X2; 1.
DR Pfam; PF18080; Gal_mutarotas_3; 1.
DR Pfam; PF17974; GalBD_like; 1.
DR Pfam; PF21466; GH101_dom-5; 1.
DR Pfam; PF17451; Glyco_hyd_101C; 1.
DR Pfam; PF12905; Glyco_hydro_101; 1.
DR SUPFAM; SSF81296; E set domains; 1.
DR PROSITE; PS50020; WW_DOMAIN_2; 1.
PE 4: Predicted;
KW Membrane {ECO:0000256|SAM:Phobius};
KW Reference proteome {ECO:0000313|Proteomes:UP000003254};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 1510..1529
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 896..929
FT /note="WW"
FT /evidence="ECO:0000259|PROSITE:PS50020"
FT REGION 805..831
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1426..1510
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1430..1446
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1447..1465
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1466..1505
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1535 AA; 168314 MW; C24E9C95F7691757 CRC64;
MIIDQTVDAF SDTMSGQIGM RVWGEPTETH GCALKFDNIK TSDIFSAVSI DPKSVKLEQD
TAAAQDIEVA ITGENTLSAI KNGSEVLERG TDYTVNGQNV TIKTTYLEKI KSQSSTKLVF
EFEDGQTQTF TININIPEPT VSYTRNFAAD GADGFTKKSG SGSMSLENGA MKLQGDGVFI
DDNSASLKDQ EIEFTYDPMN DNCGYGAVLR YVSDNEYFYV GPSSQNNQHY TRWNIWNAQG
QSLLGSEYND SGFILANRVV PYKIKVRIVD KYVTVFVDNE EILNRELSNV TTNPGKVGFR
TGSNNGMLIQ KFTQENAAVP TVVENTEPVT IQSDAMTVRL DRAFPRVIDY TLKNGGETVK
GQELALHQIE LNNKLYTPDV TADISENKAV YHVSEATTGI SFDVIFTVEG NVLSMNVKNI
VDDQTKLYTL NFPRHSLISM SSKDADGKLT VNNYQGQNAI SLSSANASEA YNETTLAVLS
NRNVAAALSG ESYKNRHEVA YQTFAAGDHN STGLWMNEYT YRGLDGEIMY LPAVKVAVTA
DCNGDGKVDA QDGAIVLRDK CMTRKSGADE VTDSWTMIAM NVGSEAQYPF LRILDNVKKM
YLATDGFGQN IIIKGYQGEG HDSSHPDYAN YNKHAGGLED FNTLLSEAEK YNAQIGIHVN
QTDTYPEAPQ YGKLAASLPA WDWYDSSKGI IRENDDLDTS ENGLDGRFSQ LYDKDTQGKI
DTTYVDVFFG TRWPMYKLIQ NIKGRNIILG TENPDEMVSH SVFVHGIQTG AGNFKGAGNL
VRFVENNQSD IFQGTTLFRG IQSRNNGGDT GGASKGGAGI DGWQQSSQGN NAASMNDSLD
TFYCEVLPAK FLAQYPLMQY ESESRAVLGN SNEVVTEIVN NVNVITLDGE KVAEGNKIFI
PWEKDDDEQG KIYHYNKDGG SSTWTLPESW GNVTEVTIYK LSAEGKSDKK TLPVTGRKVT
IDATAKTGYV LYKEDAVKVD TADTMEWSTG SPVKDMGFDS YNFDEWKPSS TSDSVDHIKI
KDNSLGNAHL YIEGTKDGQV SQTLTGLVPG QAYSASVWCI TDDGRKASIE VKNGDEVVSN
YMEQSNVTYG VHHNDKYLTK AQRMQVRFTA ASDTVKLTLS AAAGKSDTSV VDFDDVRVAK
VDASTNPDPN KYTYWEDFEN VDQGFGVFVS TESDQSHLSQ KNPVNPEYTT DVIDGNYSLK
IRAKDYMRTI PSTVRFKPNT EYKVGIEYKA PSANAFTFAV KSDKASKTLA SAVANAQSGK
LVLEFTTGDE EDCYVDITGQ SSEYYVDNFY VEEAYIPADF SELQKAVDEA EKLDRTVYMT
EYYAGLDKIL AEAEKVLDNP RTEQSEVDEM TQKVRDAINA LQPLATNADF AALEKAVEEA
EKIIVKDYKD TSDFESALAA AKLLLEGKET KKELKHTEVA TATDTLVEKQ KALKPVDPKP
VDPDPVDPKP VDPNPVDPDP VAPGKPSTPD NSNGAGQTTG SGKPSADQKN PAMGTKKSGK
ATKTGDTTPV VPVAGGVLVS AMMLLANFLK KKKDN
//