ID M0LYL4_9EURY Unreviewed; 1237 AA.
AC M0LYL4;
DT 03-APR-2013, integrated into UniProtKB/TrEMBL.
DT 03-APR-2013, sequence version 1.
DT 24-JAN-2024, entry version 35.
DE RecName: Full=CBM6 domain-containing protein {ECO:0000259|PROSITE:PS51175};
GN ORFNames=C447_13347 {ECO:0000313|EMBL:EMA37205.1};
OS Halococcus hamelinensis 100A6.
OC Archaea; Euryarchaeota; Stenosarchaea group; Halobacteria; Halobacteriales;
OC Halococcaceae; Halococcus.
OX NCBI_TaxID=1132509 {ECO:0000313|EMBL:EMA37205.1, ECO:0000313|Proteomes:UP000011566};
RN [1] {ECO:0000313|EMBL:EMA37205.1, ECO:0000313|Proteomes:UP000011566}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=100A6 {ECO:0000313|EMBL:EMA37205.1,
RC ECO:0000313|Proteomes:UP000011566};
RX PubMed=25393412; DOI=10.1371/journal.pgen.1004784;
RA Becker E.A., Seitzer P.M., Tritt A., Larsen D., Krusor M., Yao A.I., Wu D.,
RA Madern D., Eisen J.A., Darling A.E., Facciotti M.T.;
RT "Phylogenetically driven sequencing of extremely halophilic archaea reveals
RT strategies for static and dynamic osmo-response.";
RL PLoS Genet. 10:E1004784-E1004784(2014).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EMA37205.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AOMB01000036; EMA37205.1; -; Genomic_DNA.
DR AlphaFoldDB; M0LYL4; -.
DR PATRIC; fig|1132509.6.peg.3095; -.
DR eggNOG; arCOG03256; Archaea.
DR eggNOG; arCOG03771; Archaea.
DR eggNOG; arCOG09022; Archaea.
DR Proteomes; UP000011566; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR CDD; cd09626; DOMON_glucodextranase_like; 1.
DR Gene3D; 2.60.40.1190; -; 1.
DR Gene3D; 2.70.98.10; -; 1.
DR Gene3D; 3.20.20.70; Aldolase class I; 2.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR InterPro; IPR013785; Aldolase_TIM.
DR InterPro; IPR005084; CMB_fam6.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR014718; GH-type_carb-bd.
DR InterPro; IPR029483; GH97_C.
DR InterPro; IPR019563; GH97_catalytic.
DR InterPro; IPR029486; GH97_N.
DR InterPro; IPR019248; Glucodextran_C.
DR PANTHER; PTHR35803; GLUCAN 1,4-ALPHA-GLUCOSIDASE SUSB-RELATED; 1.
DR Pfam; PF14509; GH97_C; 1.
DR Pfam; PF14508; GH97_N; 1.
DR Pfam; PF09985; Glucodextran_C; 1.
DR Pfam; PF10566; Glyco_hydro_97; 1.
DR SUPFAM; SSF49344; CBD9-like; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS51175; CBM6; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000011566}.
FT DOMAIN 588..732
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT REGION 153..178
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 812..860
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 823..855
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1237 AA; 134371 MW; DEDAEDD9EAAA641D CRC64;
MVTVDVSDGV PVYAVEYGGT TYLDESALGF DFRNQPTFGR SADGATGSAI EVTGSERGRA
TEHWDPVWGE FERVSAEYTS LVLGLAETDG EGRSANLEFR VFDEGVGFRF VLSDDFASNS
ERAVITTENT EFNFAGDYTS WWIRNEVTNP RFEQEYSETP LSEIPSGTRE TRPTSTLLRN
GAHTPLTVQA ADDAYLSIHE SNLEDYAAAT IAPRSENGGT EFSIELTPLP DRTKVSFELP
NKTPWRTIGV GSTPGALVES QLIPLLSDPL DESAFPSADS GIDTDWVTPR KYVGIWWIMI
AGGNNWEYRP DDSFDSQEAA ASYIHGARTE RMKRYMTFAS ENGMDSVLAE GWNKGWDTYP
GNGLGFEFGV DDSYPDFDVR EVTSFGQSLD EEVEMTMHNE TAGNLPNYEE EIQDDDIFEE
YEEVGIRSIK NGYVSDPGLG IDGNGAEPTH NHHNQLAVNH HRLVMQNAAA NRQMLEIHEG
IKPTGEIRTY PNVGAREVVK AQEYDGFTYL DSDVDRDHHV TLPFTRMLAG PTSYQPGIFD
ITFNDDRGGQ IQTTRAKQLA MYPNYLGGLQ MAADRIEAYV NNELEVGQLV QAPSGELDGL
TTLNDWRNAF GAHFVPLDPS KSPSGSSVEF VVKNVEAAGQ YDLHLRYAAD AEQNASQVLD
AGKTQATLRV NGSATTISPS FTDYWDEWDV FTTTVDLEAG ENTVALELNY EDTDDGFTGD
VGGFNLNTVA VSEAGSGSPV PASYEGYTPE NENFETKPEF AFIEDVPAAG WDETRVIDSA
VADYIVTARQ KDEEWYLGAM TDENGRAIDV PLDFLAPGQS GDDHHPGRGK GRERGREKGE
HGEHGRGKDK GRPPNGPKYV AEIYSDDVGA GVDTDPTAVR IDEAIVTSRT TLLASMAQSG
GTAVRIRRAR GKEIGDLPTY EYPNQDIEVS IESDAQFGEP FISATGSNDA AFVGGTPVKI
EVDGDLEASD IVRLPPNATD ETIALGYSLS RAGEYDVVVR GSDGTVLASA TVAVAPGEEI
ATFSDPSGDD TGPGEYVYPT DGAFEDGAFD LLSFGAYETD TEYLFSFEVA NLYDTFGGSF
SPHYFIVYLR DPTVSGGRTT TLDDLNVTAE FESPWQYRVD ASGFGAGVTD ASGTTLGTPE
TFVDFATNTV TVRVAKNQLS GLNTSNLEVV PVVGSEATGS FRAIEVEAGQ YVFGGAKEGA
VENAPRIIDL LTPSDASQTD VLAYDETSRA VLPFTPL
//