ID A0A271IWT0_9BACT Unreviewed; 988 AA.
AC A0A271IWT0;
DT 20-DEC-2017, integrated into UniProtKB/TrEMBL.
DT 20-DEC-2017, sequence version 1.
DT 27-MAR-2024, entry version 19.
DE RecName: Full=CBM-cenC domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=BSZ37_01855 {ECO:0000313|EMBL:PAP75274.1};
OS Rubrivirga marina.
OC Bacteria; Rhodothermota; Rhodothermia; Rhodothermales; Rubricoccaceae;
OC Rubrivirga.
OX NCBI_TaxID=1196024 {ECO:0000313|EMBL:PAP75274.1, ECO:0000313|Proteomes:UP000216339};
RN [1] {ECO:0000313|EMBL:PAP75274.1, ECO:0000313|Proteomes:UP000216339}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SAORIC-28 {ECO:0000313|EMBL:PAP75274.1,
RC ECO:0000313|Proteomes:UP000216339};
RA Yoshizawa S., Kumagai Y., Kogure K.;
RT "Study of marine rhodopsin-containing bacteria.";
RL Submitted (NOV-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PAP75274.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MQWD01000001; PAP75274.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A271IWT0; -.
DR Proteomes; UP000216339; Unassembled WGS sequence.
DR GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:InterPro.
DR Gene3D; 2.60.40.2030; -; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR InterPro; IPR038081; CalX-like_sf.
DR InterPro; IPR003305; CenC_carb-bd.
DR InterPro; IPR043744; DUF5689.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR026444; Secre_tail.
DR NCBIfam; TIGR04183; Por_Secre_tail; 1.
DR Pfam; PF02018; CBM_4_9; 1.
DR Pfam; PF18942; DUF5689; 1.
DR SUPFAM; SSF141072; CalX-like; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000216339};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..988
FT /note="CBM-cenC domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012063288"
FT DOMAIN 321..512
FT /note="DUF5689"
FT /evidence="ECO:0000259|Pfam:PF18942"
FT DOMAIN 730..865
FT /note="CBM-cenC"
FT /evidence="ECO:0000259|Pfam:PF02018"
SQ SEQUENCE 988 AA; 101295 MW; CC48781F9030AED0 CRC64;
MRLRFSLFFT LAALIAAPSA LAQSAFLNEF HYDNDGGDTG EFVEIAVADG ILDVEDVVIT
LYNGSGGASY NTVSGADLTV GASQNGYTLY TYSFPSNGIQ NGSPDGIALS TTEGDVLQFL
SYEGSFTATN GPANGETSDD IGVSEAGDTP VGFSLQLTGT SATYDGFTWA SPMAATLGAV
NTGQTFESPA PPDPDPVEVS FADETRMVRE GDTLTVALEL DYNENEPSGP VTVLVSFVGG
ASSATTADFA SASVASATFP GQRARDNMAE VTFVFADDDL VEGPETATFR LAVTSGDAVT
GSPNILTVTV DDAPQTATVA DARAAGVGES VTIEGVVSRA AGAFLYVQDE TGGLAIRQTS
GPLFDAVASG AVAPGTQIRL TGTLSEFRGL LQINGSDLES YDVLGTTDAP EPQVVTLAEL
AENGEAYEGE LVTVRNVSFA ETGTFSAATT YTVSDDSDDS GVVTARVPNG SDSTVDGTEI
PEVADVTAII GQFNADDPDG GYQLLLINAE DVGNASGGGG GDEIVPIAEA RAEGVGATVR
IEGVVTRAAG AFLYLQDETG ALTIRQTSGD LFDAIASGDV GPGTVLDVTG TLSEFRGLLQ
INGGDLESYE VTGTAAVPQP QVVTLAELAA NGEDYEGELV YVAGVTVDGS GAFTAATTYL
IDDNSEASGL VTLRVPNADD TTVDDTPIPD GPVGIIGVIG QFNADDPEGG FQLLVLDAAD
IDAPAPSLDV VVNGGFEMPA PGVVTGGDVP GFALNVGGAV TMAPEFAIVD DVAYEGDQSL
AVTVNGTGAN PWEIEVAAEP LTVEAGRTYL YSVWARSETD GGTASFTIGQ PAPSYNELGR
ISDAALTTEW QEFTVEFTVP DGVTEIRAPI HFSYAANVGN TIYIDNLSIT PAGEVAAENP
VEETAATLSV TNPIRTGATV RYSLETAGDV SVALFDMLGR QVAVVADGPA GPQERTARLD
AAGLASGVYV LRLQGEDVMV SRTVTVVR
//