ID A0A1Z1FGZ7_9SPHN Unreviewed; 542 AA.
AC A0A1Z1FGZ7;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 24-JAN-2024, entry version 17.
DE SubName: Full=DUF5597 domain-containing protein {ECO:0000313|EMBL:QNE07506.1};
DE SubName: Full=Glycoside hydrolase {ECO:0000313|EMBL:ARU18000.1};
GN ORFNames=A9D14_16970 {ECO:0000313|EMBL:ARU18000.1}, H4O24_16695
GN {ECO:0000313|EMBL:QNE07506.1};
OS Croceicoccus marinus.
OG Plasmid pCME4A9I {ECO:0000313|EMBL:ARU18000.1},
OG Plasmid pcme4a9i {ECO:0000313|Proteomes:UP000195807}, and
OG Plasmid plas1 {ECO:0000313|EMBL:QNE07506.1,
OG ECO:0000313|Proteomes:UP000515297}.
OC Bacteria; Pseudomonadota; Alphaproteobacteria; Sphingomonadales;
OC Erythrobacteraceae; Croceicoccus.
OX NCBI_TaxID=450378 {ECO:0000313|EMBL:ARU18000.1, ECO:0000313|Proteomes:UP000195807};
RN [1] {ECO:0000313|EMBL:ARU18000.1, ECO:0000313|Proteomes:UP000195807}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=E4A9 {ECO:0000313|EMBL:ARU18000.1,
RC ECO:0000313|Proteomes:UP000195807};
RC PLASMID=pCME4A9I {ECO:0000313|EMBL:ARU18000.1}, and Plasmid pcme4a9i
RC {ECO:0000313|Proteomes:UP000195807};
RA Wu Y.-H., Cheng H., Xu L., Huo Y.-Y., Wang C.-S., Xu X.-W.;
RT "Complete genome sequence of esterase-producing bacterium Croceicoccus
RT marinus E4A9.";
RL Submitted (JAN-2017) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:QNE07506.1, ECO:0000313|Proteomes:UP000515297}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=OT19 {ECO:0000313|EMBL:QNE07506.1,
RC ECO:0000313|Proteomes:UP000515297};
RC PLASMID=plas1 {ECO:0000313|EMBL:QNE07506.1,
RC ECO:0000313|Proteomes:UP000515297};
RA Liu G., Sun C.;
RL Submitted (AUG-2020) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP019603; ARU18000.1; -; Genomic_DNA.
DR EMBL; CP060053; QNE07506.1; -; Genomic_DNA.
DR RefSeq; WP_066850502.1; NZ_CP060053.1.
DR AlphaFoldDB; A0A1Z1FGZ7; -.
DR STRING; 450378.GCA_001661675_03412; -.
DR KEGG; cman:A9D14_16970; -.
DR OrthoDB; 9800974at2; -.
DR Proteomes; UP000195807; Plasmid pcme4a9i.
DR Proteomes; UP000515297; Plasmid plas1.
DR GO; GO:0009341; C:beta-galactosidase complex; IEA:InterPro.
DR GO; GO:0004565; F:beta-galactosidase activity; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.220.20; putative beta-Galactosidase from caulobacter crescentus; 1.
DR InterPro; IPR040719; DUF5597.
DR InterPro; IPR013529; Glyco_hydro_42_N.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR Pfam; PF18120; DUF5597; 1.
DR Pfam; PF02449; Glyco_hydro_42; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000313|EMBL:ARU18000.1};
KW Plasmid {ECO:0000313|EMBL:ARU18000.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000195807};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..542
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5036030729"
FT DOMAIN 68..211
FT /note="Glycoside hydrolase family 42 N-terminal"
FT /evidence="ECO:0000259|Pfam:PF02449"
FT DOMAIN 386..527
FT /note="DUF5597"
FT /evidence="ECO:0000259|Pfam:PF18120"
SQ SEQUENCE 542 AA; 59562 MW; 6D1859E48AE54C73 CRC64;
MKAIFLCGAL AGSIALTPLA AFAQQPLPRI ESANGKHLLM VDGEPFLMLG AQANNSSNYP
AVLPQVWPMM ERLHANTLEI PIAWEQFEPE EGRFDYSYLE ALVEGARERN KRLVLLWFAT
WKNTGPSYAP LWVKTDIERF PRMRTAEGNA HYALSPHARS TLEADKRAFV ELMRWLRDND
PQRTVIMVQP ENEVGVYGQK RDHSAEAEKL FAGPIPAGLA RHTGKSGSWR EAFGPLADSA
FNSWYTARYI DEIAAAGQEV LDLPMYANAA LSDPFSPPGE GGGASGGPDM PVIDIWKAAA
PHIDFVAPDI YMRDQQQVAE VMRLYARPDN ALMIPEIGNA ADYARFWWTA LGHGAIGFAP
FGMDDTGYSN YPLGAKELDE DTVEAFAAKY RLFAPMAGAW ARVAARSPTW GAAKPSDGSS
QSQRMGRWTA HVEYGEWQFG DRDSPWLKSD PHPTEGQPVG GAVFVQTGPD EFLVAGSNAR
VHLGLAEAAP GQSSAMLRVE EGTLAEDGSF VMRRVWNGDQ TDHGLNFTSQ PVLLKVTMGS
YQ
//