ID A0A370GCQ0_GLULI Unreviewed; 663 AA.
AC A0A370GCQ0;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 24-JAN-2024, entry version 17.
DE SubName: Full=Squalene--hopene cyclase {ECO:0000313|EMBL:MBB2184799.1};
DE EC=5.4.99.17 {ECO:0000313|EMBL:MBB2184799.1};
DE SubName: Full=Squalene-hopene/tetraprenyl-beta-curcumene cyclase {ECO:0000313|EMBL:RDI40224.1};
GN Name=shc {ECO:0000313|EMBL:MBB2184799.1};
GN ORFNames=C7453_10111 {ECO:0000313|EMBL:RDI40224.1}, HLH32_00055
GN {ECO:0000313|EMBL:MBB2184799.1};
OS Gluconacetobacter liquefaciens (Acetobacter liquefaciens).
OC Bacteria; Pseudomonadota; Alphaproteobacteria; Rhodospirillales;
OC Acetobacteraceae; Gluconacetobacter.
OX NCBI_TaxID=89584 {ECO:0000313|EMBL:RDI40224.1, ECO:0000313|Proteomes:UP000254958};
RN [1] {ECO:0000313|EMBL:RDI40224.1, ECO:0000313|Proteomes:UP000254958}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 5603 {ECO:0000313|EMBL:RDI40224.1,
RC ECO:0000313|Proteomes:UP000254958};
RA Goeker M.;
RT "Genomic Encyclopedia of Type Strains, Phase IV (KMG-IV): sequencing the
RT most valuable type-strain genomes for metagenomic binning, comparative
RT biology and taxonomic classification.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:MBB2184799.1, ECO:0000313|Proteomes:UP000562982}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=LMG 1382 {ECO:0000313|EMBL:MBB2184799.1,
RC ECO:0000313|Proteomes:UP000562982};
RA Sombolestani A.;
RT "Description of novel Gluconacetobacter.";
RL Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- PATHWAY: Secondary metabolite biosynthesis; hopanoid biosynthesis.
CC {ECO:0000256|ARBA:ARBA00004999}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RDI40224.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JABEQI010000001; MBB2184799.1; -; Genomic_DNA.
DR EMBL; QQAW01000001; RDI40224.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A370GCQ0; -.
DR OrthoDB; 9758578at2; -.
DR UniPathway; UPA00337; -.
DR Proteomes; UP000254958; Unassembled WGS sequence.
DR Proteomes; UP000562982; Unassembled WGS sequence.
DR GO; GO:0005811; C:lipid droplet; IEA:InterPro.
DR GO; GO:0051007; F:squalene-hopene cyclase activity; IEA:UniProtKB-EC.
DR GO; GO:0016104; P:triterpenoid biosynthetic process; IEA:InterPro.
DR CDD; cd02892; SQCY_1; 1.
DR Gene3D; 1.50.10.20; -; 2.
DR InterPro; IPR006400; Hopene-cyclase.
DR InterPro; IPR032696; SQ_cyclase_C.
DR InterPro; IPR032697; SQ_cyclase_N.
DR InterPro; IPR018333; Squalene_cyclase.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR NCBIfam; TIGR01507; hopene_cyclase; 1.
DR NCBIfam; TIGR01787; squalene_cyclas; 1.
DR PANTHER; PTHR11764:SF20; LANOSTEROL SYNTHASE; 1.
DR PANTHER; PTHR11764; TERPENE CYCLASE/MUTASE FAMILY MEMBER; 1.
DR Pfam; PF13243; SQHop_cyclase_C; 1.
DR Pfam; PF13249; SQHop_cyclase_N; 1.
DR SFLD; SFLDG01016; Prenyltransferase_Like_2; 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 2.
PE 4: Predicted;
KW Isomerase {ECO:0000313|EMBL:MBB2184799.1};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 31..317
FT /note="Squalene cyclase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF13249"
FT DOMAIN 327..650
FT /note="Squalene cyclase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF13243"
SQ SEQUENCE 663 AA; 73716 MW; 4F021EBE9B40509F CRC64;
MMVNATDTLE RPSAGAADPV VPMIEIDQAV DSAHAALGRR QNDDGHWVFE LEADATIPAE
YVLLEHYLDR IEPDLEARIG VYLRRIQGDH GGWPLYHGGR FDLSATVKAY FALKAIGDDI
DAPHMARARA AILDHGGAER SNVFTRFQLA LFGEVPWHAT PAMPVELMLL PRKALFSVWN
MSYWSRTVIA PLLVLAALRP RAVNPRDVHV PELFVTPPAQ VRDWIRGPYR SALGRVFKYV
DKVVRPGERL IPEATRRRAI KAAVDFIEPR LNGVDGLGAI YPAMANSVMM YRALGVPDSD
PRAATAWESV RRLLVENGDE AYCQPCVSPI WDTGLAGHAM IEAASGPDGI RPHETKKKLA
AAAEWLRSRQ ILDVKGDWAI NCPDVAPGGW AFQYNNDYYP DVDDTAVVGM LLHREGDPAH
HDAIERARQW ILGMQSTNGG WGAFDIDNNM DFLNHIPFAD HGALLDPPTA DVTARCVSFL
AQLGHPEDRP VIERAVAYLR SDQEAEGCWF GRWGTNYIYG TWSVLCALNV AGVSHDDPAI
VRAVDWLRSV QREDGGWGED CATFEGATPG IYTESLPSQT AWATLGLMAA GLRDDPAVAR
GMAYLARTQK EDGEWDEEPY NAVGFPKVFY LRYHGYRQFF PLMALSRYRN LESSNTRRVA
FGF
//