ID G9ZUF2_9PROT Unreviewed; 653 AA.
AC G9ZUF2;
DT 22-FEB-2012, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2012, sequence version 1.
DT 24-JAN-2024, entry version 42.
DE SubName: Full=Squalene-hopene cyclase {ECO:0000313|EMBL:EHM03396.1};
GN ORFNames=HMPREF9946_00166 {ECO:0000313|EMBL:EHM03396.1};
OS Acetobacteraceae bacterium AT-5844.
OC Bacteria; Pseudomonadota; Alphaproteobacteria; Rhodospirillales;
OC Acetobacteraceae.
OX NCBI_TaxID=1054213 {ECO:0000313|EMBL:EHM03396.1, ECO:0000313|Proteomes:UP000003292};
RN [1] {ECO:0000313|EMBL:EHM03396.1, ECO:0000313|Proteomes:UP000003292}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AT-5844 {ECO:0000313|Proteomes:UP000003292};
RA Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA Hall O., Minx P., Tomlinson C., Mitreva M., Hou S., Chen J., Wollam A.,
RA Pepin K.H., Johnson M., Bhonagiri V., Zhang X., Suruliraj S., Warren W.,
RA Chinwalla A., Mardis E.R., Wilson R.K.;
RL Submitted (SEP-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- PATHWAY: Secondary metabolite biosynthesis; hopanoid biosynthesis.
CC {ECO:0000256|ARBA:ARBA00004999}.
CC -!- SIMILARITY: Belongs to the terpene cyclase/mutase family.
CC {ECO:0000256|ARBA:ARBA00009755}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EHM03396.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGEZ01000011; EHM03396.1; -; Genomic_DNA.
DR AlphaFoldDB; G9ZUF2; -.
DR STRING; 1054213.HMPREF9946_00166; -.
DR PATRIC; fig|1054213.3.peg.156; -.
DR eggNOG; COG1657; Bacteria.
DR HOGENOM; CLU_019345_0_0_5; -.
DR OrthoDB; 9758578at2; -.
DR BioCyc; ABAC1054213:G1H32-155-MONOMER; -.
DR UniPathway; UPA00337; -.
DR Proteomes; UP000003292; Unassembled WGS sequence.
DR GO; GO:0005811; C:lipid droplet; IEA:InterPro.
DR GO; GO:0016866; F:intramolecular transferase activity; IEA:InterPro.
DR GO; GO:0016104; P:triterpenoid biosynthetic process; IEA:InterPro.
DR CDD; cd02892; SQCY_1; 1.
DR Gene3D; 1.50.10.20; -; 2.
DR InterPro; IPR006400; Hopene-cyclase.
DR InterPro; IPR032696; SQ_cyclase_C.
DR InterPro; IPR032697; SQ_cyclase_N.
DR InterPro; IPR018333; Squalene_cyclase.
DR InterPro; IPR002365; Terpene_synthase_CS.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR NCBIfam; TIGR01507; hopene_cyclase; 1.
DR NCBIfam; TIGR01787; squalene_cyclas; 1.
DR PANTHER; PTHR11764:SF20; LANOSTEROL SYNTHASE; 1.
DR PANTHER; PTHR11764; TERPENE CYCLASE/MUTASE FAMILY MEMBER; 1.
DR Pfam; PF13243; SQHop_cyclase_C; 1.
DR Pfam; PF13249; SQHop_cyclase_N; 1.
DR SFLD; SFLDG01016; Prenyltransferase_Like_2; 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 2.
DR PROSITE; PS01074; TERPENE_SYNTHASES; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000003292};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 24..309
FT /note="Squalene cyclase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF13249"
FT DOMAIN 320..640
FT /note="Squalene cyclase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF13243"
SQ SEQUENCE 653 AA; 72503 MW; 1EC7363A6FC01ADC CRC64;
MNDAFSFQAS PATPVSPDGV EDAIHHATAA LGRYQREDGH WVFELEADAT IPAEYVLLRH
YLGEPDDLTL ERKIGNYLRR IQGAHGGWPL FHGGAFDISA SVKAYFCLKM IGDDPDAPHM
VRAREAILAQ GGAARCNVFT RILLAQFGEL PWSAVPAMPV EMILLPRWFP VHLSKMSYWA
RTVIVPLLVL QAVKRRARNP RGIGVQELFR PGVRVGTPLT HQRRGWAAFF NGLDVVLQKV
EPLWPRGQRQ QAIARCEAFV TERLNGEDGL GAIYPAIANS VMMYDALGHG PEHPGRAIAR
QAIEKLLVVG EDEAYCQPCV SPVWDTALAS HAMLEVGGEA AGAALRGLEW LRPRQELEVK
GDWAETRPDV RPGGWAFQYR NAHYPDLDDT AVVVMAMDRA RYQFGAGGGY DIAIDRGAEW
VVGLQSTNGG WGAFDVDNNH DFLNNIPFAD HGALLDPPTA DVSARCVSML AQLGRTDTPE
MRRALDYLER EQEKDGSWFG RWGVNYVYGT WSALCALNAA GFDATRPSMR RGADWLLSIQ
NEDGGWGEDC DSYKLDYRGY EPAPSTASQT AWALLGLMAA GEVDNPAVAR GIEWLRRMQG
EDGLWPQEAY TGGGFPRIFY LRYHGYPKFF PLWALARYRN LRAGNARHVS HGM
//