ID A0A0Q6Z4X6_9BRAD Unreviewed; 654 AA.
AC A0A0Q6Z4X6;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 24-JAN-2024, entry version 27.
DE SubName: Full=Squalene--hopene cyclase {ECO:0000313|EMBL:KQW18477.1};
GN ORFNames=ASC80_20935 {ECO:0000313|EMBL:KQW18477.1};
OS Afipia sp. Root123D2.
OC Bacteria; Pseudomonadota; Alphaproteobacteria; Hyphomicrobiales;
OC Nitrobacteraceae; Afipia.
OX NCBI_TaxID=1736436 {ECO:0000313|EMBL:KQW18477.1, ECO:0000313|Proteomes:UP000051348};
RN [1] {ECO:0000313|EMBL:KQW18477.1, ECO:0000313|Proteomes:UP000051348}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Root123D2 {ECO:0000313|EMBL:KQW18477.1,
RC ECO:0000313|Proteomes:UP000051348};
RA Gilbert D.G.;
RL Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KQW18477.1, ECO:0000313|Proteomes:UP000051348}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Root123D2 {ECO:0000313|EMBL:KQW18477.1,
RC ECO:0000313|Proteomes:UP000051348};
RA Schulze-Lefert P.;
RT "Functional overlap of the Arabidopsis leaf and root microbiotas.";
RL Submitted (NOV-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- PATHWAY: Secondary metabolite biosynthesis; hopanoid biosynthesis.
CC {ECO:0000256|ARBA:ARBA00004999}.
CC -!- SIMILARITY: Belongs to the terpene cyclase/mutase family.
CC {ECO:0000256|ARBA:ARBA00009755}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KQW18477.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LMDP01000004; KQW18477.1; -; Genomic_DNA.
DR RefSeq; WP_056301563.1; NZ_LMDP01000004.1.
DR AlphaFoldDB; A0A0Q6Z4X6; -.
DR STRING; 1736436.ASC80_20935; -.
DR OrthoDB; 9758578at2; -.
DR UniPathway; UPA00337; -.
DR Proteomes; UP000051348; Unassembled WGS sequence.
DR GO; GO:0005811; C:lipid droplet; IEA:InterPro.
DR GO; GO:0016866; F:intramolecular transferase activity; IEA:InterPro.
DR GO; GO:0016104; P:triterpenoid biosynthetic process; IEA:InterPro.
DR CDD; cd02892; SQCY_1; 1.
DR Gene3D; 1.50.10.20; -; 2.
DR InterPro; IPR006400; Hopene-cyclase.
DR InterPro; IPR032696; SQ_cyclase_C.
DR InterPro; IPR032697; SQ_cyclase_N.
DR InterPro; IPR018333; Squalene_cyclase.
DR InterPro; IPR002365; Terpene_synthase_CS.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR NCBIfam; TIGR01507; hopene_cyclase; 1.
DR NCBIfam; TIGR01787; squalene_cyclas; 1.
DR PANTHER; PTHR11764:SF20; LANOSTEROL SYNTHASE; 1.
DR PANTHER; PTHR11764; TERPENE CYCLASE/MUTASE FAMILY MEMBER; 1.
DR Pfam; PF13243; SQHop_cyclase_C; 1.
DR Pfam; PF13249; SQHop_cyclase_N; 1.
DR SFLD; SFLDG01016; Prenyltransferase_Like_2; 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 2.
DR PROSITE; PS01074; TERPENE_SYNTHASES; 1.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000051348};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 20..310
FT /note="Squalene cyclase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF13249"
FT DOMAIN 319..641
FT /note="Squalene cyclase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF13243"
SQ SEQUENCE 654 AA; 72876 MW; BFC9381898188D90 CRC64;
MIIDQSTASA PAAETLEKSI SSASRALRDF RKGDGHWVFE LEADATIPAE YVLLRHYLAE
PVDAVLEAKI AMYLRRIQND NGGWSLFYGH EFDMSASVKA YFALKMIGDS VDAPHMVKAR
ETILARGGAA KSNVFTRIMM ALFGVLTWRA VPMMPVEIML LPRWFPFHLT KISYWARVVI
VPLLVLMTFK PRARNPLGVG IDELFHEDPK TVGPTPKAPH QSQLWFTGFN ILDRVLRVVD
PLFPKKTRER SVQKAVAFVT ERLNGVDGLG AIFPAMANSV MMFDLLGYPK DHPHYMLARA
SVEKLLVIKD DEAYCQPCVS PVWDTALTAH AMIESGSETE VASAKDALEW LAPLQVLDVE
GDWIEKRPGV RPGGWAFQYN NAHYPDLDDT AVVVMAMDRA RGLGVGTKYD EAIARGREWI
LGLQSVDGGW AAFDADNLEY YLNNIPFSDH GALLDPPTDD VTARCVSMLA QLGDTVDNSP
ALARGIDYLR RTQLADGSWF GRWGVNYIYG TWSVLCALNA AGVPHDDPAV RKAVDWLAAI
QNADGGWGED GNSYKLNYQG YQRSETTASQ TAWATLALMA AGQVDHPATQ RGIRYLVDSQ
GDNGLWHERH YTGGGFPRVF YLRYHGYSKF FPLWALARYR NLRKTNSKFV GVGM
//