ID A0A1Q5AAH1_9ACTN Unreviewed; 649 AA.
AC A0A1Q5AAH1;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 24-JAN-2024, entry version 22.
DE SubName: Full=Squalene--hopene cyclase {ECO:0000313|EMBL:OKI56031.1};
GN ORFNames=A6A27_30840 {ECO:0000313|EMBL:OKI56031.1};
OS Micromonospora sp. CB01531.
OC Bacteria; Actinomycetota; Actinomycetes; Micromonosporales;
OC Micromonosporaceae; Micromonospora.
OX NCBI_TaxID=1718947 {ECO:0000313|EMBL:OKI56031.1, ECO:0000313|Proteomes:UP000186700};
RN [1] {ECO:0000313|EMBL:OKI56031.1, ECO:0000313|Proteomes:UP000186700}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CB01531 {ECO:0000313|EMBL:OKI56031.1,
RC ECO:0000313|Proteomes:UP000186700};
RX PubMed=27999165; DOI=10.1128/mbio.02104-16;
RA Yan X., Ge H., Huang T., Hindra, Yang D., Teng Q., Crnovcic I., Li X.,
RA Rudolf J.D., Lohman J.R., Gansemans Y., Zhu X., Huang Y., Zhao L.X.,
RA Jiang Y., Van Nieuwerburgh F., Rader C., Duan Y., Shen B.;
RT "Strain Prioritization and Genome Mining for Enediyne Natural Products.";
RL MBio 7:e02104-e02116(2016).
CC -!- PATHWAY: Secondary metabolite biosynthesis; hopanoid biosynthesis.
CC {ECO:0000256|ARBA:ARBA00004999}.
CC -!- SIMILARITY: Belongs to the terpene synthase family.
CC {ECO:0000256|ARBA:ARBA00006333}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OKI56031.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LWLS01000086; OKI56031.1; -; Genomic_DNA.
DR RefSeq; WP_073837269.1; NZ_LWLS01000086.1.
DR AlphaFoldDB; A0A1Q5AAH1; -.
DR STRING; 1718947.A6A27_30840; -.
DR OrthoDB; 9758578at2; -.
DR UniPathway; UPA00337; -.
DR Proteomes; UP000186700; Unassembled WGS sequence.
DR GO; GO:0005811; C:lipid droplet; IEA:InterPro.
DR GO; GO:0016866; F:intramolecular transferase activity; IEA:InterPro.
DR GO; GO:0016104; P:triterpenoid biosynthetic process; IEA:InterPro.
DR CDD; cd02892; SQCY_1; 1.
DR Gene3D; 1.50.10.20; -; 2.
DR InterPro; IPR006400; Hopene-cyclase.
DR InterPro; IPR032696; SQ_cyclase_C.
DR InterPro; IPR032697; SQ_cyclase_N.
DR InterPro; IPR018333; Squalene_cyclase.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR NCBIfam; TIGR01507; hopene_cyclase; 1.
DR NCBIfam; TIGR01787; squalene_cyclas; 1.
DR PANTHER; PTHR11764:SF20; LANOSTEROL SYNTHASE; 1.
DR PANTHER; PTHR11764; TERPENE CYCLASE/MUTASE FAMILY MEMBER; 1.
DR Pfam; PF13243; SQHop_cyclase_C; 1.
DR Pfam; PF13249; SQHop_cyclase_N; 1.
DR SFLD; SFLDG01016; Prenyltransferase_Like_2; 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 2.
PE 3: Inferred from homology;
KW Reference proteome {ECO:0000313|Proteomes:UP000186700};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 32..312
FT /note="Squalene cyclase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF13249"
FT DOMAIN 326..636
FT /note="Squalene cyclase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF13243"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..23
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 649 AA; 70904 MW; 765EBED820DCC7A8 CRC64;
MTELLSSKPK PTTSPTTTAG GTASDPAWAA LRRARDHLLG LQDDAGWWKG NLATNVTMDA
EDLLLRQFLG IRTAEQTAES ARWIRSQQRP DGSWATFHGG PGDLSTTIEA YLALRLAGDT
PDAPHLAAAA RFVRARGGLA ASRVFTRFWL ALFGHWPWSQ LPAVPPELVL LPHWMPFNVY
DFACWARQTI VPLSIVRALR PVRELGFGVD ELCVPTAASR PPRLRSRAGV LHRLDRVASG
YERIARGPVR RHALRRAAEW IVARQEADGS WGGIQPPWVY SLMALHLLGY PLDHPVLRAG
LDGLERFTVR EQTEDGPVRW LEACQSPVWD TALAVTALSD AGLPAGHAAL ERAGNWLLKE
EIRVRGDWAI RRPKTPVGGW AFEFENDGYA DTDDTAEVIM ALRRTGVPAE AAVLRGTRWL
LGMQCRDGGW GAFDADNTRA IVGDLPFCDF GEVTDPPSAD VTAHIVEALA AENFAGTAPV
RRGVHWLLRA QEPDGSWFGR WGANHVYGTG AVVPALVAAG VKPGHPAIRA AVDWLHAHQN
PDGGWGEDMR SYRDPSWIGR GESTASQTAW ALLALHAAGH GAGEPAQRGV RWLVDTQRPD
GGWDEPHYTG TGFPGDFYIN YGMYRLVFPI SALGRILNAR GGLTAEVVP
//