ID A0A1R3IZ72_9ROSI Unreviewed; 650 AA.
AC A0A1R3IZ72;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 27-MAR-2024, entry version 20.
DE RecName: Full=Terpene cyclase/mutase family member {ECO:0000256|RuleBase:RU362003};
DE EC=5.4.99.- {ECO:0000256|RuleBase:RU362003};
GN ORFNames=COLO4_20535 {ECO:0000313|EMBL:OMO87875.1};
OS Corchorus olitorius.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Grewioideae; Apeibeae; Corchorus.
OX NCBI_TaxID=93759 {ECO:0000313|EMBL:OMO87875.1, ECO:0000313|Proteomes:UP000187203};
RN [1] {ECO:0000313|Proteomes:UP000187203}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. O-4 {ECO:0000313|Proteomes:UP000187203};
RA Alam M., Haque M.S., Islam M.S., Emdad E.M., Islam M.M., Ahmed B.,
RA Halim A., Hossen Q.M.M., Hossain M.Z., Ahmed R., Khan M.M., Islam R.,
RA Rashid M.M., Khan S.A., Rahman M.S., Alam M., Yahiya A.S., Khan M.S.,
RA Azam M.S., Haque T., Lashkar M.Z.H., Akhand A.I., Morshed G., Roy S.,
RA Uddin K.S., Rabeya T., Hossain A.S., Chowdhury A., Snigdha A.R.,
RA Mortoza M.S., Matin S.A., Hoque S.M.E., Islam M.K., Roy D.K., Haider R.,
RA Moosa M.M., Elias S.M., Hasan A.M., Jahan S., Shafiuddin M., Mahmood N.,
RA Shommy N.S.;
RT "Corchorus olitorius genome sequencing.";
RL Submitted (SEP-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the terpene cyclase/mutase family.
CC {ECO:0000256|ARBA:ARBA00009755, ECO:0000256|RuleBase:RU362003}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OMO87875.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AWUE01017234; OMO87875.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1R3IZ72; -.
DR STRING; 93759.A0A1R3IZ72; -.
DR OrthoDB; 608at2759; -.
DR Proteomes; UP000187203; Unassembled WGS sequence.
DR GO; GO:0005811; C:lipid droplet; IEA:InterPro.
DR GO; GO:0042300; F:beta-amyrin synthase activity; IEA:UniProt.
DR GO; GO:0016104; P:triterpenoid biosynthetic process; IEA:InterPro.
DR CDD; cd02892; SQCY_1; 1.
DR Gene3D; 1.50.10.20; -; 2.
DR InterPro; IPR032696; SQ_cyclase_C.
DR InterPro; IPR032697; SQ_cyclase_N.
DR InterPro; IPR018333; Squalene_cyclase.
DR InterPro; IPR002365; Terpene_synthase_CS.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR NCBIfam; TIGR01787; squalene_cyclas; 1.
DR PANTHER; PTHR11764:SF58; BETA-AMYRIN SYNTHASE-RELATED; 1.
DR PANTHER; PTHR11764; TERPENE CYCLASE/MUTASE FAMILY MEMBER; 1.
DR Pfam; PF13243; SQHop_cyclase_C; 1.
DR Pfam; PF13249; SQHop_cyclase_N; 1.
DR SFLD; SFLDG01016; Prenyltransferase_Like_2; 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 2.
DR PROSITE; PS01074; TERPENE_SYNTHASES; 1.
PE 3: Inferred from homology;
KW Isomerase {ECO:0000256|RuleBase:RU362003};
KW Reference proteome {ECO:0000313|Proteomes:UP000187203};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 36..295
FT /note="Squalene cyclase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF13249"
FT DOMAIN 309..644
FT /note="Squalene cyclase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF13243"
SQ SEQUENCE 650 AA; 74155 MW; 5FC94C992B595F4E CRC64;
MASDGHWPAE NAGPLFLLPP LVFAVYITGH LNTVFPEEHR KETLRYIYYH QNEDGGWGLH
IEGHSTMFCT AFSYICMRIL GVGPDGGQDN ACARARKWIL DHGTITHIPS WGKTWLSILG
VFDWSGCNPM PPEFWILPSF LPMHPAKMWC YSRMVYMPMS YLYGKRFVGP ITPLIEQLRE
ELYLQPYDEI NWKKVRHCCA QEDIYYPHPP IQDLIWDSLY ICTEPLLTRW PFNKLVREKA
LQVTMKHIHY EDENSRYITN ACVEKALCML ACWAEEPNSD YFKKHLARIP DYLWVAEDGM
KMQTFGSQQW DTGFAIQALL ASNFTDEIGP VLKRGHDYIK ISQVKDNPSG DFKSMYRHIS
KGSWTFSDQD HGWQVSDCTA EGLKCCLLLS MLPPEIVGEK MKPQQLYDAV NVVLSLQSKN
GGLAAWEPAG AQEWLELLNP SDVFADIVIE HEYVECTSSA IDALVLFKKL YPAHRTNNIE
DFITNAVRYL EDVQMHDGSW YGSWGVCFIY GSYFALGGLA AAGKTYNNCL AVRKGVQFLL
RSQKENGGWG ESYKSCPEKK YVPLEEGRSN LVQTAWAMMG LIHAGQAERD PTPLHRAAKL
IINSQMEDGD FPQQEITGAS IKTCMLHFAA FRNIFPLWAL AEYCKHVPLA
//