ID A0A3M0LH02_HIRRU Unreviewed; 790 AA.
AC A0A3M0LH02;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 13-SEP-2023, entry version 13.
DE RecName: Full=Terpene cyclase/mutase family member {ECO:0000256|RuleBase:RU362003};
DE EC=5.4.99.- {ECO:0000256|RuleBase:RU362003};
GN ORFNames=DUI87_04243 {ECO:0000313|EMBL:RMC18357.1};
OS Hirundo rustica rustica.
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Passeriformes; Sylvioidea; Hirundinidae;
OC Hirundo.
OX NCBI_TaxID=333673 {ECO:0000313|EMBL:RMC18357.1, ECO:0000313|Proteomes:UP000269221};
RN [1] {ECO:0000313|EMBL:RMC18357.1, ECO:0000313|Proteomes:UP000269221}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Chelidonia {ECO:0000313|EMBL:RMC18357.1};
RC TISSUE=Blood {ECO:0000313|EMBL:RMC18357.1};
RA Formenti G., Chiara M., Poveda L., Francoijs K.-J., Bonisoli-Alquati A.,
RA Canova L., Gianfranceschi L., Horner D.S., Saino N.;
RT "A high quality draft genome assembly of the barn swallow (H. rustica
RT rustica).";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the terpene cyclase/mutase family.
CC {ECO:0000256|ARBA:ARBA00009755, ECO:0000256|RuleBase:RU362003}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RMC18357.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QRBI01000096; RMC18357.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3M0LH02; -.
DR STRING; 333673.A0A3M0LH02; -.
DR Proteomes; UP000269221; Unassembled WGS sequence.
DR GO; GO:0005811; C:lipid droplet; IEA:InterPro.
DR GO; GO:0016866; F:intramolecular transferase activity; IEA:InterPro.
DR GO; GO:0016104; P:triterpenoid biosynthetic process; IEA:InterPro.
DR CDD; cd02892; SQCY_1; 1.
DR Gene3D; 1.50.10.20; -; 2.
DR Gene3D; 6.20.120.20; -; 1.
DR InterPro; IPR032696; SQ_cyclase_C.
DR InterPro; IPR032697; SQ_cyclase_N.
DR InterPro; IPR018333; Squalene_cyclase.
DR InterPro; IPR002365; Terpene_synthase_CS.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR NCBIfam; TIGR01787; squalene_cyclas; 1.
DR PANTHER; PTHR11764:SF20; LANOSTEROL SYNTHASE; 1.
DR PANTHER; PTHR11764; TERPENE CYCLASE/MUTASE FAMILY MEMBER; 1.
DR Pfam; PF13243; SQHop_cyclase_C; 1.
DR Pfam; PF13249; SQHop_cyclase_N; 1.
DR SFLD; SFLDG01016; Prenyltransferase_Like_2; 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 2.
DR PROSITE; PS01074; TERPENE_SYNTHASES; 1.
PE 3: Inferred from homology;
KW Isomerase {ECO:0000256|RuleBase:RU362003};
KW Reference proteome {ECO:0000313|Proteomes:UP000269221};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 153..254
FT /note="Squalene cyclase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF13249"
FT DOMAIN 422..757
FT /note="Squalene cyclase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF13243"
SQ SEQUENCE 790 AA; 87475 MW; E6B879E00F501E9E CRC64;
MAAGAVWVPA VGSGAAVAGR SGWIPAVGSG AAVAGGSGWI RAVGSGAAVA GGSGSRRGGA
GWRPRFTVPV SGRAVRRRGG PWRTAAATEL PAWRLRCEGG RQLWRYLGDG DVGERRAQTA
LEQHSLGLDT SATLQALPAA GSAREAARNG MRFYAALQAE DGHWAGDYGG PLFLLPGLLI
TCHTAKIQLP EAFRKEMVRY LRSVQLPDGG WGLHVEDKST VFGTALNYVA LRILGLGPDD
PDIVRARVNL HNKGQPWGWY TAHPSRLWCH CRQVYLPMSY CYAKRLSAEE DELIRSLRQE
LYVQDYGSID WPAQRSNVAA CDVYTPHSWL LGAAYAVMNM YEAHHSTHLR QRAVTELYDH
IKADDRFTKC ISIGPISKTI NMLVRWFVEG KDSPAFQEHV SRIPDYLWLG LDGMKMQGTN
GSQLWDTAFA IQAFLEAEAQ EMPEFTSCLQ NAHGFLRFSQ IPENPPDYQK YYRHMNKGGF
PFSTRDCGWI VADCTAEGLK AVMLLQEKCP FIAKPVPAER LFDAVNVLLS MRNSDGGFAT
YETKRGGHLL ELLNPSEVFG DIMIDYTYVE CTSAVMQALR HFQSQFPEHR AGEIRETLQK
GLDFCRKKQR ADGSWEGSWG VCFTYGTWFG LEAFASMQHT YKNGTVCREV AQACQFLISK
QMADGGWGED FESCEQRTYV QSAESQIHNT CWALLGLMAV RYPDIGVLER GIKVLMDKQL
PNGDWPQENI AGVFNKSCAI SYTAYRNIFP IWTLGRFCRL HPKSPLVGQL PARARPSAGA
AQEEQGALSA
//