ID R1FQA0_EMIHU Unreviewed; 2677 AA.
AC R1FQA0;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 27-MAR-2024, entry version 63.
DE SubName: Full=Fatty acid synthase {ECO:0000313|EMBL:EOD37811.1};
GN ORFNames=EMIHUDRAFT_466966 {ECO:0000313|EMBL:EOD37811.1};
OS Emiliania huxleyi (Coccolithophore) (Pontosphaera huxleyi).
OC Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Isochrysidales;
OC Noelaerhabdaceae; Emiliania.
OX NCBI_TaxID=2903 {ECO:0000313|EMBL:EOD37811.1};
RN [1] {ECO:0000313|EMBL:EOD37811.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|EMBL:EOD37811.1};
RG DOE Joint Genome Institute;
RA Read B., Kegel J., Klute M., Kuo A., Lefebvre S.C., Maumus F., Mayer C.,
RA Miller J., Allen A., Bidle K., Borodovsky M., Bowler C., Brownlee C.,
RA Claverie J.-M., Cock M., De Vargas C., Elias M., Frickenhaus S.,
RA Gladyshev V.N., Gonzalez K., Guda C., Hadaegh A., Herman E.,
RA Iglesias-Rodriguez D., Jones B., Lawson T., Leese F., Lin Y.-C.,
RA Lindquist E., Lobanov A., Lucas S., Malik S.-H.B., Marsh M.E., Mock T.,
RA Monier A., Moreau H., Mueller-Roeber B., Napier J., Ogata H., Parker M.,
RA Probert I., Quesneville H., Raines C., Rensing S., Riano-Pachon D.M.,
RA Richier S., Rokitta S., Salamov A., Sarno A.F., Schmutz J., Schroeder D.,
RA Shiraiwa Y., Soanes D.M., Valentin K., Van Der Giezen M., Van Der Peer Y.,
RA Vardi A., Verret F., Von Dassow P., Wheeler G., Williams B., Wilson W.,
RA Wolfe G., Wurch L.L., Young J., Dacks J.B., Delwiche C.F., Dyhrman S.,
RA Glockner G., John U., Richards T., Worden A.Z., Zhang X., Grigoriev I.V.;
RT "Genome variability drives Emilianias global distribution.";
RL Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000013827}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|Proteomes:UP000013827};
RX PubMed=23760476; DOI=10.1038/nature12221;
RA Read B.A., Kegel J., Klute M.J., Kuo A., Lefebvre S.C., Maumus F.,
RA Mayer C., Miller J., Monier A., Salamov A., Young J., Aguilar M.,
RA Claverie J.M., Frickenhaus S., Gonzalez K., Herman E.K., Lin Y.C.,
RA Napier J., Ogata H., Sarno A.F., Shmutz J., Schroeder D., de Vargas C.,
RA Verret F., von Dassow P., Valentin K., Van de Peer Y., Wheeler G.,
RA Dacks J.B., Delwiche C.F., Dyhrman S.T., Glockner G., John U., Richards T.,
RA Worden A.Z., Zhang X., Grigoriev I.V., Allen A.E., Bidle K., Borodovsky M.,
RA Bowler C., Brownlee C., Cock J.M., Elias M., Gladyshev V.N., Groth M.,
RA Guda C., Hadaegh A., Iglesias-Rodriguez M.D., Jenkins J., Jones B.M.,
RA Lawson T., Leese F., Lindquist E., Lobanov A., Lomsadze A., Malik S.B.,
RA Marsh M.E., Mackinder L., Mock T., Mueller-Roeber B., Pagarete A.,
RA Parker M., Probert I., Quesneville H., Raines C., Rensing S.A.,
RA Riano-Pachon D.M., Richier S., Rokitta S., Shiraiwa Y., Soanes D.M.,
RA van der Giezen M., Wahlund T.M., Williams B., Wilson W., Wolfe G.,
RA Wurch L.L.;
RT "Pan genome of the phytoplankton Emiliania underpins its global
RT distribution.";
RL Nature 499:209-213(2013).
RN [3] {ECO:0000313|EnsemblProtists:EOD37811}
RP IDENTIFICATION.
RG EnsemblProtists;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB864057; EOD37811.1; -; Genomic_DNA.
DR RefSeq; XP_005790240.1; XM_005790183.1.
DR STRING; 2903.R1FQA0; -.
DR PaxDb; 2903-EOD37811; -.
DR EnsemblProtists; EOD37811; EOD37811; EMIHUDRAFT_466966.
DR GeneID; 17283081; -.
DR KEGG; ehx:EMIHUDRAFT_466966; -.
DR eggNOG; KOG1202; Eukaryota.
DR HOGENOM; CLU_000022_36_0_1; -.
DR Proteomes; UP000013827; Unassembled WGS sequence.
DR GO; GO:0004315; F:3-oxoacyl-[acyl-carrier-protein] synthase activity; IEA:InterPro.
DR GO; GO:0031177; F:phosphopantetheine binding; IEA:InterPro.
DR GO; GO:0006633; P:fatty acid biosynthetic process; IEA:InterPro.
DR GO; GO:1901362; P:organic cyclic compound biosynthetic process; IEA:UniProt.
DR CDD; cd00833; PKS; 1.
DR Gene3D; 3.30.70.3290; -; 1.
DR Gene3D; 3.40.47.10; -; 1.
DR Gene3D; 1.10.1200.10; ACP-like; 2.
DR Gene3D; 3.40.50.1820; alpha/beta hydrolase; 2.
DR Gene3D; 3.40.366.10; Malonyl-Coenzyme A Acyl Carrier Protein, domain 2; 1.
DR Gene3D; 3.40.50.12780; N-terminal domain of ligase-like; 2.
DR Gene3D; 3.40.50.720; NAD(P)-binding Rossmann-like Domain; 1.
DR InterPro; IPR029058; AB_hydrolase.
DR InterPro; IPR001227; Ac_transferase_dom_sf.
DR InterPro; IPR036736; ACP-like_sf.
DR InterPro; IPR014043; Acyl_transferase.
DR InterPro; IPR016035; Acyl_Trfase/lysoPLipase.
DR InterPro; IPR000873; AMP-dep_Synth/Lig_com.
DR InterPro; IPR042099; ANL_N_sf.
DR InterPro; IPR018201; Ketoacyl_synth_AS.
DR InterPro; IPR014031; Ketoacyl_synth_C.
DR InterPro; IPR014030; Ketoacyl_synth_N.
DR InterPro; IPR016036; Malonyl_transacylase_ACP-bd.
DR InterPro; IPR036291; NAD(P)-bd_dom_sf.
DR InterPro; IPR020841; PKS_Beta-ketoAc_synthase_dom.
DR InterPro; IPR013968; PKS_KR.
DR InterPro; IPR020806; PKS_PP-bd.
DR InterPro; IPR009081; PP-bd_ACP.
DR InterPro; IPR006162; Ppantetheine_attach_site.
DR InterPro; IPR001031; Thioesterase.
DR InterPro; IPR016039; Thiolase-like.
DR InterPro; IPR020615; Thiolase_acyl_enz_int_AS.
DR PANTHER; PTHR43775; FATTY ACID SYNTHASE; 1.
DR PANTHER; PTHR43775:SF51; PHENOLPHTHIOCEROL_PHTHIOCEROL POLYKETIDE SYNTHASE SUBUNIT E; 1.
DR Pfam; PF00698; Acyl_transf_1; 1.
DR Pfam; PF00501; AMP-binding; 1.
DR Pfam; PF00109; ketoacyl-synt; 1.
DR Pfam; PF02801; Ketoacyl-synt_C; 1.
DR Pfam; PF08659; KR; 1.
DR Pfam; PF00550; PP-binding; 2.
DR Pfam; PF00975; Thioesterase; 2.
DR SMART; SM00827; PKS_AT; 1.
DR SMART; SM00822; PKS_KR; 1.
DR SMART; SM00825; PKS_KS; 1.
DR SMART; SM00823; PKS_PP; 2.
DR SUPFAM; SSF56801; Acetyl-CoA synthetase-like; 1.
DR SUPFAM; SSF47336; ACP-like; 2.
DR SUPFAM; SSF53474; alpha/beta-Hydrolases; 2.
DR SUPFAM; SSF52151; FabD/lysophospholipase-like; 1.
DR SUPFAM; SSF51735; NAD(P)-binding Rossmann-fold domains; 1.
DR SUPFAM; SSF55048; Probable ACP-binding domain of malonyl-CoA ACP transacylase; 1.
DR SUPFAM; SSF53901; Thiolase-like; 2.
DR PROSITE; PS50075; CARRIER; 1.
DR PROSITE; PS00606; KS3_1; 1.
DR PROSITE; PS52004; KS3_2; 1.
DR PROSITE; PS00012; PHOSPHOPANTETHEINE; 2.
DR PROSITE; PS00098; THIOLASE_1; 1.
PE 4: Predicted;
KW Phosphopantetheine {ECO:0000256|ARBA:ARBA00022450};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000013827};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 750..1212
FT /note="Ketosynthase family 3 (KS3)"
FT /evidence="ECO:0000259|PROSITE:PS52004"
FT DOMAIN 2596..2677
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT REGION 1316..1353
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2044..2107
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2515..2536
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2067..2105
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2677 AA; 269594 MW; 94C77F4EB81AEBAD CRC64;
MTTLLVQTPV DRPHLRLLLF PHAGGSAAAY EGWHDALASR LPVPVEVWAV SPPGRGARAG
EAAFPTVAAL AQAVLSELVR LELHLAPLAV FGHSLGALTA YELARRMVDA RLPPPVCLFA
SAHEPPSCGM PEAQRGLASL SDAALLRALA AFDFVPVASL EADAADAAEV AVIAPLLAPL
RADLAMREAY LDAVAAGTAP PPPPSPTLPL PVYAVGGAAD AAVDAASLQR WAALGSGGGG
CTQLPGGHFY LHSRLGGPSL APLLALVASK IAVCLAPVPA SVLCGPPLEG AVGAAVGAVG
AADAAEGAAE GAVGVEAPPV ESLSRDTTRP GHVPYVHELV LRWAAATPTA AAIVSEVGGR
LEAVSYAEVR EVVHALGGWL VSHGAAPRAT VAALLEHRAE SLLAQLAVGA AGAAFFPLEP
HLGPRSLAQL LAQATPALVD PPRLASLLAE ARATRILTTP SLLATFLELG CPPPPAAPPA
AASPAAPPAA ASPAVPPAAA SPLGCLRLWT LCGEVAPASL VERHAASLPH VQLVNDYSSW
EGCDAAYADL HPAAPDAAAG AIAPVGRPPA GVAIAVLDPD SLAPAFTGYA GAADLTTARL
LPAPPELCAL LRASLGPNSL TLTLCADHAA EAPPSPRAVR GAAGGEAELL LYRTGDLARL
LPSAAAGGEA SVQVLGRLDS TVKIRGYKAC ISALAGVGRC AVVALLDEAD EERLSLPRGL
SAAESKKLDR RSLPRPNLSL ASLRASAAAP PAAASADAAA PAADGSAVDA LEGDMLPVWR
SVLGVPDVEG GDNFFDLGGH SLLAARLARL AISLTVLDLF DAPTPAGLAR AVRARAEGAT
PRAAPAAAAP LDVASRELAV AFEHAGYAPL SGTPERTAVV AAPGIDGYMH HHLDGAPLKD
ALSPGDIFLG EVGSEKDYIA TRLSYALDLM GPSLTVNSAC SSGLLAVAHA AAPLLTGEAD
LALGGASALT FPSHGYVFEE GLVASIDGKV RPFDAAAHGT VFGDAVGAVV LKRAADADAA
ADAPLARLLG YAVTNDGARK AGYAAPGVAG QRAAVSGALG VAGVAASSLS YVECHATGTL
IGDGIEARAL TDALAASDAG AAAAASRGTF ISLGSIKGNL GHANAAAGVT GLHKALLCLS
RAELVPTAHF AKLNRNVHLA GTPLAVHQAG AAGAVPWARP TGVPGAASEA ARRCGVSSFG
IGGTNVHMIL EEPPRAAGKP AASAAAAAAA ASAAAAAAAP TRRAVHMLCV SAKSGAAARR
AAEALAAALE GEGAPPLEHV AAQLLGGREL FSKRVAVSAA TRAEAVAALR AAAAAPAPAP
APASPSPSKS SSMVDLESAG GGGGGARRRG KTPPVVLLFP GQGSQCPRMG EGIYRSEPAY
RRHVDRMCAT LRPLLGFDLR ERLFPAAGAE DADGFRAAFD APTVTQPAIF VTELALGRTL
VEEYGLAPAA LAGHSIGEFV AATLAGVLPE EDALALIATR ARLSEEAQPG GMLAVALSAA
RAAEVAAMPS HRGKVWLAVR NSGGRTVLAG DDDALLAVQA LLDSEGVRSR RLPLPRAYHT
PLMESVADGL AAVLARVRLA PPKLPLACNG GGGWMEAATA TDPSYWAGHV SRAVRWDANM
DLLAALAKEA GGDGLLAVEV GSGSSLAPLL AECTADGASA LSVLSTLRHP REDWAGGAAD
ERVFADALCG LWEARAPLTL RRARHGSRRY AAARLPGYAF EPAVHWLKPE RSMYVRPTAE
EVSAAQAALL AAAAPPSKFP LDPAGLPALQ PLRRASPGSR WVSAYCLAFA GGSTAAFAEL
AAGAPEWMEV VGVETPGKGA LADHTWPGEV APAGMQAGAA AEAEAEMIEE LAARIAADAA
GSAIVLVGWS MGGMLAAEVA LHLQAAGSPP HLLHVAGRMA PGSFVAASEA DLDSYSLASP
EVRASAAYRE WLLPTLLADL RADGRAERRV ADALRRATGG GSGAALDCAL QICCGDSDAS
FPPSAAGGWL SLTSGKRSVH TLAGGHDILT SGSAELMQLL LRRLLPQAPL YAVRWDRLPS
RAGESEAAAE ADSDDEALHA AHGERAPPAS PTPPSPPAAA PPPAAAPPPR AAPPPTKPAV
PPAAPPRLLR LGADEVGLRP RDAAALRTTL GLLVYIEPQE APAAQRAQCA ALLRLLLAAA
AAGGGRVVLV CSADTRSALA AGASKAAPLE MPELSVQRLF LPADADLLGA DRSVGSRLGA
IRPLLDGWVR WIAASSAARR EECDLLLDAA GGPPRAPRLA SLPPPPPADV PALDPAASYL
VTGGSGGLGG ALVEWLLDAQ GVPPAHVVLL SRRGAPHPRG VTSLAADLGD ADSLRGCAGL
AALGAALPRL GGLFHLAGVL DDGLISNLDE ARIGSVVAPK AGVLPLLRQL ARASASPPWV
MLSSSTSSLL GYAGQANYCA ANAIFDHAAA FGLPPPPAEL GSPPRVLTVN FGPWGEVGMA
REGTKAHALS VASGETPMAT GAAIACIAHA VRAVGAGAAS PPAASPPAAS AALSSASPLA
ASGPSSPAPA PHPEQAEPNL QFCIADVSWW RSPWPDHPLV QGVVRRSPAA VPSAAVVASG
GGAAEEGGAE AGGVLAPELD AEAKAANGRA RAEAWMRGRL NEWELETRLA ELGLDSLDLV
QLRNAFNKHF RTEVPLSVFS NANQTLAALL GRVGELI
//