ID A0A164PBK0_9NOCA Unreviewed; 4884 AA.
AC A0A164PBK0;
DT 06-JUL-2016, integrated into UniProtKB/TrEMBL.
DT 06-JUL-2016, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE RecName: Full=Polyketide synthase {ECO:0008006|Google:ProtNLM};
GN ORFNames=AWN90_18365 {ECO:0000313|EMBL:KZM75356.1};
OS Nocardia terpenica.
OC Bacteria; Actinomycetota; Actinomycetes; Mycobacteriales; Nocardiaceae;
OC Nocardia.
OX NCBI_TaxID=455432 {ECO:0000313|EMBL:KZM75356.1, ECO:0000313|Proteomes:UP000076512};
RN [1] {ECO:0000313|EMBL:KZM75356.1, ECO:0000313|Proteomes:UP000076512}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IFM 0406 {ECO:0000313|EMBL:KZM75356.1,
RC ECO:0000313|Proteomes:UP000076512};
RA Evans L.H., Alamgir A., Owens N., Weber N.D., Virtaneva K., Barbian K.,
RA Babar A., Rosenke K.;
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KZM75356.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LWGR01000003; KZM75356.1; -; Genomic_DNA.
DR RefSeq; WP_067582582.1; NZ_KV411303.1.
DR STRING; 455432.AWN90_18365; -.
DR OrthoDB; 4516163at2; -.
DR Proteomes; UP000076512; Unassembled WGS sequence.
DR GO; GO:0004315; F:3-oxoacyl-[acyl-carrier-protein] synthase activity; IEA:InterPro.
DR GO; GO:0031177; F:phosphopantetheine binding; IEA:InterPro.
DR GO; GO:0006633; P:fatty acid biosynthetic process; IEA:InterPro.
DR CDD; cd08952; KR_1_SDR_x; 2.
DR CDD; cd00833; PKS; 3.
DR Gene3D; 1.10.287.1960; -; 1.
DR Gene3D; 3.30.70.3290; -; 3.
DR Gene3D; 3.40.47.10; -; 3.
DR Gene3D; 6.10.140.1830; -; 2.
DR Gene3D; 1.10.1200.10; ACP-like; 3.
DR Gene3D; 3.30.70.250; Malonyl-CoA ACP transacylase, ACP-binding; 1.
DR Gene3D; 3.40.366.10; Malonyl-Coenzyme A Acyl Carrier Protein, domain 2; 3.
DR Gene3D; 3.40.50.720; NAD(P)-binding Rossmann-like Domain; 3.
DR Gene3D; 3.10.129.110; Polyketide synthase dehydratase; 1.
DR InterPro; IPR001227; Ac_transferase_dom_sf.
DR InterPro; IPR036736; ACP-like_sf.
DR InterPro; IPR014043; Acyl_transferase.
DR InterPro; IPR016035; Acyl_Trfase/lysoPLipase.
DR InterPro; IPR018201; Ketoacyl_synth_AS.
DR InterPro; IPR014031; Ketoacyl_synth_C.
DR InterPro; IPR014030; Ketoacyl_synth_N.
DR InterPro; IPR016036; Malonyl_transacylase_ACP-bd.
DR InterPro; IPR036291; NAD(P)-bd_dom_sf.
DR InterPro; IPR032821; PKS_assoc.
DR InterPro; IPR020841; PKS_Beta-ketoAc_synthase_dom.
DR InterPro; IPR041618; PKS_DE.
DR InterPro; IPR042104; PKS_dehydratase_sf.
DR InterPro; IPR020807; PKS_DH.
DR InterPro; IPR049551; PKS_DH_C.
DR InterPro; IPR049552; PKS_DH_N.
DR InterPro; IPR013968; PKS_KR.
DR InterPro; IPR020806; PKS_PP-bd.
DR InterPro; IPR009081; PP-bd_ACP.
DR InterPro; IPR006162; Ppantetheine_attach_site.
DR InterPro; IPR002347; SDR_fam.
DR InterPro; IPR016039; Thiolase-like.
DR PANTHER; PTHR43775; FATTY ACID SYNTHASE; 1.
DR PANTHER; PTHR43775:SF51; PHENOLPHTHIOCEROL_PHTHIOCEROL POLYKETIDE SYNTHASE SUBUNIT E; 1.
DR Pfam; PF00698; Acyl_transf_1; 3.
DR Pfam; PF16197; KAsynt_C_assoc; 3.
DR Pfam; PF00109; ketoacyl-synt; 3.
DR Pfam; PF02801; Ketoacyl-synt_C; 3.
DR Pfam; PF08659; KR; 3.
DR Pfam; PF18369; PKS_DE; 2.
DR Pfam; PF21089; PKS_DH_N; 1.
DR Pfam; PF00550; PP-binding; 3.
DR Pfam; PF14765; PS-DH; 1.
DR PRINTS; PR00081; GDHRDH.
DR SMART; SM00827; PKS_AT; 3.
DR SMART; SM00826; PKS_DH; 1.
DR SMART; SM00822; PKS_KR; 3.
DR SMART; SM00825; PKS_KS; 3.
DR SMART; SM00823; PKS_PP; 3.
DR SMART; SM01294; PKS_PP_betabranch; 3.
DR SUPFAM; SSF47336; ACP-like; 3.
DR SUPFAM; SSF52151; FabD/lysophospholipase-like; 3.
DR SUPFAM; SSF51735; NAD(P)-binding Rossmann-fold domains; 6.
DR SUPFAM; SSF55048; Probable ACP-binding domain of malonyl-CoA ACP transacylase; 3.
DR SUPFAM; SSF53901; Thiolase-like; 3.
DR PROSITE; PS50075; CARRIER; 3.
DR PROSITE; PS00606; KS3_1; 2.
DR PROSITE; PS52004; KS3_2; 3.
DR PROSITE; PS00012; PHOSPHOPANTETHEINE; 3.
PE 4: Predicted;
KW Phosphopantetheine {ECO:0000256|ARBA:ARBA00022450};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000076512};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 33..455
FT /note="Ketosynthase family 3 (KS3)"
FT /evidence="ECO:0000259|PROSITE:PS52004"
FT DOMAIN 1480..1555
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT DOMAIN 1575..2006
FT /note="Ketosynthase family 3 (KS3)"
FT /evidence="ECO:0000259|PROSITE:PS52004"
FT DOMAIN 3034..3109
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT DOMAIN 3136..3561
FT /note="Ketosynthase family 3 (KS3)"
FT /evidence="ECO:0000259|PROSITE:PS52004"
FT DOMAIN 4735..4810
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT REGION 1182..1203
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2546..2592
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4884 AA; 501336 MW; 913FC5034627F36C CRC64;
MDTESRLRDY LKRVTTDLRA ARRELETERG RRTEPLAVVG MACRFPGGIA TPEAFWAALR
GGRDMVGPFP TDRGWDLEHL FDADPDHLGT SYTSQGAFLD EAGQFDAGFF EISPREALAM
DPQQRLLLET SWEALESAGI DPRGLSGQQI GVFVGVSNQG YGTPGPAEVE GHVLTGTSGA
VVSGRLAYVF GLEGPTVTVD TMCSSSLVAL HLAAQAVRAG ECDAALVGGA TIMGTARNFV
EFSRQRGLAT DGRCKPFSAD ADGTGWGEAV GTVFLERLST ARRAGHPVLA VLRGTAINSD
GASNGLTAPN GPSQQRVIGA ALRRSGLRPD EIDVVEAHGT GTELGDPIEA QALLATYGRD
RDRPLLLGAL KGATGHTQAA SGVAALIKMV LALRHGYLPG ILHLNALSPH VDWSAGAVEP
LVGGREWAAV EDRPRRAGIS SFGGSGTNAH VVLEEAPAPE EAETGSVATI SGPVAWPLSA
RDAEALSAMA ANLAAVVGDS DARQVAAALC ARSTFDHRAV LIDPGTEAVA RLGELAAGRA
GDGVVRGRVR AGGDAPVFVF PGQGAQWAGM GAELLDGTGR SAEVFARRLA ECSAAVEAAG
GPDVVAVLRD GGERSLDDVG VVQPVSWAVM VALAAVWEDA GVAPGAVIGH SQGEIAAAVV
AGALSVADGA RVVTARALAL RAVAGSGAMA SLGETPEQAA ERIRELDGVE IAAINGPAAV
VVAGPVEGVQ AAIAAAEAEG RRAKLIPVDY ASHTPGMEVL REPVLAALDG LTPSAPRVPW
LSTHDVDWID SDSADAAYWF ANLRDTVRFA AGVAALLDAG YDSFLEISAH PVLVPAIEDV
ADATGVEIAA GGTLRRGEGG ATRVLTALAE AWVGGVAIEW ERFVAGVDPR AVSLPTYPFQ
RRPYWLAPTA AAADADPADA AFWNAVADND IAALVEALPG DAIDSGREAA AVLAEAVPLL
ARWRRGRADK NTRDGWRYRI CWQPHPDTTA TATGTWMLVR PTTFAAGDAV VTRVRDALAA
AGIEVVDVAV EPTADRADIA ALLGEFAAEP DGVVSLLAAA ESDHPEYRGL PVGVVTTLGL
LQGLGDKGIA APLWAVTVGA ETTAADDHLT RPLQAAVWGL GRVAALEHPD RWGGLVDLPE
ADPASDDPVR LLPAVLAHPV EDQVALRAAG ALVRRLQRAT RPAREAANGR RTRGTALITG
GTGGLGAHTA RMLARSGTAH LVLLSRRGPE AEGAAELCAE LEAMGPRVTI VACDADDPAA
VAAVVERIEA EGETIRTVVH TAGVGILVPL AETGLEQFAA GAGGKLSGAR VLDALFDGER
GRELDAFVLF SSVAGLWGAG DHGAYSAANA VADAIASARR ARGLVGTSIP WGIWEASGGG
MGRDVISTQL KWLGIRFMPA TLAIDAMADA LEDDETLLAI ADIDWDTFAP VFTAARRRPL
LDGVPDVAAA LGRTAAAEAD TDGPGSALRA RMAAATDPRR VVLDAVRDAV AAALGFAARD
EVDPDRAFRE LGIDSLTAVA LRNTVTAETG VRLPVTVVFD HPTVTALTDH LLAELGVAAV
DIEHDATAPA ATATDDPIVV VGMACRFPGG IRTPDQLWQV LHDGVDVIGG FPTDRGWDLD
GLYDPDPDRE GRIYTRSGGF LHDAAEFDPE FFGISPREAL AMDPQQRLLL ESSWEALERA
GIDPRAAADA RTGVFIGAAY QGFGGTVGST DAPVGPEGAE GHIVTGLATS VASGRISYSF
GFEGPAVTID TACSSSLVAL HQASQSIRDG DCDRALVGGV AVMVAPVGLL GFSRQRGLSE
DGRCRAFSAD ADGMGLAEGA GMLVIERLSV ARAAGHPVLA VVRASAINQD GASNGLSAPS
GKAQERVIRA ALRRAGLTAD DVDVVEAHGT GTTLGDPIEA GALLATYGRD RDPERPLWLG
SVKSNIGHTQ AAAGAAGLMK MVLALQHSEL PATLHADNPS PYIDWESGAV RLLTEARPWA
ADGRPRRAGV SAFGVSGTNV HVILEEPPTR DPEPVPVAPI PVPWVVSART EAALDEMIAG
VGGLPADGPG AQAAAAVLAR KTAFEHRAVL DAATGAVLAR GRVAARGSGT VFVFPGQGAQ
WAGMGRELLA ATDGPGAVFA ARFAECADAI AAVSDIDARA AVADTSGAAL EDVAILQSVS
WALMVALAAM WESVGVRPAA VIGHSQGEIA AAVVAGALSL ADGARVVTAR ALALRGVAGT
GAMGSIGEGI DAVRARLAGR DSVVVAVVNG PASVVVAGTP EEVEAVLAEA AADGVRTRLL
PVDYASHSPL MQPLAETITG ALAGIGGSTP DIPWYSTARP GWVDTAPEPG YWFANLSGTV
HFSDAVAAVA AAGYDAFVEI GTHPVLVAAV LETADAAGHE VTASGTLRRD EGDLNRVVAA
LAEAWTAGVA VDWTRVLPGD RSAAAAVPTY PFQRKRLWLN AIETAAPDAA AGDAEFWAVV
ERQDPAELAR TLGTDADAVS GLLPTLAAWR RRHDRDGAVA AWRHRVRWVS ATAPGHSRLT
GHWLVLTGTA TDVRAAGTAT DVRAAGTATD ARAAGTATDA RAAGTATDAR AAGTATDARA
AGTATDARAD ETATDARADE VVAALTAAGA EVTLRRFDPA AAEVRLEDGT DPAGILLLPD
TEPVGAVPGG VLTLAALLRA TVGSTAAVWC ATRAAIPATD ADPVDERAAG VWGLGRIAAQ
EYPDRWGGLV DLPARLDDRT GGLLAAVLAG AGALAGEDQV AVRPAGLFVR RLEHASRAAA
RSAWTPRGTV LVTGGTGALA GHVARWLAGA GAQRLVLAGR RGPDAPGAAE LRAELIDQGA
VVDVVACDVT DRAALAALLD TYRPDAVVHT AGIVDDELIA DLDADRAAAV CAPKVLAAQY
LDELTRDREL DAFVLFSSMA GALGGSGQGA YAAANATLDA LAEWRRREGL PATAIGWGAW
AGDGLAEAVS DRLRGQGVLP MDPESAVAAM AAAVGSGAAH VIVADVDWAR HAEVLTASRP
LPALAGIPEA APAPAAQESD VPILAGLDAE ERRAAVRRLV RTEVATALGL DGPGDVVSDR
TFRDLGFDSL TAVDLRNRLV RATGVRLPVT LVFDYPTVDE LADHLLARWS ADLGVAVTPT
EPTADTLAPR PVPTGEDVIA IVGMACRLPG GVTSPQDLWD LLERRGDAVV AFPTDRGWDV
AGRYHPDPEH RGTFATTGGG FLDDPAGFDA EFFGISPREA LTIDPQHRLL LETSWEAFER
AGIAPASLRG SRTGVFVGSN YHDYGSRLSA EPGIYEGQLA TGSAGSVASG RVAYSFGLQG
PAVTLDTACS SSLVAMHMAA QALRTGECSL ALAGGVTVIS SLDTFIEFSR QGALSPDGRC
RAFAEDADGA GWAEGVGVVV LERLADARAA GHPVLALLRS SAINSDGASN GLTAPNGPAQ
QRVIRDALAA GDLSPADIDA VEAHGTGTRL GDPIEAQALL NTYGAVERAH PLWLGALKSN
IGHTQAAAGI AGVIKSVLSL RHGRFPATLH AETPSSRIEW DSGAVRLAQS AVELPDAGRP
WRIGVSSFGI SGTNAHVIVE QAPAEAAAQA VAEPEVVPWQ VSARSPETLR EALDAMASRW
TPDVPRRAVA AALRHRGVFD HRGVVLAGAT AEPSVVGGAV VPGPTGILLS GQGSQRLGMG
RLLRESFEPF ATAFDAARAA VDAHLTGSIA DVIDGDDAEL LAGTGWAQPA LFAYHVAGYR
LLESWGLAPG VLVGHSVGEI AAAHLSGALS LADAARLVAA RATAMAALPE GGAMAAISAT
ADRLAALTAD LPEGLSVAAH NSATNLVLSG PAEILERVLA ERADGLRVSR LVTSHAFHSP
LMAPAAAAVE RVAAELEWHD PALPVISTHT GVAVDRAAWA DPRHWSGQLT APVRFAAAVA
EAMRAHGVGR WLELAPHATL TGHVAADHPA VVTACLGDKN IREPLAAQRA AATLWVAGAD
LPGWPGEPAV PAAAVLPTLP TYPFAHQRYW LDAPVPVTPE SLGLAATGQL VLAGHLALAG
ADEHLFTGRL SVTSHRWLAD HAVGGAAIVP ATAYLELALD AAARSGAGAV RELTVQVPLV
LPEAGGVDVQ ARVHAAAEDG SRMLTVDARE DGGDWVRHAE GTLGAVDAGT AELPGAWPPA
GAAPLTVTGL YERMADGGFA YGPAFRGLRA AWRDGDAVLA EVALPEHIAT DAARAALHPA
LLDAALHTIA LDRDPGDGAV MPFSVRSVRI DRRGAAALRV RMRATGPNTV ALDLADAAGT
PIGRVEEVAL RPVPRAVLAG ARSRTMYRVD WVPASTPVTA PAAVEFHTDL MGVGGEIGGT
VVVPAPIPAD GTLPERTAAA TRAALLLVQR WLTLPQRADA RLVVLTTAAC AAGDHAPDPA
AAAVLGLVRA AQAEHPDRFV VLDHAPASVP GHEVTEAAVG AAVASGEPVL AWRDGQLRVP
RLTPAASPAA ADPAWRGTVL VTGGTGGLGA AVARHLAARP EVERLVLTSR RGPAADGAED
LRAELAGLGA QVEIVAADLT TDDGVAAAIA AADGRVDSVV HAAGVIDDGA IESLTPERIA
PVLAPKVDAV TRLAERLPQA RLVLFSSLSG TFGGVGQANY SAANAALDAL ATRWRGGGRE
VVSIAWGLWA VRSGMTGELS AADRARLARG GVVPMDTAES LALLDAAVAA GTATVVAARF
DIPALRAASG GVPALLSAIA PAAPAVAASA ALPATASATG VLDRLRGVDE EERAEILLDL
VRTEAALVLG HSSIDAIPVD QGFLDVGFDS LTAVELRNRL GAATGLRLPA TMLFDYPNMR
RLAGLLDELL PADEHGPGLA EIARLEGIAR GLNGDDRARQ ALVQRLQDVL GLLGAGAEPA
DPAELIESAS DGELFDIIDG LGVD
//