ID Q0SE34_RHOJR Unreviewed; 4903 AA.
AC Q0SE34;
DT 05-SEP-2006, integrated into UniProtKB/TrEMBL.
DT 05-SEP-2006, sequence version 1.
DT 24-JAN-2024, entry version 122.
DE SubName: Full=Non-ribosomal peptide synthetase {ECO:0000313|EMBL:ABG94202.1};
DE EC=5.1.1.12 {ECO:0000313|EMBL:ABG94202.1};
DE EC=6.2.1.26 {ECO:0000313|EMBL:ABG94202.1};
GN OrderedLocusNames=RHA1_ro02397 {ECO:0000313|EMBL:ABG94202.1};
OS Rhodococcus jostii (strain RHA1).
OC Bacteria; Actinomycetota; Actinomycetes; Mycobacteriales; Nocardiaceae;
OC Rhodococcus.
OX NCBI_TaxID=101510 {ECO:0000313|EMBL:ABG94202.1, ECO:0000313|Proteomes:UP000008710};
RN [1] {ECO:0000313|Proteomes:UP000008710}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=RHA1 {ECO:0000313|Proteomes:UP000008710};
RX PubMed=17030794; DOI=10.1073/pnas.0607048103;
RA McLeod M.P., Warren R.L., Hsiao W.W.L., Araki N., Myhre M., Fernandes C.,
RA Miyazawa D., Wong W., Lillquist A.L., Wang D., Dosanjh M., Hara H.,
RA Petrescu A., Morin R.D., Yang G., Stott J.M., Schein J.E., Shin H.,
RA Smailus D., Siddiqui A.S., Marra M.A., Jones S.J.M., Holt R.,
RA Brinkman F.S.L., Miyauchi K., Fukuda M., Davies J.E., Mohn W.W.,
RA Eltis L.D.;
RT "The complete genome of Rhodococcus sp. RHA1 provides insights into a
RT catabolic powerhouse.";
RL Proc. Natl. Acad. Sci. U.S.A. 103:15582-15587(2006).
CC -!- COFACTOR:
CC Name=pantetheine 4'-phosphate; Xref=ChEBI:CHEBI:47942;
CC Evidence={ECO:0000256|ARBA:ARBA00001957};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP000431; ABG94202.1; -; Genomic_DNA.
DR RefSeq; WP_011595136.1; NC_008268.1.
DR KEGG; rha:RHA1_ro02397; -.
DR PATRIC; fig|101510.16.peg.2428; -.
DR eggNOG; COG1020; Bacteria.
DR HOGENOM; CLU_000022_0_8_11; -.
DR OMA; CTGVQKS; -.
DR OrthoDB; 5475787at2; -.
DR UniPathway; UPA00011; -.
DR Proteomes; UP000008710; Chromosome.
DR GO; GO:0016853; F:isomerase activity; IEA:UniProtKB-KW.
DR GO; GO:0016874; F:ligase activity; IEA:UniProtKB-KW.
DR GO; GO:0031177; F:phosphopantetheine binding; IEA:InterPro.
DR GO; GO:0043604; P:amide biosynthetic process; IEA:UniProt.
DR GO; GO:0019752; P:carboxylic acid metabolic process; IEA:UniProt.
DR GO; GO:0008610; P:lipid biosynthetic process; IEA:UniProt.
DR GO; GO:1901566; P:organonitrogen compound biosynthetic process; IEA:UniProt.
DR GO; GO:0044550; P:secondary metabolite biosynthetic process; IEA:UniProt.
DR CDD; cd05930; A_NRPS; 1.
DR CDD; cd17646; A_NRPS_AB3403-like; 1.
DR CDD; cd19540; LCL_NRPS-like; 2.
DR Gene3D; 3.30.300.30; -; 4.
DR Gene3D; 3.40.50.980; -; 8.
DR Gene3D; 1.10.1200.10; ACP-like; 3.
DR Gene3D; 3.40.50.1820; alpha/beta hydrolase; 1.
DR Gene3D; 3.30.559.10; Chloramphenicol acetyltransferase-like domain; 5.
DR Gene3D; 3.30.559.30; Nonribosomal peptide synthetase, condensation domain; 5.
DR InterPro; IPR010071; AA_adenyl_domain.
DR InterPro; IPR029058; AB_hydrolase.
DR InterPro; IPR036736; ACP-like_sf.
DR InterPro; IPR025110; AMP-bd_C.
DR InterPro; IPR045851; AMP-bd_C_sf.
DR InterPro; IPR020845; AMP-binding_CS.
DR InterPro; IPR000873; AMP-dep_Synth/Lig_com.
DR InterPro; IPR023213; CAT-like_dom_sf.
DR InterPro; IPR001242; Condensatn.
DR InterPro; IPR010060; NRPS_synth.
DR InterPro; IPR020806; PKS_PP-bd.
DR InterPro; IPR020802; PKS_thioesterase.
DR InterPro; IPR009081; PP-bd_ACP.
DR InterPro; IPR006162; Ppantetheine_attach_site.
DR InterPro; IPR001031; Thioesterase.
DR NCBIfam; TIGR01733; AA-adenyl-dom; 4.
DR NCBIfam; TIGR01720; NRPS-para261; 1.
DR PANTHER; PTHR45527:SF1; FATTY ACID SYNTHASE; 1.
DR PANTHER; PTHR45527; NONRIBOSOMAL PEPTIDE SYNTHETASE; 1.
DR Pfam; PF00501; AMP-binding; 4.
DR Pfam; PF13193; AMP-binding_C; 4.
DR Pfam; PF00668; Condensation; 5.
DR Pfam; PF00550; PP-binding; 4.
DR Pfam; PF00975; Thioesterase; 1.
DR SMART; SM00823; PKS_PP; 4.
DR SMART; SM00824; PKS_TE; 1.
DR SUPFAM; SSF56801; Acetyl-CoA synthetase-like; 4.
DR SUPFAM; SSF47336; ACP-like; 4.
DR SUPFAM; SSF53474; alpha/beta-Hydrolases; 1.
DR SUPFAM; SSF52777; CoA-dependent acyltransferases; 10.
DR PROSITE; PS00455; AMP_BINDING; 4.
DR PROSITE; PS50075; CARRIER; 4.
DR PROSITE; PS00012; PHOSPHOPANTETHEINE; 4.
PE 4: Predicted;
KW Isomerase {ECO:0000313|EMBL:ABG94202.1};
KW Ligase {ECO:0000313|EMBL:ABG94202.1};
KW Phosphopantetheine {ECO:0000256|ARBA:ARBA00022450};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000008710}.
FT DOMAIN 953..1027
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT DOMAIN 2466..2541
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT DOMAIN 3508..3583
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT DOMAIN 4566..4641
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT REGION 935..957
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 4883..4903
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 939..957
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4903 AA; 526512 MW; C94F5AE59491CD48 CRC64;
MSSPTSVGPG DSSARAFPLS PAQTGMWLAQ HVDPTVPVSV AQYVDIHGDL DRGVLTDACV
RAAAEFESFF LKVSDIDGQP LQWVDPSLDA SVGYLDFRDK PDPEASAHAW MRTETADPID
IQRDRLTVSF LLHVGHDHYF WYSRAHHLAM DGFGSVTMLY RIAALYTAAV HGLPPPASSP
VGLRRIHELE TDYRTSARRE SDRDYWTETM RGVTHASTLA RGSAPATAGS RNVGGTLTTE
AVDQLEKCAA AHGVNPATLT VCAVAGYLAR MTGADDVTLS LPVSARTTAA LRRSGGMVSN
VVPLRITSAG TGTVGDLVSR TRLVMSGALR HQRYRHEDIR ADLGSAVSGS RLFGPAVNIM
LFPEELDLGS LRSQLHVLSS GPIEDLLVNL YRYGPRGRVH LDFKANPRLY SQDELEGHHT
RFLRYFRHFL DADAAHPIGE LPVLTENETA ALVPFPGPPD SVPVTLADVF RAAATQHPDA
TAVVTAGSEI TYRELDGRSD RIAATLARLG VGSGDVVAVA LPRSSGHVCA VWAVAKTGAA
FLPVDPTYPV SRVRHMLGDS HAAVGLTSAE YTNTLPDSTE WLLLDESGGS NVHDFPTPTI
RLDDAAYLVY TSGSTGVPKG VVVTHRGIAN LVSAQRTRLD LDSAARVLHV ASPSFDASVF
EMLMAFGSGA ALVIAPPAVF GGSPLARLMT TERVTHAVIT PSVLASMDPG EVGGLRTLVV
AGEKCPPELV SRWASRCRMI DAYGPAETTV MATVSEPLAE PGPVTVGRPI RGARAVVLDH
RLRPVPVGVV GELYVAGTAL ARGYHRNARQ TSERFVADPY GPPGTRMYRT GDTVRWTYDH
ELEYLGRSDE QVNLRGLRVE PGEIDATLLR YPAVRFAVTV IRSRGAGDQL VSYVVGADTV
TEQDLLSFLS TELPPHLVPA AVVVLPDIPL TPSGKLDRTA LPEHSRRSRS RTAPRNERER
TLHALYADIL GTSAFGVDES FFALGGDSIM AIQLASRARS AGISFTPRDV FENRTISRLG
QVAGGACDRP VLAELPGGGV GDLPLTPAAR FLLDRAGVID RYAQAVVVEL PRAIDPDTLD
SVIGAVVERH DALRARLVCG ADGTAFLTVR PAGAPLPVGT ILRVPLPAHA DPAQRTRDEH
EAAVCRLNPA AGVMVQCVWL DPGPDDTTRR GRLLLVAHHL VIDGVSWRIL IGDLAAAWTQ
VKAGVPPTLA SIGTSLRRWS HGLVEEAQRP GRRSELSFWR EMTAYEDSPL GSRPLDAGRD
VNSTVDRVHT EVSADVTHSL LTDLPAAFRC EVNDALLTAL TLALAQWQRR RGRPATAPVI
RLEGHGREEA VLPGADLSRT VGWFTSIFPI RLDLGDVDLD DAFAGGHAAS TAIKLVKEQL
RAVPDKGIGY GLLRYPGGDP GQVAFNYLGR LDTAETEHDW RPVRDDAAVF SSAEPSMPVT
ALIDITAFTA RGVLDATFAY PTGAIASEDV RALTDLWEQA LTALAAVPRS PHAGGLTPSD
LPLVAVDRND IDEWERDRPG LLDVWPVTPL QSGLLFHAAL TGTDADPYAM QVILSLSGHL
DQDRLRRACS AVLAGHANLR TAFVRDSRGV PVQLVLDDVP LRWRQIDLTD VTEDPEDILR
DDRTTRFDMS APPLVRFTLL SRGPTRADLA ITAHHIVLDG WSMPLLVREL FTAYPRDDVR
TESPDSSPYR DYLEWLTAKD TEASARAWVA ALDGITEPAV AAEPTPTTSE AVLELSEEGT
RKLTARATEV GVTLSTIVQA AWGIVLGQAT ARDDVVIGAT VSGRPADVAG VESAVGLFVN
TVPVRIPLDP DIDTATLVTR LQADHVRLLE HHHLGLADIQ HAVGVAPLFD TLVAFESYPV
DRTALPEPVD GLTLDGVRAH DASHYAVTLA VTVSRTVRLT AKCRSHRYDE KALLARVGRV
LECIGRDPHV VLGGLDLLGD QERERVLRTW NATDAPVPSQ TLVDLFDAQV ERSPEAVAVI
FGPDRLTYAE FDARVNRLAR YLVAKGIVPE TVVGVAVSRS IQLLVALHAV LKAGGAYLPM
DPEHPLGRTA LVIDSAAPAL VLTSGSHNRA VPHGVAVVDL DLLDLTGYDA RSITNSERRA
PLHPANAAYV IYTSGSTGRP KGVTVTHEMM VNQFRWAQTL TPLDGSDAVL HKTPLTFDIS
AWELLWPLHT GARVVIAAPD GHRDPRYLAR MVAQESITTL HVVPSMLDAF LEQCGPRELA
PLRRVYSAGE PLSAATASRF EERSAAALYN WYGPCEAAAA TSESLDGNEF GTSVAIGRPI
HNIRTYVLDS RLRPVPVGTR GELYLAGAYL ARGYAGRPDL TAERFVADPF GETGGRMYRT
GDVVRWNESG RLEYLGRTDF QIKLRGQRVE PGEIESALTS DPAVTHAAVT VHRDAEAGDR
LIAYVVAADG VPPDERRIVD RLAGLLPAYM IPSAVVPLPA LPLTTSGKLD RAALPEPAPV
ASAFDPPATP TEDVVAFVFA EVTGTGRVGR HDNFFAAGGN SLTAAQAVAR VGAALGTSVD
IRALFDHPTV AELSEYLANG SHRTGRAPLV PRAPTAHVPL SLAQQRLWFL NRLDGRSATD
SIPVALRLTG TLDESALCAA VGDVIERHEP LRTVYPEHDG IPCQQVLPAS AVPPCTVRTT
TDPVGRVRDL LSTGFDLATE PPFRTELLAV SADEWVLALV VHHISADGFS LGPLARDVMA
AYRARAQGNS PVWTPLPVRY ADYALWQREL QDGQRGLAYW TQTLSGLPEL LPLPFVRGRP
SVATHRGANV EFILDAEIHR RIRALARAHD ATPFMVVHTA LAVLAARLGD TSDIVVGTPI
AGRGPRALDD LVGMFVNTLV LRTDVPLHTT FADTLGRVRD VDLAAYAHAD VSFEEVVEAV
APARSRGRNP LFQLALAYHN TAPVVIDLPE VSATIVEIET HTAKFDLQLT VTESVDDADA
PAPLSCVFTY ATDLFDDATV SGFADRFQRI LRAALAQPSV VIGDIDVRDL AEQQARHHGP
PSVPQRTLPD LMSAAARQNP GGPALTADGR SMAYRELDAE SNRLARALMR RGVGPEAFVA
LGVPRSVLSV VSVWAVAKTG AAFVPVDPHY PAARVRYLMD DSGAVLGLTT AADREALPDG
TDWLLLDDPD FRTECRGYSS APVTDADRAR PLDARHPAYV IYTSGSTGNP KGVVVTHTGL
ADFTAEQRER YSVTASSRTL HASSPSFDGS VLEILLALGA GACMVLAPPA VQADDQLTDL
LARERVSHVF TTPTVLATVD PRGLHDLRVV VAGGEPCPPE LVAVWAAQEQ MYNGYGPTET
TVMTSISDPM AAGGPVTIGR PIRGAAVLLL DSRLHPVPTG VAGELYISGP GVARGYHRRP
ALTAARFVAS PFGASGERMY RTGDVGRWRR DGSIEYLHRN DSQIELRGVR IELGEIDATL
LAHPTVRFAA TDVRELRDGV DALVSYVVPA RNETVDPDQL IRFAATRLPM PLVPAIVVVV
DAVPLTVNGK LDRSALPDPV PAATEFEAPH TPAERVVATV FAEVLGLDLV GREANFFALG
GSSLSATRAA SRIGAALETT VPVRAVFDTP TVSALAASVT EQTDGATHVA LTVRPRPARV
PLSPAQQRLW FLARLAPGSP EYNIPMALRL TGALDVAALT AAVADVISRH EPLRTVYPDK
DGVAYQQLIP VSRAAPILRP IPVSRGDVAA RVATFASKGF ALGAEPPFRA RLFAPAPDEW
VLAIMVHHIS TDGFSVPVLV RDVMTAYEAR VSGEAPRWAD LPVQYTDYTL WQHELLGDRD
DPQSVAAQQL SYWKDTLSGA PEQLSLPADR PHPAIASHRG GTVEFAIDDA LYRAVEHTAT
ALAVTPFMIV HAAFAVLLAR LSGSTDIVIG TPVAGRGDQA LDDLVGMFVN NLVLRTEVIP
AESFAELLGR VREVDLDAFA HSTVPFETVV EVLDPARSRA RNPLFQVALA FQNLDHTPLT
LPGLRVSAMT PPTRSARFDL QLTVTADPDP STERTRHAAV FTYATDLFDH HTVADIGERF
IRMLSEAVAQ PDTAVGDLAL LDPDPDVPTA TPSSGETLVD LFERQVAAAP DAIAVVSGDR
TLTYRELDEQ VNRLARHLIG AGVGPESMVG LAIRRSVDLL VGMYAVATAG GAYVPIDPDQ
PGQRNAQVIR SSATRVVLTT ERDRFDLSDV LPAETRLVVI DRVDLSGVDP SPISDLDRIA
PLLAQHPAYV IHTSGSTGTP KGVVVTHAAV VSFLSWRQDT DPLGADDTVM LKLQYTFDAS
VREFWWPLIA GARMVIARPD GHRDPRHLAE LIGRHRVTAG YFLPLMLAEI LAIPEADLTS
LRQVSCGGEV LPPGTAHSVH ARCPDAVLYN EYGPTETAVA VTRTVVGTEA ATVPIGVPQN
GVGVLVLDSR LHPVPIGVPG ELYLAGAQLA RGYLGQPGAT AARFPANPWG PPGQRMYRTG
DVVSRRRDGT IDYLGRRDLQ VKIRGQRVEL EEIETALRRH AAVAQSAAAV YESAGTGARL
VGYVVPHPDM VVDTRAVSAE LAQRLPRYMV PNQIVVLGAL PSTPHGKLDR RALPDPGVPR
AQRFRPPTTR TEKLVTAVFA DVLGTGLVGL DDNFFELGGN SLSATRAASR LHTSTRVEVR
LDWFFADATA ESIAARISSS FAAASAASAG LGVLLPLHPD GGREPLFCIH PAIGLAWCYA
GLGAHLGPDR PVWGVQSPAV TVPGERFASI TQRAHRYVEE IRRVQPRGPY HLLGYSVGGV
IAHAMAVELR SRGEEVGVLA MIDSYAAAER DTPAPTLPEL LTEFGGGASE HLGPEQLTRL
YEDYVDVVDA AADFVPGHFD GDLVFFGAAG NGSQPAPASE TWRPWIDGDV IAHTVDRPHT
RMTDPEALAV IGPILAEYLA GASATDGTEN PPPTQRETAQ RHE
//