ID A0A1Y2MT20_PSEAH Unreviewed; 4082 AA.
AC A0A1Y2MT20;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 26.
DE SubName: Full=Dimodular nonribosomal peptide synthase {ECO:0000313|EMBL:OSY38353.1};
GN Name=dhbF {ECO:0000313|EMBL:OSY38353.1};
GN ORFNames=BG845_04114 {ECO:0000313|EMBL:OSY38353.1};
OS Pseudonocardia autotrophica (Amycolata autotrophica) (Nocardia
OS autotrophica).
OC Bacteria; Actinomycetota; Actinomycetes; Pseudonocardiales;
OC Pseudonocardiaceae; Pseudonocardia.
OX NCBI_TaxID=2074 {ECO:0000313|EMBL:OSY38353.1, ECO:0000313|Proteomes:UP000194360};
RN [1] {ECO:0000313|EMBL:OSY38353.1, ECO:0000313|Proteomes:UP000194360}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 535 {ECO:0000313|EMBL:OSY38353.1,
RC ECO:0000313|Proteomes:UP000194360};
RA Grumaz C., Vainshtein Y., Kirstahler P., Sohn K.;
RT "Pseudonocardia autotrophica DSM535, a candidate organism with high
RT potential of specific P450 cytochromes.";
RL Submitted (SEP-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- COFACTOR:
CC Name=pantetheine 4'-phosphate; Xref=ChEBI:CHEBI:47942;
CC Evidence={ECO:0000256|ARBA:ARBA00001957};
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OSY38353.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MIGB01000023; OSY38353.1; -; Genomic_DNA.
DR STRING; 2074.BG845_04114; -.
DR OrthoDB; 2472181at2; -.
DR Proteomes; UP000194360; Unassembled WGS sequence.
DR GO; GO:0003824; F:catalytic activity; IEA:InterPro.
DR GO; GO:0031177; F:phosphopantetheine binding; IEA:InterPro.
DR GO; GO:0043604; P:amide biosynthetic process; IEA:UniProt.
DR GO; GO:0019752; P:carboxylic acid metabolic process; IEA:UniProt.
DR GO; GO:0008610; P:lipid biosynthetic process; IEA:UniProt.
DR GO; GO:1901362; P:organic cyclic compound biosynthetic process; IEA:UniProt.
DR GO; GO:1901566; P:organonitrogen compound biosynthetic process; IEA:UniProt.
DR GO; GO:0044550; P:secondary metabolite biosynthetic process; IEA:UniProt.
DR CDD; cd05930; A_NRPS; 2.
DR Gene3D; 3.30.300.30; -; 3.
DR Gene3D; 3.40.50.980; -; 4.
DR Gene3D; 1.10.1200.10; ACP-like; 2.
DR Gene3D; 3.40.50.1820; alpha/beta hydrolase; 1.
DR Gene3D; 3.30.559.10; Chloramphenicol acetyltransferase-like domain; 4.
DR Gene3D; 3.40.50.12780; N-terminal domain of ligase-like; 1.
DR Gene3D; 3.30.559.30; Nonribosomal peptide synthetase, condensation domain; 4.
DR InterPro; IPR010071; AA_adenyl_domain.
DR InterPro; IPR029058; AB_hydrolase.
DR InterPro; IPR036736; ACP-like_sf.
DR InterPro; IPR025110; AMP-bd_C.
DR InterPro; IPR045851; AMP-bd_C_sf.
DR InterPro; IPR020845; AMP-binding_CS.
DR InterPro; IPR000873; AMP-dep_Synth/Lig_com.
DR InterPro; IPR042099; ANL_N_sf.
DR InterPro; IPR023213; CAT-like_dom_sf.
DR InterPro; IPR001242; Condensatn.
DR InterPro; IPR020806; PKS_PP-bd.
DR InterPro; IPR020802; PKS_thioesterase.
DR InterPro; IPR009081; PP-bd_ACP.
DR InterPro; IPR006162; Ppantetheine_attach_site.
DR InterPro; IPR001031; Thioesterase.
DR NCBIfam; TIGR01733; AA-adenyl-dom; 3.
DR PANTHER; PTHR45527:SF1; FATTY ACID SYNTHASE; 1.
DR PANTHER; PTHR45527; NONRIBOSOMAL PEPTIDE SYNTHETASE; 1.
DR Pfam; PF00501; AMP-binding; 3.
DR Pfam; PF13193; AMP-binding_C; 3.
DR Pfam; PF00668; Condensation; 4.
DR Pfam; PF00550; PP-binding; 3.
DR Pfam; PF00975; Thioesterase; 1.
DR SMART; SM00823; PKS_PP; 3.
DR SMART; SM00824; PKS_TE; 1.
DR SUPFAM; SSF56801; Acetyl-CoA synthetase-like; 3.
DR SUPFAM; SSF47336; ACP-like; 3.
DR SUPFAM; SSF53474; alpha/beta-Hydrolases; 1.
DR SUPFAM; SSF52777; CoA-dependent acyltransferases; 8.
DR PROSITE; PS00455; AMP_BINDING; 3.
DR PROSITE; PS50075; CARRIER; 3.
DR PROSITE; PS00012; PHOSPHOPANTETHEINE; 3.
PE 4: Predicted;
KW Phosphopantetheine {ECO:0000256|ARBA:ARBA00022450};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000194360}.
FT DOMAIN 962..1040
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT DOMAIN 2040..2118
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT DOMAIN 3721..3796
FT /note="Carrier"
FT /evidence="ECO:0000259|PROSITE:PS50075"
FT REGION 426..447
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2019..2042
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2595..2620
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 4082 AA; 431807 MW; 52DFE571ECC75D26 CRC64;
MSNPVRLPLS VAQRGVWSAQ QLFPDSTVYR VGQIIWMAGP IDPVAFVEAV DAAVTETEAL
RARFGEIDGS PVQRVDDDAT VHTAIVHEPL DDDAIVDRAR IDYRTPATGS DTLYAPWSLL
ARRSGGGWAW AFNHHHLVLD AYGVSLVVRR IAELYTARVS GAPAPERWFG TLADIVDPPD
GGPEAAEAYW RGVLDVDPGV PDDAGRLDDA FGFRPHTVPV PLGPEVRDRI AAFARPARLS
WPDALVALWG LFTARDRHSD RVAVRMPFML RDGAARLRTP SMTSRIVPLV VGIGPRTTVA
ELLQAISAQI RGVPAHTAIE DAQLARLWPG GEAAYFELPV VNIKLLDYAA DFAGTAGVEQ
TVNPGPVGRL DLSVHNDPVH GFRLDLRGRE PEYAGRAVEE HARRFADYLD AVLDLPIDTT
LVDLDRAGAP RPDGDTATGP DGIALDPPAR TVDELVRRQV ARTPGATAVV DDPTGLRWTF
AELDARVNAL AALLVERGVR VGDRVGVLLP RSPDLVVALA AVLRAGAGYL PIDPNLPGAR
IATILEDGTP CLVVTDTGGA GRAGPDVLVL DEHTGELDRG AATPPALSRE PAPDDPVYVI
FTSGTTGRPK GVEVPHRALV NRLAWGRERY PLGAGGTVLM KTPVSFDVSA PEVFAPLTEG
GSLVVAADGR HGDPEYLHEV IRRHAVHRIN FVPSMGEEFV RAGTGTLPSP TVTMVAGEAF
PAALATALGE RTGGTVLNIY GPTETGEITH HEADPTTPGR GALVPIGTPM ANTRARVLDR
WLRPVGAGMP GELYLGGDQV ATGYVGRPGR TAERFVADPG GAGQRLYRTG DLVRRTADGV
LEYLGRADDQ VKIRGHRVEP GEVAAVLERH PAVTRAAVVA QEHRTAGTRL VGYVTTVSRT
GDEPSGRAAA LRAHLAGQLP GHMVPAAIVE LDGFPVTANG KIDRRALPEP GALAEPAAAG
RPPRTGTEIA LAEAFREILG LPADTWPGVD DDFFALGGHS LLATRLLARV NAMAGTRASL
RDVFDRPTIA GLADLLGELP RAVAPAASAA ARPGRVPASA GQQALWFAEQ LGGPGGRYVV
PTVAQVTGDL DPAALTAAVR DVVARHESLR TLLRDDDGGG LVQVVVPADE AGRRLVCTME
DLSAAAPEVL DERVARVVRA RFDLAVDLPV RVAVLRTGAQ EWRWVLAVHH HAVDEWSLPV
LLRDVSVAYR ARRAGREPER APLPVQYADH AISRRAHLGR ADDPGSVLAG HLAYWRDALA
GAPEESTICA DRPRPAEPTH RGEDVGFALD GDVVSALRTV AARHGVTSFM IAQAAVALAV
SALDGTGSAD ADVVVGSPVG GRTEEGLADA VGYFANVLPF RHRFSAADRP ADVLARARET
VLDGFAHQDA PFDRIVTAAG AGRDTGRNPV YQVMLTHQQH TGESYPLVLP GADVHPDGTG
IGAVKADLDV YLTDRPDGID GFVSAATDLF DRATAERFVA VLTRALAAFA ADPEQPLARL
ELLPTDDLAR IEQWSRGTTG APAGASTVDA MLRQRIAATP TATAVVDGAS GRRWTFAQLD
ARIDAVAATL AERGVTVGDR VGVLLPRGVD LVPVLAGVLR AGAACVPLDP AHPPAHLARL
VEVARPRLVV TAPGGPELPL DADRLLDVPE VAQAPQAPPA PADRRAPSGG DPAQIIFTSG
TTGEPKGVQL PHAALANRLS WGNALLGLGA GSRALVKSGV GFVDAVTELF GPLTAGATVV
VAPDRIARDA AALWSAVRDH DITHLLTVPA LADGLGAEPH TAGDPGPDTL RHWVSSGEPL
TPGTRDTMRR LAPGAVLHNF YGSTEVTGDA TVCTVDTARA GGRVPIGRPQ PGVTARVLDG
RLRPVGPGVT GDLYLGGAQL ADGYLGRPAF TAERFVADPF AAAGGGRLYR TGDLARWTAD
GRLEHLGRAD DQVKIRGHRV EPGQARAVLR AHPAVSSAVV VAAEHPAGGM RLVGYVTVRP
GSTGPDVPGE LRAFLAERLP DHLVPSALVE LDQLPLTPNG KVDRRALPAP GPGTGAAGRA
PATDHERLLA TAFREVLALD PDTALGVDDD FFRLGGHSLL AARLLTRVNT TLASALRLRH
VFAHPTIAGL AAAVSGPARH DSSPLPPITG VVRPDPIPAS FGQQALWVLG ELGMGPAYQV
GIVLRIPGGA DVAALGRALH RFVERHEILR TRFVADDGLL TQVVTAPPQQ PGPTVRTVAA
DDVPARVGEL LAERRDLAVD GGAGFTLLQV HDRTGAGPAG RDDLLVVHGH HIVIDEGSVR
PLVRDLDALY DAELTGTPVA LPPLPVQSAE FAVWQRRLLG DRHDPGSRFR ADLEHWAKQL
ADLPVETPLP LDRPRAETTE RTIRTVRAGL GGEGSAAVDR LLSDRRATPL QGLVTALALG
LWAEGAGSTV PVGTPVELRD QPELADAIGY LVNTVVVRAD VDDSAGFGEL LADVRDRVVD
AGEHKHAPFD SVVETLAPPR IPGISPLFQV MAAFRDDHRR DDHRPRRLVP DPAVVAAAAD
DRARPALSDL VGLVVRRPDG QLDLQLNSAR ELFTAATADR LLARVHRFLV LGARYPHLPV
RHLVQLVRAA GDRLDDRADR DTADPVHSGA GPEARHPLPD LDPGDVASWN AALEYLSCTL
PGAGPLTLHV RDDGSAELIG HPDGPALAGP GGGPAAALGT VAGLAAELVA SYRTRVALTV
GREPGVHPGR PQLRPADRVR VAPADADRLR ARYGPDSRLL PLSALQSGLL YHMVRSRETG
DNNAYVSQVL REISGELDVD HLRRTVERVC ARYPNLFAAF VPLHDTEVAV VPAGATAGFR
VVGPDELAGD TAASYLERER RVPFDLTDPP LIRFTLVRHA ERAATLAMTF EHILMDGWSI
NALLAEIVDS YADPGLPERT GPASFEDYLD WLGARDTTAA HRAWDDYLAD LAGPTLLWPE
GGDLGLTRVD TGDVHRDLTP EAAAAVHAAA RRAGVTVGTL LQAAWALTLA RVTGGDDVVF
GNTVSGRPPE LPGSDRMIGL LFNTLPMRVG LRPAESVGEL LSRIRTEQLR VLDHPYAELT
RIQDATGLGA LFDTLFVVQN LPFDPFGTDE TAPGGLRVTG GTVNDATHYP VTVAVNPWER
AGHATVHVRL SYRRDALGSD AADRLTERYL HVLGELAADP NRPVARVGTY LPDERPPSPA
GAARPVREVT VADLLDEQVV RSPQETAIVA GDRSLTFAGF AAEVNRYARL LLSRGVRPEH
RVALLLPRDE RMVVAMFAVF TVGAAYVPID AEHPDERIGT MLDIARPTAT LVTTRDAARV
PGPAPGSGRG SAGELLDLDD PAVRAELAAG DPTPVTDAER GGPIEQDNLA YVIFTSGSTG
LPKGVAVGYR GLTNMYANHV EEIFDRVVAH QGGRRMRIAH TTSFSFDASW EQLFWLLNGH
EVHVIDEELR REPQRLLDHY DAARIDGFDV TPSYGQLLVD SGLLDRDRPA GRSVAADAPG
VVFVSLGGEA VPEALWRRLR EAPGVESYNL YGPTEYTINA LGADLSRSET SSVGRPIANT
RAYILDGALH PALPGVPGEL YLAGAGTARG YWAEPGRTAE RFVACPWEPG TRMYRTGDLA
RWTPEGTIDY LGRADDQVKI RGYRIEPGEV ADVLAGDPQV ARAAVIARPD PQGSTALYGY
LVSAAGEIAL DAVRGRARQL LPDYMVPAGL AAIDELPLTV NGKIEARALP DITTDAAEHV
APRTPAEAAV TGAVAELLGV PQVSATAGFF DAGGNSLLAM RLVARLNERL GSGLLVRDVF
TAQDLASIAE LIDPGSGADP GGGGSADVAG AVLMPLAPST TGRHLFCAHA RYGHASLYSA
LAGHVPPGVG VVGLQDPAHA GLDTEFGSMG ELAAVYADAV QRVQSAGPYD LLGWSFGGHI
VFAVARELVA RGEPVATITI IDTTPTGPDH VPDPGDVAPR PGVPVAADTL RQEEFLRATS
GELREVLGEQ ASAEVFADHT QLTAFAVSGL RCERHMAEPT TGGLDCPALL VAAGAPVEAD
TAAGTGIAGW TAHLPRARTV HVGDADHNAI VRPDRGLPHW AHHLTGLLQR SGPGTHNQEG
HR
//