Biosynthesis of doxorubicin

Overview
Doxorubicin (DXR) is a 14-hydroxylated version of daunorubicin, the immediate precursor of DXR in its biosynthetic pathway. Daunorubicin is more abundantly found as a natural product because it is produced by a number of different wild type strains of streptomyces. In contrast, only one known non-wild type species, streptomyces peucetius subspecies cesius ATCC 27952, was initially found to be capable of producing the more widely used doxorubicin. This strain was created by Arcamone et. al in 1969 by mutating a strain producing daunorubicin, but not DXR, at least in detectable quantities. Subsequently, Hutchinson's group showed that under special environmental conditions, or by the introduction of genetic modifications, other strains of streptomyces can produce doxorubicin. His group has also cloned many of the genes required for DXR production, although not all of them have been fully characterized. In 1996, Strohl's group discovered, isolated and characterized dox A, the gene encoding the enzyme that converts daunorubicin into DXR. By 1999, they produced recombinant Dox A, a Cytochrome P450 oxidase, and found that it catalyzes multiple steps in DXR biosynthesis, including steps leading to daunorubicin. This was significant because it became clear that all daunorubicin producing strains have the necessary genes to produce DXR, the much more therapeutically important of the two. Hutchinson's group went on to develop methods to improve the yield of DXR, from the fermentation process used in its commercial production, not only by introducing Dox A encoding plasmids, but also by introducing mutations to deactivate enzymes that shunt DXR precursors to less useful products, for example baumycin-like glycosides. Some triple mutants, that also over-expressed Dox A, were able to double the yield of DXR. This is of more than academic interest because at that time DXR cost about $1.37 million per kg and current production in 1999 was 225 kg per annum. More efficient production techniques have brought the price down to $1.1 million per kg for the non-liposomal formulation. Although DXR can be produced semi-synthetically from daunorubicin, the process involves electrophilic bromination and multiple steps and the yield is poor. Since daunorubicin is produced by fermentation, it would be ideal if the bacteria could complete DXR synthesis more effectively.

Overview
The anthracycline skeleton of doxorubicin (DXR) is produced by a Type II polyketide synthase (PKS) in streptomyces peucetius. First, a 21-carbon decaketide chain (Fig 1. (1)) is synthesized from a single 3-carbon propionyl group from propionyl-CoA, and 9 2-carbon units derived from 9 sequential (iterative) decarboxylative condensations of malonyl-CoA. Each malonyl-CoA unit contributes a 2-carbon ketide unit to the growing polyketide chain. Each addition is catalyzed by the "minimal PKS" consisting of an acyl carrier protein (ACP), a ketosynthase (KS)/chain length factor (CLF) heterodimer and a malonyl-Coa:ACP acyltransferase(MAT). (refer to top of Figure 10.

This process is very similar to fatty acid synthesis, by fatty acid synthases and to Type I polyketide synthesis. But, in contrast to fatty acid synthesis, the keto groups of the growing polyketide chain are not modified during chain elongation and they are not usually fully reduced. In contrast to Type I PKS systems, the synthetic enzymes (KS, CLF, ACP and AT) are not attached covalently to each other, and may not even remain associated during each step of the polyketide chain synthesis. After the 21-carbon decaketide chain of DXR is completed, successive modifications are made to eventually produce a tetracyclic anthracycline aglycone(without glycoside attached). The daunosamine amino sugar, activated by addition of Thiamine diphosphateTDP, is created in another series of reactions. It is joined to the anthracycline aglycone and further modifications are done to produce first daunorubicin then DXR. There are at least 3 gene clusters important to DXR biosynthesis: dps genes which specifiy the enzymes required for the linear polyketide chain synthesis and its first cyclizations, the dnr cluster is responsible for the remaining modifications of the anthracycline structure and the dnm genes involved in the amino sugar, daunosamine, synthesis. Additionally, there is a set of "self resistance" genes to reduce the toxic impact of the anthracycline on the producing organism. One mechanism is a membrane pump that causes efflux of the DXR out of the cell (drr loci). Since these complex molecules are only advantageous under specific conditions, and require a lot of energy to produce, their synthesis is tightly regulated.

Polyketide Chain Synthesis
Doxorubicin is synthesized by a specialized polyketide synthase.

The initial event in DXR synthesis is the selection of the propionyl-CoA starter unit and its decarboxylative addition to a two carbon ketide unit, derived from malonyl-CoA to produce the five carbon B-ketovaleryl ACP. The five carbon diketide is delivered by the ACP to the cysteine sulfhydryl group at the KS active site, by thioester exchange, and the ACP is released from the chain. The free ACP picks up another malonate group from malonyl-CoA, also by thioester exchange, with release of the CoA. The ACP brings the new malonate to the active site of the KS where is it decarboxylated, possibly with the help of the CLF subunit, and joined to produce a 7 carbon triketide, now anchored to the ACP (see top of Figure 1). Again the ACP hands the chain off to the KS subunit and the process is repeated iteratively until the decaketide is completed.

In most Type II systems the initiating event is delivery by ACP of an acetate unit, derived from acetyl-CoA, to the active site of the ketosynthase (KS) subunit of the KS/CLF heterodimer. The default mode for Type II PKS systems is the incorporation of acetate as the primer unit, and that holds true for the DXR "minimal PKS". In other words the action of KS/CLF/ACP (Dps A, B and G) from this system will not produce 21-carbon decaketides, but 20-carbon decaketides instead, because acetate is the “preferred” starter. The process of specifying propionate is not completely understood, but it is clear that it depends on an additional protein, Dps C, which may be acting as a ketosynthase or acyltransferase selective for propionyl-CoA, and possibly Dps D makes a contribution.

A dedicated MAT has been found to be dispensable for polyketide production under in vitro conditions. The PKS may "borrow" the MAT from its own fatty acid synthase and this may be the primary way ACP receives its malonate group in DXR biosynthesis. Additionally, there is excellent evidence that "self-malonylation" is an inherent characteristic of Type II ACPs. In summary, a given Type II PKS may provide its own MAT (s), it may borrow one from FAS, or its ACP may “self-malonylate”.

It is unknown whether the same KS/CLF/ACP ternary complex chaperones the growth of a full length polyketide chain through the entire catalytic cycle, or whether the ACP dissociates after each condensation reaction. A 2.0-Å resolution structure of the actinorhodin KS/CLF, which is very similar to the dps KS/CLF, shows polyketides being elongated inside an amphipathic tunnel formed at the interface of the KS and CLF subunits. The tunnel is about 17-Å long and one side has many charged amino acid residues which appear to be stabilizing the carbonyl groups of the chain, while the other side is hydrophobic. This structure explains why both subunits are necessary for chain elongation and how the reactive growing chain is protected from random spontaneous reactions until it is positioned properly for orderly cyclization. The structure also suggests a mechanism for chain length regulation. Amino acid side groups extend into the tunnel and act as "gates". A couple of particularly bulky residues may be impassable by the chain, causing termination. Modifications to tunnel residues based on this structure were able to alter the chain length of the final product. The final condensation causes the polyketide chain to "buckle" allowing an intramolecular attack by the C-12 methylene carbanion, generated by enzyme catalyzed proton removal and stabilized by electrostatic interactions in the tunnel, on the C-7 carbonyl (see 3 in Figure 1). This tunnel aided intramolecular aldol condensation provides the first cyclization when the chain is still in the tunnel. The same C-7/C-12 attack occurs in the biosynthesis of DXR, in a similar fashion.

Conversion to 12-deoxyalkalonic acid
The 21-carbon decaketide is converted to 12-deoxyalkalonic acid (5), the first free easily isolated intermediate in DXR biosynthesis, in 3 steps. These steps are catalyzed by the final 3 enzymes in the dps gene cluster and are considered part of the polyketide synthase. While the decaketide is still associated with the KS/CLF heterodimer the 9-carbonyl group is reduced by Dps E, the 9-ketoreductase, using NADPH as the reducing agent/hydride donor. Dps F, the “1st ring cyclase” /aromatase, is very specific and is in the family of C-7/C-12 cyclases that require prior C-9 keto-reduction. These two reactions are felt to occur while the polyketide chain is still partially in the KS/CLF tunnel and it is not known what finally cleaves the chain from its covalent link to the KS or ACP. If the Dps F cyclase is inactivated by mutations or gene deletions, the chain will cyclize spontaneously in random fashion. Thus, Dps F is thought to “chaperone” or help fold the polyketide to ensure non-random cyclization, a reaction that is energetically favorable and leads to subsequent dehydration and resultant aromatization. Next, Dps Y regioselectively promotes formation of the next two carbon-carbon bonds and then catalyzes dehydration leading to aromatization of one of the rings to give (5).

Conversion to є-rhodomycinone
The next reactions are catalyzed by enzymes originating from the dnr gene cluster. Dnr G, a C-12 oxygenase (see (5) for numbering) introduces a keto group using molecular oxygen. It is an "anthrone type oxygenase", also called a quinone-forming monooxygenase, many of which are important 'tailoring enzymes' in the biosynthesis of several types of aromatic polyketide antibiotics. They have no cofactors: no flavins, metals or energy sources. Their mechanism is poorly understood but may involve a "protein radical". Alkalonic acid (6), a quinone, is the product. Dnr C, alkalonic acid-O-methyltransferase methylates the carboxylic acid end of the molecule forming an ester, using S-adenosyl methionine (SAM) as the cofactor/methyl group donor. The product is alkalonic acid methyl ester (7). Interestingly, the methyl group is removed later, but it serves to activate the adjacent methylene group facilitating its attack on the terminal carbonyl group, a reaction catalyzed by DnrD. Dnr D, the fourth ring cyclase (AAME cyclase), catalyzes an intramolecular aldol addition reaction. No cofactors are required and neither aromatization nor dehydration occurs. A simple base catalyzed mechanism is proposed. The product is aklaviketone (8). Dnr H, aklaviketone reductase, stereospecifically reduces the 17-keto group of the new fourth ring to a 17-OH group to give aklavinone (9). This introduces a new chiral center and NADPH is a cofactor. Dnr F, aklavinone-11-hydroxylase, is a FAD monooxygenase that uses NADPH to activate molecular oxygen for subsequent hydroxylation. є-rhodomycinone (10) is the product.

Conversion to doxorubicin
Dnr S, daunosamine glycosyltransferase  catalyzes the addition of the TDP activated glycoside, L-daunosamine-TDP  to є-rhodomycinone   to give rhodomycin D (Figure 2). The release of      TDP drives the reaction forward. The enzyme has sequence similarity to glycosyltransferases of the other  "unusual sugars" added to Type II PKS aromatic products. Dnr P, rhodomycin D methylesterase, removes the methyl group added previously by DnrC. It initially served to activate the adjacent methylene group, and after that it prevented its carboxyl group from leaving the C-10 carbon (see Fig 2). Had the carboxyl group not been esterified prior to the fourth ring cyclization, its departure as [ CO2 would have been favored by the formation of a bicyclic aromatic system. After C-7 reduction and glycosylation, the C-8 methylene group is no longer activated for deprotonation, thereby making aromatization less likely. Note that the non-isolable intermediate, with numbering, is the 3rd molecule in Figure 2. The numbering system is very odd and a vestige of early nomenclature. The decarboxylation of the intermediate occurs spontaneously, or by the influence of Dnr P, giving 13-deoxycarminomycin. A crystal structure, with bound products, of aclacinomycin methylesterase, an [enzyme] with 53% sequence homology to Dnr P, from streptomyces purpurascens, has been solved. It is able to catalyze the same reaction and uses a classic Ser-His-Asp catalytic triad with serine acting as the nucleophile and gly-met providing stabilization of the transition state by forming an "oxyanion hole". The active site amino acids are almost entirely the same as Dnr P, and the mechanism is almost certainly identical. Although Dox A is shown next in the biosynthetic scheme (Figure 2), Dnr K, carminomycin 4-O-methyltransferase is able to O-methylate the 4-hydroxyl group of any of the glycosides in Figure 2. A 2.35 Å resolution crystal structure of the enzyme with bound products has recently been solved. The orientation of the products is consistent with a SN2 mechanism of methyl transfer. Site-directed mutagenesis of the potential acid/base residues in the active site did not affect catalysis leading to the conclusion that Dnr K most likely acts as an entropic enzyme in that rate enhancement is mainly due to orientational and proximity effects. This is in contrast to most other O-methyltransferases where acid/base catalysis has been demonstrated to be an essential contribution to rate enhancement. Dox A catalyzes three successive oxidations in streptomyces peucetius. Deficient DXR production is not primarily due to low levels of or malfunctioning Dox A, but because there are many products diverted away from the pathway shown in Figure 2. Each of the glycosides is a potential target of shunt enzymes, not shown, some of which are products of the dnr gene cluster. Mutations of these enzymes does significantly boost DXR production. In addition, Dox A has a very low kcat/Km value for C-14 oxidation (130/M) compared to C-13 oxidation (up to 22,000/M for some substrates). Genetic manipulation to overexpress Dox A has also increased yields, particularly if the genes for the shunt enzymes are inactivated simultaneously. Dox A is a cytochrome P-450 monooxygenase that has broad substrate specificity, catalyzing anthracycline hydroxylation at C-13 and C-14 ( Figure 2). The enzyme has an absolute requirement for molecular oxygen and NADPH. Initially, two successive oxidations are done at C-13, followed by a single oxidation of C-14 that converts daunorubicin to doxorubicin.