Partition coefficient

In the fields of organic and medicinal chemistry, a partition or distribution coefficient (KD) is the ratio of concentrations of a compound in the two phases of a mixture of two immiscible solvents at equilibrium. Hence these coefficients are a measure of differential solubility of the compound between these two solvents.

Normally one of the solvents chosen is water while the second is hydrophobic such as octanol. Hence both the partition and distribution coefficient are measures of how hydrophilic ("water loving") or hydrophobic ("water hating") a chemical substance is. Partition coefficients are useful for example in estimating distribution of drugs within the body. Hydrophobic drugs with high partition coefficients are preferentially distributed to hydrophobic compartments such as lipid bilayers of cells while hydrophilic drugs (low partition coefficients) preferentially are found in hydrophilic compartments such as blood serum.

Partition coefficient and log P
The partition coefficient is the ratio of concentrations of un-ionized compound between the two solutions. To measure the partition coefficient of ionizable solutes, the pH of the aqueous phase is adjusted such that the predominant form of the compound is un-ionized. The logarithm of the ratio of the concentrations of the un-ionized solute in the solvents is called log P:


 * $$log\ P_{oct/wat} = log\Bigg(\frac{\big[solute\big]_{octanol}}{\big[solute\big]_{water}^{un-ionized}}\Bigg)$$

Distribution coefficient and log D
The distribution coefficient is the ratio of the sum of the concentrations of all forms of the compound (ionized plus unionized) in each of the two phases. For measurements of distribution coefficient, the pH of the aqueous phase is buffered to a specific value such that the pH is not significantly perturbed by the introduction of the compound. The logarithm of the ratio of the sum of concentrations of the solute's various forms in one solvent, to the sum of the concentrations of its forms in the other solvent is called Log D:


 * $$log\ D_{oct/wat} = log\Bigg(\frac{\big[solute\big]_{octanol}}{\big[solute\big]_{water}^{ionized}+\big[solute\big]_{water}^{un-ionized}}\Bigg)$$

In addition, log D is pH dependent, hence the one must specify the pH at which the log D was measured. Of particular interest is the log D at pH = 7.4 (the physiological pH of blood serum). For un-ionizable compounds, log P = log D at any pH for which the compound remains unionized.

Pharmacokinetics
In the context of pharmacokinetics (what the body does to a drug), the distribution coefficient has a strong influence on ADME properties (Absorption, Distribution, Metabolism, and Excretion) of the drug. Hence the hydrophobicity of a compound (as measured by its distribution coefficient) is a major determinant of how drug-like it is. More specifically, in order for a drug to be orally absorbed, it normally must first pass through lipid bilayers in the intestinal epithelium (a process known as transcellular transport). For efficient transport, the drug must be hydrophobic enough to partition into the lipid bilayer, but not so hydrophobic, that once it is in the bilayer, it will not partition out again. Likewise, hydrophobicity plays a major role in determining where drugs are distributed within the body after adsorption and as a consequence how rapidly they are metabolized and excreted.

Pharmacodynamics
In the context of pharmacodynamics (what a drug does to the body), the hydrophobic effect is the major driving force for the binding of drugs to their receptor targets. On the other hand, hydrophobic drugs tend to be more toxic because they in general are retained longer, have a wider distribution within the body (e.g., intracellular), somewhat less selective in their binding to proteins, and finally are often extensively metabolized and in some cases these metabolites may be chemically reactive. Hence it is advisable to make the drug as hydrophilic as possible while still retaining adequate binding affinity to the therapeutic protein target. Therefore the ideal distribution coefficient for a drug is usually intermediate (not too hydrophobic nor too hydrophilic).

Consumer Products
Many other industries take into account distribution coefficients for example in the formulation of make-up, topical ointments, dyes, hair colors and many other consumer products.

Agrochemicals
Hydrophobic insecticides and herbicides tend to be more active. On the other hand, hydrophobic agrochemicals in general have longer half lives and therefore display increased risk of adverse environmental impact.

Environmental
The hydrophobicity of a compound can give scientists an indication of how easily a compound might be taken up in groundwater to pollute waterways, and its toxicity to animals and aquatic life. Distribution coefficients may be measured or predicted for compounds currently causing problems or with foresight to gauge the structural modifications necessary to make a compound environmentally more friendly in the research phase.

In the field of hydrogeology, the octanol water partition coefficient, or Kow, is used to predict and model the migration of dissolved hydrophobic organic compounds in soil and groundwater.

Shake flask (or tube) method
The classical and most reliable method of log P determination is the shake-flask method, which consists of dissolving some of the solute in question in a volume of octanol and water, then measuring the concentration of the solute in each solvent. The most common method of measuring the distribution of the solute is by UV/VIS spectroscopy. There are a number of pros and cons to this method:

Pros:
 * Most accurate method
 * Accurate for broadest range of solutes (neutral and charged compounds applicable)
 * Chemical structure does not have to be known beforehand.

Cons:
 * Time consuming (>30 minutes per sample)
 * Octanol and water must be premixed and equilibrated (takes at least 24 hours to equilibrate)
 * Complete solubility must be attained, and it can be difficult to detect small amounts of undissolved material.
 * The concentration vs. UV-Vis response must be linear over the solute's concentration range. (See Beer-Lambert law)
 * If the compound is extremely lipophilic or hydrophilic, the concentration in one of the phases will be exceedingly small, and thus difficult to quantify.
 * Relative to chromatographic methods, large amounts of material are required.

As an alternative to UV/VIS spectroscopy other methods can be used to measure the distribution, one of the best is to use a carrier free radiotracer. In this method (which is well suited for the study of the extraction of metals) a known amount of a radioactive material is added to one of the phases. The two phases are then brought into contact and mixed until equilibrium has been reached. Then the two phases are separated before the radioactivity in each phase is measured. If an energy dispersive detector can be used (such as a high purity germanium detector) then it is possible to use several different radioactive metals at once, with the more simple gamma ray detectors it is only possible to use one radioactive element in the sample.

If the volume of both of the phases are the same then the math is very simple.

For a hypothetical solute (S)

D or P = radioactivity of the organic phase / radioactivity of the aqueous phase

D or P = [Sorganic]/[Saqueous]

In such an experiment using a carrier free radioisotope the solvent loading is very small, hence the results are different from those which are obtained when the concentration of the solute is very high. A disadvantage of the carrier free radioisotope experiment is that the solute can absorb on the surfaces of the glass (or plastic) equipment or at the interface between the two phases. To guard against this the mass balance should be calculated.

It should be the case that

radioactivity of the organic phase + radioactivity of the aqueous phase = initial radioactivity of the phase bearing the radiotracer

For nonradioactive metals, it is possible in some cases to use ICP-MS or ICP-AES. Sadly ICP methods often suffer from many interferences which do not apply to gamma spectroscopy so hence the use of radio-tracers (counted by gamma ray spectroscopy) is often more straightforward.

HPLC determination
A faster method of log P determination makes use of high-performance liquid chromatography. The log P of a solute can be determined by correlating its retention time with similar compounds with known log P values.

Pros:
 * Fast method of determination (5-20 minutes per sample)

Cons:
 * The solute's chemical structure must be known beforehand.
 * Since the value of log P is determined by linear regression, several compounds with similar structures must have known log P values.
 * Different chemical classes will have different correlation coefficients, between-class comparisons are not significant.

Electrochemical methods
In the recent past some experiments using polarised liquid interfaces have been used to examine the thermodynamics and kinetics of the transfer of charged species from one phase to another. Two main methods exist.
 * ITIES, Interfaces between two immiscible electrolyte solutions which for example has been used at Ecole Polytechnique Fédérale de Lausanne.
 * Droplet experiments which have been used by Alan Bond, Frank Marken and also by the team at the Ecole Polytechnique Fédérale de Lausanne. Here a reaction at a triple interface between a conductive solid, droplets of a redox active liquid phase and an electrolyte solution have been used to determine the energy required to transfer a charged species across the interface.

Prediction
QSPR (Quantitative Structure-Property Relationship) algorithms calculate a log P in several different ways:
 * Atomic based prediction (atom contribution)
 * The simplest method for prediction of log P is parameterizing the contributions of various atoms to the over all molecular partition coefficient using constrained least squares fitting to a training set of compounds with experimentally measured partition coefficients.  In order to get reasonable correlations, the most common elements contained in drugs (hydrogen, carbon, oxygen, sulfur, nitrogen, and halogens) are divided into several different atom types depending on the environment of the atom in the molecule.  While this method is generally the least accurate, the advantage is that is the most general being able to provide at least a rough estimate of a wide variety of molecules.


 * Fragment based prediction (group contribution)
 * It has been shown that the log P of a compound can be determined by the sum of its non-overlapping molecular fragments (defined as one or more atoms covalently bound to each other within the molecule). Fragmentary log P values have been determined in a statistical method analogous to the atomic methods (least squares fitting to a training set). In addition, Hammett type corrections are included to account of electronic and steric effects. This method in general gives better results than atomic based methods, but cannot be used to predict partition coefficients for molecules containing unusual functional groups for which the method has not yet been parameterized (most likely because of the lack of experimental data for molecules containing such functional groups).


 * Data mining prediction
 * A typical data mining based prediction uses e.g. support vector machines, decision trees, neural networks are usually very successful for calculating log P values when trained with compounds that have similar chemical structures and known log P values.


 * Molecule mining prediction
 * Molecule mining approaches apply a similarity matrix based prediction or an automatic fragmentation scheme into molecular substructures. Furthermore there exist also approaches using maximum common subgraph searches or molecule kernels.


 * Estimation of log D (at a given pH) from log P and pKa:
 * exact expressions:
 * $$log\ D_{acids} = log\ P + log\Bigg[\frac{1}{(1+10^{pH-pK_a})}\Bigg]$$


 * $$log\ D_{bases} = log\ P + log\Bigg[\frac{1}{(1+10^{pK_a-pH})}\Bigg]$$


 * approximations for when the compound is largely ionized:
 * $$\mathrm{for\ acids\ with\ } \big(pH - pK_a\big) > 1,\ log\ D_{acids} \cong log\ P + pK_a - pH$$
 * $$\mathrm{for\ bases\ with\ } \big(pK_a - pH\big) > 1,\ log\ D_{bases} \cong log\ P - pK_a + pH$$
 * approximation when the compound is largely un-ionized:
 * $$log\ D \cong log\ P$$
 * Prediction of pKa
 * For prediction of pKa which in turn can be used to estimate log D, Hammett type equations have frequently been applied.

Some Octanol-Water partition coefficient data
The given values are sorted by the partition coefficient. Acetamide is hydrophilic and 2,2',4,4',5-Pentachlorobiphenyl is lipophilic.

Limitations
LogP is not an accurate determinant of lipophilicity for ionizable compounds because it only correctly describes the partition coefficient of neutral (uncharged) molecules. Taking the example of drug discovery we see how the limitations of logP can effect research. Since the majority of drugs (approximately 80%) are ionizable, logP is not an appropriate predictor of a compound's behaviour in the changing pH environments of the body. The distribution coefficient (LogD) is the correct descriptor for ionizable systems.

LogP calculators
There are many logP calculators or predictors available both commercially and for free.
 * Chemistry Development Kit
 * JOELib
 * ACD/LogP DB a commercial application that calculates LogP values and includes the largest commercially available database of experimental logP values with  calculation of Rule-of-5 parameters
 * ACD/LogP Freeware Download the free logP calculator
 * ALOGPS Free online calculations and comparison of 7 logP methods
 * Free online logP calculations using ChemAxon's Marvin and Calculator Plugins - requires Java
 * miLogP free logP and Rule of Five calculator by Molinspiration
 * an overview of on-line WWW resources for logP and other PhysProp calculations