Beta distribution

You don't need to be Editor-In-Chief to add or edit content to WikiDoc. You can begin to add to or edit text on this WikiDoc page by clicking on the edit button at the top of this page. Next enter or edit the information that you would like to appear here. Once you are done editing, scroll down and click the Save page button at the bottom of the page.

Jump to: navigation, search

Editor-In-Chief: C. Michael Gibson, M.S., M.D. [1] Phone:617-525-6884

Please Take Over This Page and Apply to be Editor-In-Chief for this topic: There can be one or more than one Editor-In-Chief. You may also apply to be an Associate Editor-In-Chief of one of the subtopics below. Please mail us [2] to indicate your interest in serving either as an Editor-In-Chief of the entire topic or as an Associate Editor-In-Chief for a subtopic. Please be sure to attach your CV and or biographical sketch.

Overview

Beta
Probability density function
Probability density function for the Beta distribution
Cumulative distribution function
Image:Beta distribution cdf.png
Parameters α > 0 shape (real)
β > 0 shape (real)
Support x \in [0; 1]\!
Probability density function (pdf) \frac{x^{\alpha-1}(1-x)^{\beta-1}} {\mathrm{B}(\alpha,\beta)}\!
Cumulative distribution function (cdf) I_x(\alpha,\beta)\!
Mean \frac{\alpha}{\alpha+\beta}\!
Median
Mode \frac{\alpha-1}{\alpha+\beta-2}\! for α > 1,β > 1
Variance \frac{\alpha\beta}{(\alpha+\beta)^2(\alpha+\beta+1)}\!
Skewness \frac{2\,(\beta-\alpha)\sqrt{\alpha+\beta+1}}{(\alpha+\beta+2)\sqrt{\alpha\beta}}
Excess kurtosis see text
Entropy see text
Moment-generating function (mgf) 1  +\sum_{k=1}^{\infty} \left( \prod_{r=0}^{k-1} \frac{\alpha+r}{\alpha+\beta+r} \right) \frac{t^k}{k!}
Characteristic function {}_1F_1(\alpha; \alpha+\beta; i\,t)\!

In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval [0, 1] parameterized by two non-negative shape parameters, typically denoted by α and β.

Characterization

Probability density function

The probability density function of the beta distribution is

 f(x;\alpha,\beta) = \frac{x^{\alpha-1}(1-x)^{\beta-1}}{\int_0^1 u^{\alpha-1} (1-u)^{\beta-1}\, du} \!
= \frac{\Gamma(\alpha+\beta)}{\Gamma(\alpha)\Gamma(\beta)}\, x^{\alpha-1}(1-x)^{\beta-1}\!
= \frac{1}{\mathrm{B}(\alpha,\beta)}\, x
^{\alpha-1}(1-x)^{\beta-1}\!

where Γ is the gamma function. The beta function, B, appears as a normalization constant to ensure that the total probability integrates to unity.

Cumulative distribution function

The cumulative distribution function is

F(x;\alpha,\beta) = \frac{\mathrm{B}_x(\alpha,\beta)}{\mathrm{B}(\alpha,\beta)} = I_x(\alpha,\beta) \!

where Bx(α,β) is the incomplete beta function and Ix(α,β) is the regularized incomplete beta function.

Properties

Moments

The expected value and variance of a beta random variable X with parameters α and β are given by the formulae:

 
  \begin{align}
   \operatorname{E}(X)   = & \frac{\alpha}{\alpha+\beta} \\
   \operatorname{Var}(X) = & \frac{\alpha \beta}{(\alpha+\beta)^2(\alpha+\beta+1)}
  \end{align}

The skewness is


\frac{2 (\beta - \alpha) \sqrt{\alpha + \beta + 1} }   
        {(\alpha + \beta + 2) \sqrt{\alpha \beta}}. \,\!

The kurtosis excess is:

6\,\frac{\alpha^3-\alpha^2(2\beta-1)+\beta^2(\beta+1)-2\alpha\beta(\beta+2)}
{\alpha \beta (\alpha+\beta+2) (\alpha+\beta+3)}.\,\!

Quantities of information

Given two beta distributed random variables, X ~ Beta(α, β) and Y ~ Beta(α', β'), the information entropy of X is


\begin{align}
H(X) &= \ln\mathrm{B}(\alpha,\beta)-(\alpha-1)\psi(\alpha)-(\beta-1)\psi(\beta)+(\alpha+\beta-2)\psi(\alpha+\beta)
\end{align}
\,

where ψ is the digamma function.

The cross entropy is

H(X,Y) = \ln\mathrm{B}(\alpha',\beta')-(\alpha'-1)\psi(\alpha)-(\beta'-1)\psi(\beta)+(\alpha'+\beta'-2)\psi(\alpha+\beta).\,

It follows that the Kullback-Leibler divergence between these two beta distributions is


 D_{\mathrm{KL}}(X,Y) = \ln\frac{\mathrm{B}(\alpha',\beta')}
                                {\mathrm{B}(\alpha,\beta)} -
                        (\alpha'-\alpha)\psi(\alpha) - (\beta'-\beta)\psi(\beta) + 
                        (\alpha'-\alpha+\beta'-\beta)\psi(\alpha+\beta)

Shapes

The beta density function can take on different shapes depending on the values of the two parameters:

  • \alpha < 1,\ \beta < 1 is U-shaped (red plot)
  • \alpha < 1,\ \beta \geq 1 or \alpha = 1,\ \beta > 1 is strictly decreasing (blue plot)
  • \alpha = 1,\ \beta = 1 is the uniform distribution
  • \alpha = 1,\ \beta < 1 or \alpha > 1,\ \beta \leq 1 is strictly increasing (green plot)
    • \alpha > 2,\ \beta = 1 is strictly convex
    • \alpha = 2,\ \beta = 1 is a straight line
    • 1 < \alpha < 2,\ \beta = 1 is strictly concave
  • \alpha > 1,\ \beta > 1 is unimodal (purple & black plots)

Moreover, if α = β then the density function is symmetric about 1/2 (red & purple plots).

Parameter estimation

Let

\bar{x} = \frac{1}{N}\sum_{i=1}^N x_i

be the sample mean and

v = \frac{1}{N}\sum_{i=1}^N (x_i - \bar{x})^2

be the sample variance. The method-of-moments estimates of the parameters are

\alpha = \bar{x} \left(\frac{\bar{x} (1 - \bar{x})}{v} - 1 \right),
\beta = (1-\bar{x}) \left(\frac{\bar{x} (1 - \bar{x})}{v} - 1 \right).

Related distributions

Applications

B(ij) with integer values of i and j is the distribution of the ith-highest of a sample of i + j − 1 independent random variables uniformly distributed between 0 and 1. The cumulative probability from 0 to x is thus the probability that the i-th highest value is less than x, in other words, it is the probability that at least i of the random variables are less than x, a probability given by summing over the binomial distribution with its p parameter set to x. This shows the intimate connection between the beta distribution and the binomial distribution.

Beta distributions are used extensively in Bayesian statistics, since beta distributions provide a family of conjugate prior distributions for binomial (including Bernoulli) and geometric distributions. The beta(0,0) distribution is an improper prior and sometimes used to represent ignorance of parameter values.

The Beta distribution can be used to model events which are constrained to take place within an interval defined by a minimum and maximum value. For this reason, the Beta distribution - along with the triangular distribution - is used extensively in PERT, critical path method (CPM) and other project management / control systems to describe the time to completion of a task. In project management, shorthand computations are widely used to estimate the mean and standard deviation of the Beta distribution:

 \begin{align}
  \mathrm{mean}(X) & {} = E(X)= \frac{a + 4b + c}{6}, \\
  \mathrm{s.d.}(X) & {} = \frac{c-a}{6},
\end{align}

where a is the minimum, c is the maximum, and b is the most likely value.

External links


WikiDoc Help Menu

Quick Start..

Editing basics

Advanced editing

Communicating your edits

Help Videos You Can Watch


Acknowledgement and Attribution Regarding Sources of Content

Some of the initial content on this page may be incorporated in part from copyleft sources in the public domain including wikis such as Wikipedia and AskDrWiki. Drug information for patients came from the The National Library of Medicine. Infectious disease information may have come from the Centers for Disease Control (CDC). Differential Diagnoses are drawn from clinicians as well as an amalgamation of 3 sources: 1.The Disease Database; 2. Kahan, Scott, Smith, Ellen G. In A Page: Signs and Symptoms. Malden, Massachusetts: Blackwell Publishing, 2004:3; 3. Sailer, Christian, Wasner, Susanne. Differential Diagnosis Pocket. Hermosa Beach, CA: Borm Bruckmeir Publishing LLC, 2002:7 .

Personal tools