The Dirichlet (also Beta) distribution

Here's a category of distributions we may often want: a distribution on the simplex.

I.e. a multivariate distribution on n nonnegative numbers that add up to 1. This is something that we can definitely see using as prior distributions on parameters that can be interpreted as "probabilities" of something. One can see that this is important for the Categorical and Multinomial distributions, for instance: and in the case of two numbers (i.e. a univariate distribution, since it's on a line segment), for the Bernoulli and Binomial distributions.

Here's one such distribution family that may come to your mind: for xi in the simplex ixi = 1,

f(x1,…xnθ1,…θn) ∝ ∏ixiθi

By adjusting the values of the $α$s, one can get suitable priors that represent our beliefs correctly. This is known as the Dirichlet distribution, and its univariate case f(x|θ1,θ2) ∝ xθ1(1−x)θ2 is known as the Beta distribution.

In fact the parameters of said distributions are usually provided a bit differently, with α − 1 = θ.

*Exercise:* Prove that: