
Dirichlet Distribution
Let $Theta = {theta_1,theta_2,dots,theta_m}$, then $Theta$ is parameters of a multinomial distribution. Dirichlet distribution is a distribution over distribution. The probability density function of Dirichlet distribution is
where ${alpha_1,alpha_2,dots,alpha_k}$ is the parameters of the Dirichlet distribution.
Dirichlet Process
Dirichlet Process is discribed by a sacling parameter $alpha$ and a base distribution $G_0$, denoted as $Gsim DP(alpha,G_0)$
Assume we view these variables in a specific order, we get
Let there be only K unique values for $X_1dots X_{n-1}$, then we can rewrite the formular above as
Something interesting is the density function of $X_1,dots,X_{n}$ can be written as
Note that, as the base distribution $G_0$ is continuos, the probability that any two samples from this distribution is zero.
There are several interpretation for Dirichlet Process. One of famous interpretation is Chinese Restaurant Process. $X_n$ is the new customer. She/He is more willing to sit at a table if there are already many people sitting there, and She/He will sit at a new table with probability proportional to $alpha$.




近期评论