probability introduction

Dirichlet Distribution

Let $Theta = {theta_1,theta_2,dots,theta_m}$, then $Theta$ is parameters of a multinomial distribution. Dirichlet distribution is a distribution over distribution. The probability density function of Dirichlet distribution is

where ${alpha_1,alpha_2,dots,alpha_k}$ is the parameters of the Dirichlet distribution.

Dirichlet Process

Dirichlet Process is discribed by a sacling parameter $alpha$ and a base distribution $G_0$, denoted as $Gsim DP(alpha,G_0)$

Assume we view these variables in a specific order, we get

Let there be only K unique values for $X_1dots X_{n-1}$, then we can rewrite the formular above as

Something interesting is the density function of $X_1,dots,X_{n}$ can be written as

Note that, as the base distribution $G_0$ is continuos, the probability that any two samples from this distribution is zero.

There are several interpretation for Dirichlet Process. One of famous interpretation is Chinese Restaurant Process. $X_n$ is the new customer. She/He is more willing to sit at a table if there are already many people sitting there, and She/He will sit at a new table with probability proportional to $alpha$.