$H(Y)=-sumlimits_{i=1}^{k}P(Y=sigma_{i});log(P(Y=sigma_{i}))$. For a second random variable $X=X_{1}X_{2}…X_{n}$, where all $X_{i}$’s are independent and equally distributed (each $X_{i}$ is a copy of the same random variable $Y$), the following equation is known to be true:
$H(X)=ncdot H(Y)$
I want to prove this simple equation, where the outcomes from $Y$ are interpreted as symbols from an alphabet $Sigma$ and therefore $X$ is the random variable for strings of length $n$ (based on the distribution of $Y$). It is easy to see, that $P(X=w)=P(X=w_{1}…w_{n})=P(X_{1}=w_{1});cdot;…;cdot; P(X_{n}=w_{n})=prodlimits_{i=1}^{n}P(Y=w_{i})$
… but my approach to prove $H(X)=ncdot H(Y)$ seems to be in a dead point
(every word $w$ has the form $w=w_{1}…w_{n}$ with $w_{i}inSigma$ and let $large |w|_{sigma_{i}}$ be the number of occurrences of $sigma_{i}$ in $w$): $H(X)=-sumlimits_{winSigma^{n}}P(X=w);log(P(X=w))$
$=-sumlimits_{winSigma^{n}}left(prodlimits_{i=1}^{n}P(Y=w_{i})right);logleft(prodlimits_{i=1}^{n}P(Y=w_{i})right)$
$=-sumlimits_{winSigma^{n}}left(prodlimits_{i=1}^{n}P(Y=w_{i})right)left(sumlimits_{i=1}^{n}logleft(P(Y=w_{i})right)right)$
$=-sumlimits_{winSigma^{n}}left(prodlimits_{i=1}^{k}P(Y=sigma_{i})^{large |w|_{sigma_{i}}}right)left(sumlimits_{i=1}^{k}large |w|_{sigma_{i}}normalsize ;logleft(P(Y=sigma_{i})right)right)$ So, I am able to change some indices from word length to the length of the alphabet, which is used in $H(Y)$. But what now? Any help?
Asked By : Danny
Answered By : Shaull
Best Answer from StackOverflow
Question Source : http://cs.stackexchange.com/questions/21525 Ask a Question Download Related Notes/Documents