Jonathan Ultis | 1 | 05-08-2001 01:33 AM ET (US)
Let me offer my humble apologies to everyone for making you read this paper. It's very difficult to parse even though the ideas are good. The pictures and charts are also pretty poor.
Basically, this paper deals with using a particular type of Bayesian network and a version of the EM algorithm to cluster terms that frequently occur together. I'm not going to talk about the details of the EM algorithm that the author uses, so don't worry about working through it in detail.
Similarly, don't worry about Figure 4. It is meant to show that annealed EM helps prevent overfitting, but the paper doesn't explain it well at all.
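For anyone who wants a rough feel for what annealed EM means, here is a minimal sketch. It is not the author's exact algorithm. It assumes a simple latent-class model in which each (document, term) count is explained by a hidden cluster z with parameters P(z), P(d|z) and P(w|z), and it uses one common form of annealing: the likelihood part of the E-step is raised to a power beta <= 1, which softens the cluster assignments. The function name, the parameters and the toy counts are all made up for illustration.

```python
import random


def tempered_em(counts, n_clusters, beta=1.0, n_iter=50, seed=0):
    """Tempered EM for a toy latent-class model (illustrative sketch only).

    counts[d][w] is how often term w occurs in document d.
    beta = 1.0 gives ordinary EM; beta < 1.0 softens the E-step posteriors.
    """
    rng = random.Random(seed)
    n_docs, n_terms = len(counts), len(counts[0])

    def normalize(values):
        total = sum(values)
        return [v / total for v in values]

    # Random, strictly positive starting point for P(z), P(d|z), P(w|z).
    p_z = [1.0 / n_clusters] * n_clusters
    p_d_z = [normalize([rng.random() + 0.1 for _ in range(n_docs)])
             for _ in range(n_clusters)]
    p_w_z = [normalize([rng.random() + 0.1 for _ in range(n_terms)])
             for _ in range(n_clusters)]

    eps = 1e-12  # keeps every probability strictly positive
    for _ in range(n_iter):
        new_z = [0.0] * n_clusters
        new_d = [[0.0] * n_docs for _ in range(n_clusters)]
        new_w = [[0.0] * n_terms for _ in range(n_clusters)]
        for d in range(n_docs):
            for w in range(n_terms):
                n = counts[d][w]
                if n == 0:
                    continue
                # Tempered E-step: P(z|d,w) is proportional to
                # P(z) * (P(d|z) * P(w|z)) ** beta.
                post = [p_z[z] * (p_d_z[z][d] * p_w_z[z][w]) ** beta
                        for z in range(n_clusters)]
                total = sum(post)
                for z in range(n_clusters):
                    r = n * post[z] / total
                    new_z[z] += r
                    new_d[z][d] += r
                    new_w[z][w] += r
        # M-step: renormalize the expected counts.
        p_z = normalize([v + eps for v in new_z])
        p_d_z = [normalize([v + eps for v in row]) for row in new_d]
        p_w_z = [normalize([v + eps for v in row]) for row in new_w]
    return p_z, p_d_z, p_w_z


# Toy data (invented): documents 0-1 use terms 0-1, documents 2-3 use terms 2-3.
counts = [
    [4, 3, 0, 0],
    [3, 5, 0, 0],
    [0, 0, 4, 2],
    [0, 0, 3, 4],
]
p_z, p_d_z, p_w_z = tempered_em(counts, n_clusters=2, beta=0.8)
```

Each row of `p_w_z` is a distribution over terms for one cluster, so terms that tend to show up together should get high probability in the same row. Lowering `beta` makes the model less eager to commit each count to a single cluster, which is the overfitting control that Figure 4 is getting at.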
Good luck with it.