T-61.5020 Statistical Natural Language Processing
Exercises 7 -- Word sense disambiguation
Version 1.0
According to Bayes' theorem, the probability of the sense $s_k$, given the context $c$, is $P(s_k \mid c) = P(c \mid s_k) P(s_k) / P(c)$.
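As a rough illustration of how Bayes' theorem can be turned into a sense classifier, the following minimal sketch computes the posterior probability of each sense from a prior and per-sense word probabilities. It assumes that the context words are conditionally independent given the sense (the naive Bayes assumption), which is not stated in the exercise. The function name `sense_posteriors`, the `floor` parameter for unseen words, and all the numbers are hypothetical and chosen only for illustration.

```python
import math


def sense_posteriors(priors, word_probs, context, floor=1e-6):
    """Return P(sense | context) for every sense, assuming naive Bayes."""
    log_scores = {}
    for sense, prior in priors.items():
        # log P(sense) + sum over context words of log P(word | sense)
        score = math.log(prior)
        for word in context:
            # `floor` is an assumed stand-in probability for unseen words.
            score += math.log(word_probs[sense].get(word, floor))
        log_scores[sense] = score
    # Normalising over the senses corresponds to dividing by P(context).
    top = max(log_scores.values())
    unnorm = {s: math.exp(v - top) for s, v in log_scores.items()}
    total = sum(unnorm.values())
    return {s: v / total for s, v in unnorm.items()}


# Toy numbers, invented for illustration only (not from the exercise data).
priors = {"six": 0.5, "spruce": 0.5}
word_probs = {
    "six": {"kello": 0.3, "metsä": 0.01},     # "kello" = clock
    "spruce": {"kello": 0.01, "metsä": 0.3},  # "metsä" = forest
}
posterior = sense_posteriors(priors, word_probs, ["metsä"])
```

With these toy values the context word "metsä" makes the sense "spruce" clearly more probable than "six". Working in log space avoids numerical underflow when the context contains many words.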
Note: The next two problems require some knowledge of Finnish.
Group 2:
Test set:
Note: The next three problems require the use of a computer.
You have English material (e.g. Google, http://www.google.com) available and want to know
The word ``kuusi'' occurs in the contexts given in the list below. We know that it has two meanings (``six'' and ``spruce''). Using the expectation-maximization (EM) algorithm, classify the contexts into two groups according to the sense in which ``kuusi'' is used.
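One possible way to set up such a clustering is sketched below: EM for a mixture of two naive Bayes models over the context words. This is only one formulation under stated assumptions, not necessarily the intended solution. The function name `em_two_senses`, the smoothing constant `alpha`, the random soft initialisation, and the toy contexts are all hypothetical; the real contexts from the exercise list would replace `toy_contexts`.

```python
import math
import random
from collections import Counter


def em_two_senses(contexts, n_iter=30, seed=0, alpha=0.1):
    """Soft-cluster contexts into two senses with EM (naive Bayes mixture)."""
    rng = random.Random(seed)
    vocab = sorted({w for c in contexts for w in c})
    n_senses = 2
    # Random soft assignments break the symmetry between the two senses.
    resp = []
    for _ in contexts:
        r = rng.random()
        resp.append([r, 1.0 - r])
    for _ in range(n_iter):
        # M-step: re-estimate priors and word probabilities from the
        # soft counts; `alpha` is an assumed additive smoothing constant.
        priors = [
            (sum(r[k] for r in resp) + alpha)
            / (len(contexts) + n_senses * alpha)
            for k in range(n_senses)
        ]
        word_probs = []
        for k in range(n_senses):
            counts = Counter()
            for c, r in zip(contexts, resp):
                for w in c:
                    counts[w] += r[k]
            total = sum(counts.values()) + alpha * len(vocab)
            word_probs.append({w: (counts[w] + alpha) / total for w in vocab})
        # E-step: posterior probability of each sense for each context.
        new_resp = []
        for c in contexts:
            logs = [
                math.log(priors[k])
                + sum(math.log(word_probs[k][w]) for w in c)
                for k in range(n_senses)
            ]
            top = max(logs)
            unnorm = [math.exp(v - top) for v in logs]
            z = sum(unnorm)
            new_resp.append([u / z for u in unnorm])
        resp = new_resp
    return resp


# Toy contexts, invented for illustration only.
toy_contexts = [
    ["kello", "on"],
    ["metsä", "puu"],
    ["kello", "vuotta"],
    ["puu", "metsä"],
]
resp = em_two_senses(toy_contexts)
# Hard decision: index of the more probable cluster for each context.
labels = [max(range(2), key=lambda k: r[k]) for r in resp]
```

The two clusters are unlabeled: EM only groups the contexts, so deciding which cluster corresponds to ``six'' and which to ``spruce'' has to be done afterwards by inspecting the words in each group. The result can also depend on the initialisation, so trying several values of `seed` is advisable.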