... probability (i.e. we assume uniform distribution) l°. The noise in the lexicon was filtered by manually checking the lexicon entries for the most frequent 200 words in the corpus 11 to eliminate ... context constraints using sta- tistical decision trees. We then use the ac- quired constraints in a flexible POS tag- ger. The tagger is able to use informa- tion of any degree: n-grams, automati- ... and the ambiguity ratio is 2.44 tags/word over the ambiguous words, 1.52 overall. We used a lexicon derived from training corpora, that contains all possible tags for a word, as well as...