... corpus consisted of a corpus of about 1,300,000 words with a vocabulary of almost 50,000 words.

2.3 The newspaper corpus

We have also used a corpus consisting of a collection of Swedish newspaper texts of ... specific as it includes many artist names, songs and radio stations that often consist of rare words. It is also very repetitive, covering all combinations of songs and artists in utterances such ... in this first version of the application is limited to 60 Swedish songs, 60 Swedish artists, 3 albums and 3 radio stations. The vocabulary may seem small if you consider the number of songs and artists...