... all of the words from the beginning of the paper up to either the first section of the paper, usually the introduction, or to the end of the first page,whichever occurs first. The abstract is automatically ... either the labeled data (L), the combination of the labeled and distantly-labeled data (L+D), or the in-terpolation of the labeled and distantly-labeled data (L*D). Extractionresults for these ... as well.In the spidering task, the on-topic documents are immediate re-wards, like the pieces of cheese. The actions are following a particularhyperlink. The state is the set of on-topic documents...