... similarity of collections of doc-
uments is closely related to the similarity of the
A Figure of Merit for the Evaluation of Web-Corpus Randomness
Massimiliano Ciaramita
Institute of Cognitive ... Terms from the Web. In Pro-
ceedings of LREC 2004, pages 1313–1316.
K. Bharat and A. Broder. 1998. A Technique for Mea-
suring the Relative Size and Overlap of the Public
Web Search Engines. In ... with query category y_s.
These Web-corpora can be seen as a dataset D
of n = 20 data-points each consisting of a series
of unigram word distributions, one for each search
category. If all n data-points...
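
As a rough illustration of what one such data-point could look like, the sketch below builds a unigram word distribution (relative word frequencies) for each search category. The category names and toy texts are hypothetical and are not taken from the paper; tokenization by whitespace is also an assumption.

```python
from collections import Counter


def unigram_distribution(tokens):
    """Return the relative frequency of each word in a list of tokens."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


# Hypothetical toy corpora, one per search category (not from the source).
corpora = {
    "category_a": "the cat sat on the mat".split(),
    "category_b": "the dog chased the cat".split(),
}

# One data-point: a series of unigram distributions, one per category.
data_point = {
    category: unigram_distribution(tokens)
    for category, tokens in corpora.items()
}
```

A dataset D in this sense would then be a list of n such data-points.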
... freely for any purposes (see copyright notice below).
For information about publishing your research in Environmental Sciences Europe go to
http://www.enveurope.com/authors/instructions/
For information ... (e.g. flowering fields)
Speed of detection of resources
Number of nectar and pollen foragers
Time and frequency of foraging trips
Distribution of breeding and foraging territories or home ...
knowledge of the ecology of a species affects not only the realism of model simulations, but also
of higher tier risk assessments or the use of field studies in risk assessment. Therefore, for a...
... sequence of words. A websearch
query, however, is often formulated by a user as a
bag of keywords. For example, if a user is look-
We mentioned that one of the motivations of
parsing search ... implementation of a
parser for this kind of grammar. Section 5 gives
an example of such a grammar designed for the
purpose of automatic tagging of queries. Section
6 discusses motivations for and ... Contextual information often plays a
big role in resolving tagging ambiguities and is
one of the key benefits of discriminative models
such as CRFs. But such information is not
straightforward...
... al., 1996) is a probabilistic
model for information retrieval and is one of the
most popular and effective algorithms used in in-
formation retrieval. For ease of reference, we in-
corporate the ... provides information about the gen-
eral distribution of term i amongst documents of
all classes, without providing any additional evi-
dence of class preference. The utilization of idf
in information ... Previous
research has shown that in general the performance
of the former tends to be superior to that
of the latter (Mullen and Collier, 2004; Lin and
He, 2009). One of the main issues for supervised
approaches...
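
The excerpt above describes idf as reflecting how a term is distributed across all documents, independent of class. A minimal sketch of that idea follows, assuming the textbook form idf_i = log(N / df_i); the source may use a different variant, and the function name and toy documents are hypothetical.

```python
import math


def inverse_document_frequency(documents):
    """Compute idf = log(N / df) for every term.

    `documents` is a list of token lists. N is the number of documents and
    df is the number of documents containing the term. This is the textbook
    form and an assumption, not necessarily the variant used in the source.
    """
    n_docs = len(documents)
    doc_freq = {}
    for doc in documents:
        for term in set(doc):  # count each term once per document
            doc_freq[term] = doc_freq.get(term, 0) + 1
    return {term: math.log(n_docs / df) for term, df in doc_freq.items()}


# Hypothetical toy documents.
docs = [["good", "film"], ["bad", "film"], ["good", "plot"]]
idf = inverse_document_frequency(docs)
```

A term that occurs in every document gets an idf of zero, which matches the point that idf by itself carries no evidence of class preference.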
... the
found set of features for text classification (index-
ing) for an OIR query of the first level (finds opin-
ionated information) and for an OIR query of the
second level (finds opinionated information ... Association for Computational Linguistics
Kinds of Features for Chinese Opinionated Information Retrieval
Taras Zagibalov
Department of Informatics
University of Sussex
United Kingdom
T.Zagibalov@sussex.ac.uk
Abstract
This ... paper presents the results of experi-
ments in which we tested different kinds of
features for retrieval of Chinese opinionated
texts. We assume that the task of retrieval of
opinionated texts (OIR)...
... lack of significant
differences between the measures except for cer-
tain specific values of
. We have also shown that
the evaluation results and the ranking of AMs dif-
fer depending on the kind of ... data.
(2) The evaluation strategies applied: Instead
of examining only a small sample of n-best candidates
for each measure, as is common practice,
we make use of recall and precision values for n-best ... for evaluation
General statistics for the AdjN and PNV base
sets are given in Table 1. Manual annotation was
performed for AdjN pairs with frequency
and PNV triples with only (see section
5 for...
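
The evaluation strategy mentioned above relies on precision and recall computed over best-ranked candidate lists. The following is a minimal sketch of that computation, assuming a candidate list already sorted by an association measure and a manually annotated set of true positives; the function name and inputs are hypothetical, not the authors' code.

```python
def n_best_precision_recall(ranked_candidates, true_positives, n):
    """Precision and recall of the top-n candidates of a ranked list.

    ranked_candidates: candidates sorted from best to worst score.
    true_positives: set of candidates annotated as correct.
    """
    top = ranked_candidates[:n]
    hits = sum(1 for candidate in top if candidate in true_positives)
    precision = hits / len(top) if top else 0.0
    recall = hits / len(true_positives) if true_positives else 0.0
    return precision, recall
```

Evaluating this for a range of list sizes, rather than for a single cutoff, gives the precision and recall curves that allow measures to be compared across the whole ranking.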
... 1979.
Representation and classification of knowledge and
information for use in interactive information retrieval.
In Human Aspects of Information Science.
Oslo: Norwegian Library School.
...
constraints typical of DR systems. The modifications
are designed to recognize such aspects of discourse
structure as establishment of topic; setting of context;
summarizing; concept foregrounding; ... alternative systems for each of the proposed
modifications. In this experiment the original
corpus of thirty abstracts (but not the problem statements)
is submitted to all versions of the analysis...
... resources
for information retrieval tasks. Natural language information
retrieval. Kluwer Academic Publishers
Dordrecht, NL.
Bruce Croft and John Lafferty. 2003. Language Modeling
for Information Retrieval. ... of Information Retrieval.
1 Introduction
The task of an Information Retrieval (IR) system
is to retrieve documents from a collection, in re-
sponse to a user need, which is expressed in the
form ... Blocks for Information
Retrieval
Christina Lioma
Department of Computing Science
University of Glasgow
17 Lilybank Gardens
Scotland, U.K.
xristina@dcs.gla.ac.uk
Iadh Ounis
Department of Computing...
... Linguistics
Is It Correct? - Towards Web-Based Evaluation of Automatic Natural
Language Phrase Generation
Calkin S. Montero and Kenji Araki
Graduate School of Information Science and Technology, ... number of hits, returned by the search
engine for a given n-gram. Table 1 shows some
of the n-grams produced for the generated phrase
“what are your plans for the game?” The frequency
of each ... (2005), the size of indexable Web had become approximately
11.5 billion pages
9 The tuning of the thresholds of each n-gram type was
performed using the phrases of the Phrase DB
10 The evaluation...
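
To make the n-gram step in the excerpt above concrete, the sketch below splits a generated phrase into word n-grams, each of which could then be submitted to a search engine to obtain a hit count. Whitespace tokenization and the function name are assumptions; the search-engine query and the threshold values are not shown because the excerpt does not give them.

```python
def word_ngrams(phrase, n):
    """Return all contiguous word n-grams of a phrase, in order."""
    words = phrase.split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


phrase = "what are your plans for the game?"
trigrams = word_ngrams(phrase, 3)
```

Each n-gram's hit count would then be compared against a threshold tuned for that n-gram type.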