...
A completely different approach is to
formulate a DATR theory which gives the
lexical information in a PATR-usable format
(i.e. a feature structure) as the result of the
evaluation of a ... resulthag value has to be transformed into
a PATR path equation (that partially describes
a feature structure) and passed on to the PATR
system. What is most disturbing about this
st...
... Geneva
tanja.samardzic@unige.ch
Abstract
We present a novel approach to the task of
word lemmatisation. We formalise lemmati-
sation as a category tagging task, by describ-
ing how a word-to-lemma transformation rule
can ... European languages having different morpho-
logical complexity, including agglutinative (Hungar-
ian, Estonian) and fusional (Slavic) languages.
2 Lemmatisation...
... approaches: These approaches
crucially rely on lexical knowledge base.
Graph-based WSD approaches (Agirre and
Soroa, 2009; Sinha and Mihalcea, 2007) per-
form disambiguation over a graph composed
of ... of a word as its vari-
able. Each agent w
i
is associated with the variable
s
w
i
. The value assigned to this variable indicates
the sense assigned by the algorithm.
3.3 Domains
Sense...
... 5¢-CTC
GAGATGGATAAAGTTTTAAACAGAG-3¢ and LTA-
1R, 5¢-TGAAGGCAAATCTCTGGAC-3¢ for the former,
and LTA–M2F, 5¢-CAGCTGTTTTGCTTGAATTATG-3¢
and LTA–2R, 5¢-GAATTCATTATGTTTCAGGTTCA
GGGG-3¢ for the latter. The ... isolation and long-term
culture of organ-specific blood vascular and lymphatic
endothelial cells of the mouse
Takashi Yamaguchi, Taeko Ichise, Osamu Iwata, Akiko Hori, Tomomi Adachi, Masar...
... takes as input file(s) annotated in the
S
ENSEVAL-2 lexical sample format, which is an
XML–based format that has been used for both the
S
ENSEVAL-2 and SENSEVAL-3 exercises. A file in
this format ... disambiguated.
3.1 Command Line
The command-line interface disamb.pl takes as input
aS
ENSEVAL-2 formatted lexical sample file. The
program disambiguates the marked up word in each
ins...
... domain.
Marking synsets with field labels has a clear ad-
vantage: in general, given a polysemous word in
WordNet and a particular field label, in most of
the cases the word is disambiguated. ... (or at least a reason-
able approximation of it) in a short time. The
right FL contains those words that are necessary
for the application and only those. The presence
of all the r...
... 499–
507.
13 Filho EJ, Carvalho AU, Assis RA, Lobato FF,
Rachid MA, Carvalho AA, Ferreira PM, Nascimento
RA, Fernandes AA, Vidal JE et al. (2009) Clinico-
pathologic features of experimental Clostridium ... S, Wada A, Shibasaki S, Annaka M,
Higuchi H, Adachi K, Mori N, Ishikawa T, Masuda
Y, Watanabe H et al. (2009) Spread of a large plasmid
carrying the cpe gene and the tcp locus amongst
C...
... kar + A + nA kar
karvAnA kar + vA + nA kar
Word Form Inflectional Segmentation Root
karnA kar + nA kar
karAnA karA + nA karA
karvAnA karvA + nA karvA
Table 1: Morpheme Segmentation
laDkA Nominative ... Hindi language, a highly inflectional and mod-
erately agglutinative Indo-European language spo-
ken widely in South Asia.
Since a POS tagger, another basic tool, was
available along with P...
... 111–127.
13 Kawasaki S, Arai H, Kodama T & Igarashi Y (1997)
Gene cluster for dissimilatory nitrite reductase (nir)
from Pseudomonas aeruginosa: sequencing and identifi-
cation of a locus for heme ... dehydrogenase-ferrochelatase
from Saccharomyces cerevisiae), we found that the
two proteins had 24% sequence similarity. A crystal
structure of Met8P has shown that this protein has
a...
... red).
Table 1. Average distances between CA atoms of the stefins and
catalytic residues of cysteine proteases.
Distance calculated d (A
˚
)
Papain–stefin B 23.93
Cathepsin H–stefin A 23.36 ± 0.23
Cathepsin ... the
P1 data set is a consequence of highly anisotropic diffrac-
tion, which forced us to discard part of the collected data
to maintain reasonable merging statistics. The anisotropy
w...