... predicted
translation was counted as an alignment with 100% alignment
coverage, at least 99% identity, and no gaps. A manually
annotated gene was counted as present if the protein alignment
was at least ... FASTA [13]. The evidence used to generate
these manual annotations had not been deposited in any
public database and was not used in the generation of any of
the input sets or...
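The first matching criterion above (100% alignment coverage, at least 99% identity, no gaps) can be written as a simple predicate. The following is only a minimal sketch: the `Alignment` record and its field names are hypothetical, and the elided threshold for manually annotated genes is deliberately not modelled.

```python
from dataclasses import dataclass


@dataclass
class Alignment:
    """Hypothetical summary of one protein alignment (names are assumptions)."""
    coverage: float  # fraction of the sequence covered by the alignment, 0.0 to 1.0
    identity: float  # fraction of identical aligned residues, 0.0 to 1.0
    gaps: int        # number of gap positions in the alignment


def counts_as_match(aln: Alignment) -> bool:
    """Apply the stated criterion: full coverage, >= 99% identity, no gaps."""
    return aln.coverage >= 1.0 and aln.identity >= 0.99 and aln.gaps == 0
```

An alignment that falls short on any one of the three conditions is rejected, so the conditions are combined with a logical AND.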
... a parse-annotated treebank
from raw data
repeat
Parse a new section of raw data
Manually correct errors in the parser output
Add the corrected data to the training set
Extract a new grammar for ... can be rapidly induced from appropriate
treebank material. However, treebank- and
machine learning-based grammatical resources reflect
the characteristics of the training data. They
genera...
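The bootstrapping loop listed in the pseudocode above (parse, manually correct, add to the training set, re-extract a grammar) can be sketched as follows. This is a hedged illustration, not the authors' implementation: the callables `parse`, `correct`, and `extract_grammar` are hypothetical placeholders, and iterating once per section of raw data is an assumption, since the loop's termination condition is not given in the excerpt.

```python
from typing import Any, Callable, Iterable, List, Sequence, Tuple


def bootstrap_treebank(
    sections: Iterable[Sequence[str]],
    parse: Callable[[Any, str], Any],
    correct: Callable[[Any], Any],
    extract_grammar: Callable[[List[Any]], Any],
    seed_trees: Sequence[Any],
) -> Tuple[List[Any], Any]:
    """Grow a treebank section by section, re-extracting the grammar each round."""
    training = list(seed_trees)
    grammar = extract_grammar(training)
    for section in sections:
        # Parse a new section of raw data with the current grammar.
        parsed = [parse(grammar, sentence) for sentence in section]
        # Manual correction is modelled as a callable (an annotator in practice).
        corrected = [correct(tree) for tree in parsed]
        # Add the corrected data to the training set.
        training.extend(corrected)
        # Extract a new grammar from the enlarged training set.
        grammar = extract_grammar(training)
    return training, grammar
```

Because the grammar is re-extracted after every section, later sections are parsed with a grammar that already reflects the earlier corrections.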
... determination in Hymenoptera
Like most Hymenoptera, honey bees have an extraordinary
sex-determining mechanism known as haplo-diploidy:
females are normally diploid and a product of sexual
congress; ... olfactory
associations. This enhanced neural plasticity may have led to
the retention in Hymenoptera of genes such as Mahya,
which is also found in vertebrates but has been lost from
Dipte...
...
cardiovascular disease. In addition, data on food intake,
basal energy expenditure (BEE), and total daily
energy expenditure (TEE) were not available for
an analysis that might have provided a ... demonstrated that hypoalbuminemia
was a strong predictor of mortality in dialysis
patients. Kalantar-Zadeh et al. (34) also showed higher
mortality in dialysis patients with lower albumin....
... Computational Linguistics
Creating a manually error-tagged and shallow-parsed learner corpus
Ryo Nagata
Konan University
8-9-1 Okamoto,
Kobe 658-0072 Japan
rnagata@konan-u.ac.jp
Edward Whittaker ... Vera Sheinman
The Japan Institute for
Educational Measurement Inc.
3-2-4 Kita-Aoyama, Tokyo, 107-0061 Japan
{whittaker,sheinman}@jiem.co.jp
Abstract
The availability of learner corpora, especi...
... collocation, computational linguists
agree that collocations and more generally multi-word
expressions play a very important role in
many NLP applications such as terminology extraction,
translation, ... unable
to create a complete analysis of a sentence, the
Fips parser returns chunks of partial analyses. If
Creating a Multilingual Collocation Dictionary from Large Text Corpor...
... Conference on Empirical Methods in Natural
Language Processing and Computational Natural
Language Learning (EMNLP-CoNLL), pages 410–420.
Andreas Vlachos, Anna Korhonen, and Zoubin
Ghahramani. 2009. Unsupervised ... the
summarization community how to evaluate a summary.
The methods at hand are either superficial
or time- and resource-consuming and not easily repeatable.
Another argument a...
... Science
University of Pennsylvania
Philadelphia, PA 19104, USA
juliahr@cis.upenn.edu
Abstract
We present an algorithm which creates a
German CCGbank by translating the syntax
graphs in the German Tiger ... Sapporo, Japan.
Gerald Gazdar, Ewan Klein, Geoffrey K. Pullum, and Ivan A.
Sag. 1985. Generalised Phrase Structure Grammar.
Blackwell, Oxford.
Julia Hockenmaier and Mark Steedman. 20...
... 15,
431–440.
55. Jin, D., Takai, S., Yamada, M., Sakaguchi, M., Yao, Y. &
Miyazaki, M. (2001) Possible roles of cardiac chymase after
myocardial infarction in hamster hearts. Jpn. J. Pharmacol. 86,
203–214.
56. Miyazaki, M., Wada, T., Shiota, N. & Takai, S. (1999) Effect of an
angiotensin ... Biomedical Research, Hino, Tokyo, Japan;
2 TEIJIN Material Analysis Research Laboratories, Tokyo, Japan;
3 Center f...
... syntactical relation).
When parallel corpora are available, the
translation equivalents of the collocation context
are also displayed, thus allowing the user to see how a
given collocation was translated ... for the optimal
candidate as target paragraph.
We perform two kinds of tests on the paragraphs
in this span: a test of paragraph content, and a test
of paragraphs' relative size ma...