... Ill-formed Input
for Robust Spoken-language Translation
Osamu FURUSE†, Setsuo YAMADA, Kazuhide YAMAMOTO
ATR Interpreting Telecommunications Research Laboratories
2-2 Hikaridai, Seika-cho, ...
parsing as follows:
(A) Neighboring passive arcs can create a
larger passive arc by concatenating them.
(B) A passive arc which concatenates neigh-
boring passive arcs can be further...
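
Rule (A) can be illustrated with a minimal sketch. It assumes a passive arc is simply a span over the input with the words it covers; the names `PassiveArc`, `concatenate`, and `close_chart` are hypothetical and not taken from the paper. The sketch also lets a concatenated arc take part in further concatenation, which is only one possible reading of the truncated rule (B).

```python
from typing import Iterable, NamedTuple, Set


class PassiveArc(NamedTuple):
    start: int  # position where the arc begins
    end: int    # position where the arc ends (exclusive)
    text: str   # words covered by the arc (hypothetical field)


def concatenate(left: PassiveArc, right: PassiveArc) -> PassiveArc:
    """Rule (A): join two neighboring passive arcs into a larger one."""
    if left.end != right.start:
        raise ValueError("arcs are not neighbors")
    return PassiveArc(left.start, right.end, f"{left.text} {right.text}")


def close_chart(arcs: Iterable[PassiveArc]) -> Set[PassiveArc]:
    """Apply rule (A) until no new arc appears.

    Assumes every arc covers at least one position (start < end),
    otherwise the loop would not terminate.
    """
    chart = set(arcs)
    agenda = list(chart)
    while agenda:
        arc = agenda.pop()
        for other in list(chart):
            # Try the new arc on both sides of each arc already in the chart.
            for left, right in ((arc, other), (other, arc)):
                if left.end == right.start:
                    new_arc = concatenate(left, right)
                    if new_arc not in chart:
                        chart.add(new_arc)
                        agenda.append(new_arc)
    return chart


# Illustrative input only: three adjacent one-segment arcs.
words = ["kare wa", "kanojo no", "atatakai"]
initial = [PassiveArc(i, i + 1, w) for i, w in enumerate(words)]
chart = close_chart(initial)
```

With three adjacent segments, the chart ends up holding the three original arcs, the two arcs over neighboring pairs, and one arc over the whole input. A real parser would attach patterns or categories to arcs and filter the concatenations; the sketch omits that.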
... have just two terms, namely a standard
phrase translation model based on the classified
parallel data, and an instance model assigning a
probability to each target instance given the source
class ... Improving Statistical
Machine Translation
Adrià de Gispert
TALP Research Center
Universitat Politècnica de Catalunya (UPC)
Barcelona
agispert@gps.tsc.upc.es
Abstract
In this paper...
... 619-0288, Japan
Seiichi Yamamoto
Masuzo Yanagida
ATR Spoken Language
Doshisha University
Translation Research Laboratories
1-3, Tatara-miyakodani, Kyotanabe,
2-2-2, Hikaridai, "Keihanna ... Applications of Automatic Evaluation Methods to Measuring a Capability
of Speech Translation System
Keiji Yasuda
ATR Spoken Language Translation Research Laboratories
2-2-2, Hik...
... PRE
a
Sequence RCS
b
II ATGAAACA 1.00
IV tTGAAACA 0.27
c
AaGAAACA 0.14
ATaAAACA 0.81
ATcAAACA 0.03
c
ATtAAACA 0.01
c
ATGcAACA 0.69
ATGgAACA 0.20
c
I ATGAgACA 0.02
c
ATGAAgCA 0.05
c
ATGAAAgA 0.30
III ... lysate. The reactions were carried out at
30 °C for 90 min and then assayed immediately for DNA
binding activity.
Oligonucleotides used for EMSA are detailed in
Table S1, and were anne...
... corpus for
statistical machine translation. In MT Summit X.
Young-Suk Lee. 2004. Morphological analysis for
statistical machine translation. In Proceedings of
HLT/NAACL-2004 – Companion Volume, pages ... presents a
review of related work and situates our approach.
We assume that the reader is familiar with the
basics of phrase-based statistical machine transla-
tion (Koehn et al...
... manual translations. Our probabilistic dic-
tionary was acquired at a fraction of the cost of
manually translating the entire thesaurus. By
prioritizing human translations based on the
translation ... value that do not make it onto the prioritized
list for manual translation (i.e., low translation
value). The overall access value of the thesaurus
rises very quickly after initia...
... motivates a fully-Bayesian approach,
which, rather than committing to a particular set
of parameter values as in an EM-based approach,
integrates over all possible values of θ and, most
importantly, ... Morphologically-Rich,
Resource-Scarce Languages
Kazi Saidul Hasan and Vincent Ng
Human Language Technology Research Institute
University of Texas at Dallas
Richardson, TX 75083-0688
{saidu...
... ma-
chine translation that is quick and can be reused as a
benchmark across machine translation tasks. Bleu is
calculated as the geometric mean of n-grams com-
paring a machine translation and ... a machine translation framework based
on Praguian tectogrammatics (Sgall, 1967) which
represents four main layers: word layer, morpho-
logical layer, analytical layer, and tectog...
... -1O
[Figure: Input, Output, Label]
kare wa (he) | kanojo no (her) | atatakai (warm) | magokoro ni (heart) | kando-shita. (be moved)
(He was moved by her warm heart.)
Seg. No. 1 2 3 4 5
kare wa
(he)
kanojo no
(her)
atatakai
(warm)
magokoro ... are analyzed. A number of studies have focused on parsing
of Japanese as well as of other languages. Popular
parsers are CaboCha (Kudo and Matsumoto, 2002)
and KNP...
... approach is that it
allows one to make use of the available grammars
and parsers for the language modeling task. So as to
be used for this task, syntactic analyzer compo-
nents developed for a rule-based ... c-structure and the associated f-
structures.
3.6 Parse evaluation and recovery
After all rules are executed and no more edges are
left in the agenda, the chart parsing process ends...