... Combination of Arabic preprocessing schemes for statistical machine translation. In Proc. of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association ... 2006. Modelling lexical redundancy for machine translation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational ... Boston, Massachusetts, USA. Adam Lopez and Philip Resnik. 2006. Word-based alignment, phrase-based translation: What’s the link? In Proceedings of the 7th Conference of the Association for Machine...
... for Phrase-based Statistical Machine Translation Models. In Proc. of the Association for Machine Translation in the Americas (AMTA). P. Koehn. 2004b. Statistical Significance Tests for Machine Translation ... Evaluation. In Proc. of the EMNLP, Barcelona, Spain. Y. Lee. 2004. Morphological Analysis for Statistical Machine Translation. In Proc. of NAACL, Boston, MA. Y. Lee. 2005. IBM Statistical Machine Translation ... only statistically significantly at the 1% level. The results for WA are generally similar to D1. This makes sense since w+ is by far the most common of the two conjunctions D1 splits off. The...
... word in the target clump of given the final two words of the target clump of , (d) probability of predicting the rest of the words in the target clump of . The language model is trained on a separate ... and Wei-Jing Zhu. 2002. BLEU: a Method for Automatic Evaluation of machine translation. In Proc. of the 40th Annual Conf. of the Association for Computational Linguistics (ACL 02), pages 311–318, ... perceptron, the theory justifies the standard practice of going through the training data more than once. In the non-separable case, the method solves a regularized version of Eq. 8, which has the statistical...
... either. Flat model assigns constant probabilities for monotone order and non-monotone order. The two probabilities can be set to prefer monotone or non-monotone orientations depending on the language ... ways. The first one is recombination. When two derivations in the same cell have the same w leftmost/rightmost words on the English yields, where w depends on the order of the language model, they ... Association for Computational Linguistics Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation Deyi Xiong Institute of Computing Technology Chinese Academy of Sciences Beijing,...
... translation ambiguity by the exploitation of the context information at the sentence level, we adopt the topical context information in our method for the following reasons: (1) the topic information ... topic-specific translation model to quantify the effect of the topic information on the translation probability estimation. • Estimate the topic posterior distributions of phrases in the in-domain monolingual ... Model Adaptation for Statistical Machine Translation. Machine Translation, pages 77-94. Hua Wu, Haifeng Wang and Chengqing Zong. 2008. Domain Adaptation for Statistical Machine Translation with...
... Active learning for multilingual statistical machine translation. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural ... statistical phrase-based machine translation. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, ... learning for sequence labeling. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP,...
... feature templates to encode the reordering information in the source part of the translation rules. We need to remember the source side context of the rules, the model size would still be much smaller ... generally viewed beyond the scope of the phrase-based systems discussed above, because of either distortion limits or lack of discriminative features for modeling. The most notable solution to this problem ... Phrase-Based Model for Statistical Machine Translation. In Proc. ACL, pages 263-270. Michael Collins, Philipp Koehn and Ivona Kucerova. 2005. Clause restructuring for statistical machine translation. ...
... all the allowed bilingual corpora except the UN corpus and Hong Kong Hansard corpus have been used for estimating the translation model. The second one is the small data condition where only ... method is also suitable for syntax-based MT systems and phrase-based MT systems. The only difference is the definition of the context. For a syntax-based system, the context of a rule could be ... function. After carrying out a series of additional experiments on the small data condition and observing the results on the dev set, we set the size of the vector to 500 for Alg1; while for...
... Association for Computational Linguistics. Philipp Koehn et al. 2007. Moses: Open source toolkit for statistical machine translation. In Proceedings of the 45th Annual Meeting of the Association for ... Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, HLT-NAACL ’06, pages 17–24. Chris Callison-Burch. ... experiments were performed on Chinese-English, the approach is more useful for low resource languages. The advantage of the proposed method is that we only need the SRL labels on either side of the language...
... approach for the use of syntactic information within phrase-based SMT systems. The approach constitutes a simple, direct method for the incorporation of syntactic information in a phrase based ... length.1 For each of these 100 translations, we presented the two annotators with three translations: the reference (human) translation, the output from the baseline system, and the output from the ... where is the cardinality of the set . We now come to the definition of : how should we judge whether a translation from one system is better or worse than the translation from another system?...
... context information that can be extracted from the same parallel corpus. This additional information could be: Simple context information: information of the words surrounding the word pair; Syntactic ... sentence. Therefore, the results of the translation experiments are only preliminary. For the evaluation of the translation quality we use the automatically computable Word Error Rate (WER). The ... Spain fcn@iti.upv.es Abstract Typically, the lexicon models used in statistical machine translation systems do not include any kind of linguistic or contextual information, which often leads to problems in performing a correct...
... higher scores than the sample translations, which leads to the conclusion that the improvement of the models (stronger language model for the target language, better translation model and especially ... 1: Architecture of the translation approach based on Bayes' decision rule. The task of statistical machine translation can be subdivided into two fields: 1. the field of modelling, which ... along the positions of the target string, which allows the integration of the bigram in a very natural way without restricting the alignment paths to the class of monotone alignments. The...
... translation task, i.e. we measure the influence of the segmentation process on the final translation output. The quality of the translation output is mainly evaluated using BLEU, with NIST (Doddington, 2002) ... unit we are considering. Once the phrases are extracted, we perform the estimation of weights for the features of the log-linear model. We then use a simple dictionary-based maximum matching ... dictionary if and only if their measures are above some fixed thresholds tCOOC and tAC, which allow for the control of the size of the dictionary and the quality of its contents. Some other measures (including...
... have concentrated on the approaches that improve translation quality using information between hypotheses from one or more SMT systems as well. System combination is built on top of the N-best ... reason for these improvements is that, system combination can only use a small portion of the component systems’ search spaces; HM decoding, on the other hand, can make full use of the entire ... phrase-based system in decoding procedure, as such hypotheses are based on flat structures, which cannot provide any additional information needed in the syntactic model. The second one is the...