Adaptation of the translation model for statistical machine translation based on information retrieval

Scientific report document: "Unsupervised Search for The Optimal Segmentation for Statistical Machine Translation" doc

Scientific report

... Combination of Arabic preprocessing schemes for statistical machine translation. In Proc. of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association ... 2006. Modelling lexical redundancy for machine translation. In Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational ... Boston, Massachusetts, USA. Adam Lopez and Philip Resnik. 2006. Word-based alignment, phrase-based translation: What’s the link? In Proceedings of the 7th Conference of the Association for Machine...
Scientific report document: "Combination of Arabic Preprocessing Schemes for Statistical Machine Translation" ppt

Scientific report

... for Phrase-based Statistical Machine Translation Models. In Proc. of the Association for Machine Translation in the Americas (AMTA). P. Koehn. 2004b. Statistical Significance Tests for Machine Translation ... Evaluation. In Proc. of the EMNLP, Barcelona, Spain. Y. Lee. 2004. Morphological Analysis for Statistical Machine Translation. In Proc. of NAACL, Boston, MA. Y. Lee. 2005. IBM Statistical Machine Translation ... only statistically significantly at the 1% level. The results for WA are generally similar to D1. This makes sense since w+ is by far the most common of the two conjunctions D1 splits off. The...
Scientific report document: "A Localized Prediction Model for Statistical Machine Translation" ppt

Scientific report

... word in the target clump of given the final two words of the target clump of , (d) probability of predicting the rest of the words in the target clump of . The language model is trained on a separate ... and Wei-Jing Zhu. 2002. BLEU: a Method for Automatic Evaluation of machine translation. In Proc. of the 40th Annual Conf. of the Association for Computational Linguistics (ACL 02), pages 311–318, ... perceptron, the theory justifies the standard practice of going through the training data more than once. In the non-separable case, the method solves a regularized version of Eq. 8, which has the statistical...
Scientific report: "Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation" docx

Scientific report

... either. Flat model assigns constant probabilities for monotone order and non-monotone order. The two probabilities can be set to prefer monotone or non-monotone orientations depending on the language ... ways. The first one is recombination. When two derivations in the same cell have the same w leftmost/rightmost words on the English yields, where w depends on the order of the language model, they ... Association for Computational Linguistics. Maximum Entropy Based Phrase Reordering Model for Statistical Machine Translation. Deyi Xiong, Institute of Computing Technology, Chinese Academy of Sciences, Beijing,...
Scientific report document: "Translation Model Adaptation for Statistical Machine Translation with Monolingual Topic Information" doc

Scientific report

... translation ambiguity by the exploitation of the context information at the sentence level, we adopt the topical context information in our method for the following reasons: (1) the topic information ... topic-specific translation model to quantify the effect of the topic information on the translation probability estimation. • Estimate the topic posterior distributions of phrases in the in-domain monolingual ... Model Adaptation for Statistical Machine Translation. Machine Translation, pages 77-94. Hua Wu, Haifeng Wang and Chengqing Zong. 2008. Domain Adaptation for Statistical Machine Translation with...
Scientific report document: "Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation" docx

Scientific report

... Active learning for multilingual statistical machine translation. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural ... statistical phrase-based machine translation. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, ... learning for sequence labeling. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP,...
Scientific report document: "A Ranking-based Approach to Word Reordering for Statistical Machine Translation" doc

Scientific report

... feature templates to encode the reordering information in the source part of the translation rules. We need to remember the source side context of the rules, the model size would still be much smaller ... generally viewed beyond the scope of the phrase-based systems discussed above, because of either distortion limits or lack of discriminative features for modeling. The most notable solution to this problem ... Phrase-Based Model for Statistical Machine Translation. In Proc. ACL, pages 263-270. Michael Collins, Philipp Koehn and Ivona Kucerova. 2005. Clause restructuring for statistical machine translation. ...
Scientific report document: "Bilingual Sense Similarity for Statistical Machine Translation" ppt

Scientific report

... all the allowed bilingual corpora except the UN corpus and Hong Kong Hansard corpus have been used for estimating the translation model. The second one is the small data condition where only ... method is also suitable for syntax-based MT systems and phrase-based MT systems. The only difference is the definition of the context. For a syntax-based system, the context of a rule could be ... function. After carrying out a series of additional experiments on the small data condition and observing the results on the dev set, we set the size of the vector to 500 for Alg1; while for...
Scientific report document: "Corpus Expansion for Statistical Machine Translation with Semantic Role Label Substitution Rules" doc

Scientific report

... Association for Computational Linguistics. Philipp Koehn et al. 2007. Moses: Open source toolkit for statistical machine translation. In Proceedings of the 45th Annual Meeting of the Association for ... Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, HLT-NAACL ’06, pages 17–24. Chris Callison-Burch. ... experiments were performed on Chinese-English, the approach is more useful for low resource languages. The advantage of the proposed method is that we only need the SRL labels on either side of the language...
Scientific report document: "Clause Restructuring for Statistical Machine Translation" ppt

Scientific report

... approach for the use of syntactic information within phrase-based SMT systems. The approach constitutes a simple, direct method for the incorporation of syntactic information in a phrase based ... length. For each of these 100 translations, we presented the two annotators with three translations: the reference (human) translation, the output from the baseline system, and the output from the ... where is the cardinality of the set . We now come to the definition of : how should we judge whether a translation from one system is better or worse than the translation from another system?...
Scientific report document: "Refined Lexicon Models for Statistical Machine Translation using a Maximum Entropy Approach" pptx

Scientific report

... context information that can be extracted from the same parallel corpus. This additional information could be: Simple context information: information of the words surrounding the word pair; Syntactic ... sentence. Therefore, the results of the translation experiments are only preliminary. For the evaluation of the translation quality we use the automatically computable Word Error Rate (WER). The ... Spain fcn@iti.upv.es Abstract: Typically, the lexicon models used in statistical machine translation systems do not include any kind of linguistic or contextual information, which often leads to problems in performing a correct...
Scientific report document: "A DP based Search Algorithm for Statistical Machine Translation" docx

Scientific report

... higher scores than the sample translations, which leads to the conclusion that the improvement of the models (stronger language model for the target language, better translation model and especially ... 1: Architecture of the translation approach based on Bayes' decision rule. The task of statistical machine translation can be subdivided into two fields: 1. the field of modelling, which ... along the positions of the target string, which allows the integration of the bigram in a very natural way without restricting the alignment paths to the class of monotone alignments. The...
Scientific report document: "Bilingually Motivated Domain-Adapted Word Segmentation for Statistical Machine Translation" pptx

Scientific report

... translation task, i.e. we measure the influence of the segmentation process on the final translation output. The quality of the translation output is mainly evaluated using BLEU, with NIST (Doddington, 2002) ... unit we are considering. Once the phrases are extracted, we perform the estimation of weights for the features of the log-linear model. We then use a simple dictionary-based maximum matching ... dictionary if and only if their measures are above some fixed thresholds tCOOC and tAC, which allow for the control of the size of the dictionary and the quality of its contents. Some other measures (including...
Scientific report: "Hypothesis Mixture Decoding for Statistical Machine Translation" ppt

Scientific report

... have concentrated on the approaches that improve translation quality using information between hypotheses from one or more SMT systems as well. System combination is built on top of the N-best ... reason for these improvements is that system combination can only use a small portion of the component systems’ search spaces; HM decoding, on the other hand, can make full use of the entire ... phrase-based system in decoding procedure, as such hypotheses are based on flat structures, which cannot provide any additional information needed in the syntactic model. The second one is the...
