Large Pruned or Continuous Space Language Models on a GPU for Statistical Machine Translation

Scientific report: "On-line Language Model Biasing for Statistical Machine Translation" docx

Uploaded: 17/03/2014, 00:20
... Language Model Biasing for Statistical Machine Translation Sankaranarayanan Ananthakrishnan, Rohit Prasad and Prem Natarajan Raytheon BBN Technologies Cambridge, MA 02138, U.S.A. {sanantha,rprasad,pnataraj}@bbn.com Abstract The ... biasing as we move from low-resource languages to those for which significantly larger parallel corpora and LM training data are available. References Yaser Al-Onaizan, Jan Curin, Michael ... of machine translation. In ACL ’02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pages 311–318, Morristown, NJ, USA. Association for Computational...
  • 5
  • 311
  • 0
Document: Scientific report: "Bucking the Trend: Large-Scale Cost-Focused Active Learning for Statistical Machine Translation" docx

Uploaded: 20/02/2014, 04:20
... International Joint Conference on Natural Language Processing of the AFNLP, pages 181–189, Suntec, Singapore, August. Association for Computational Linguistics. Gholamreza Haffari, Maxim Roy, and Anoop ... the Association for Computational Linguistics, pages 415–423, Boulder, Colorado, June. Association for Computational Linguistics. Rebecca Hwa. 2000. Sample selection for statistical grammar ... cost-conscious. The vast majority of AL research has not focused on accurate cost accounting and a typical assumption is that each annotatable has equal annotation cost. An early exception in the AL...
  • 11
  • 580
  • 0
Document: Scientific report: "Refined Lexicon Models for Statistical Machine Translation using a Maximum Entropy Approach" pptx

Uploaded: 20/02/2014, 18:20
... Lexicon Models for Statistical Machine Translation using a Maximum Entropy Approach Ismael García Varea Dpto. de Informática Univ. de Castilla-La Mancha Campus Universitario s/n 02071 Albacete, ... entropy approach is outlined in Section 3. 2 Statistical Machine Translation The goal of the translation process in statistical machine translation can be formulated as follows: A source language ... language corresponds to only one word in the target language. Those lexicon models lack from context information that can be extracted from the same parallel corpus. This additional information...
  • 8
  • 427
  • 0
Scientific report: "Distortion Models For Statistical Machine Translation" doc

Uploaded: 08/03/2014, 02:21
... 6th Conference of the Association for Machine Translation in the Americas, pages 115–124, Washington DC, September-October. The Association for Machine Translation in the Americas (AMTA). Franz ... for statistical machine translation. In Daniel Marcu Susan Dumais and Salim Roukos, editors, HLT-NAACL 2004: Short Papers, pages 101–104, Boston, Massachusetts, USA, May 2 - May 7. Association ... 7. Association for Computational Linguistics. Christoph Tillmann and Hermann Ney. 2003. Word Re-ordering and a DP Beam Search Algorithm for Statistical Machine Translation. Computational Linguistics,...
  • 8
  • 485
  • 0
Scientific report: "Randomised Language Modelling for Statistical Machine Translation" doc

Uploaded: 17/03/2014, 04:20
... that higher-order LMs and models trained on additional monolingual corpora can yield better translation performance, the challenges in deploying large LMs are not trivial. Increasing the order ... monolingual corpora to be used more easily for language modelling in SMT. In a companion paper (Talbot and Osborne, 2007) we have proposed a framework for deriving conventional smoothed n-gram models ... minimum error rate training and evaluation is performed using the BLEU score. 4.2 Baseline and comparison models Our baseline LM and other comparison models are conventional n-gram models smoothed...
  • 8
  • 268
  • 0
Scientific report: "A Comparative Study on Reordering Constraints in Statistical Machine Translation" potx

Uploaded: 17/03/2014, 06:20
... increases from about 87% to about 96%. We have presented a polynomial-time search algorithm for statistical machine translation based on the ITG constraints and its extension for the generation ... inverted concatenation 1 − p_m. Now, we have obtained two word graphs: one for a monotone and one for an inverted concatenation. The final word graph is constructed by merging the two start nodes and ... concatenating the partial word graphs either in monotone or inverted order. Now, we describe this idea in a more formal way. A word graph is a directed acyclic graph (dag) with one start and one...
  • 8
  • 410
  • 0
Scientific report: "Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers" ppt

Uploaded: 07/03/2014, 22:20
... Computational Linguistics Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers Deyi Xiong, Min Zhang, Haizhou Li Human Language Technology Institute ... Gildea. 2008. Parsers as language models for statistical machine translation. In Proceedings of AMTA. Sylvain Raybaud, Caroline Lavecchia, David Langlois, and Kamel Smaïli. 2009. New confidence ... models for statistical machine translation. In Proceedings of MT Summit IX. Intl. Assoc. for Machine Translation. David Chiang. 2007. Hierarchical phrase-based translation. Computational Linguistics,...
  • 10
  • 415
  • 0
Document: Scientific report: "Mixing Multiple Translation Models in Statistical Machine Translation" docx

Uploaded: 19/02/2014, 19:20
... USA. ACL. Almut Silja Hildebrand, Matthias Eck, Stephan Vogel, and Alex Waibel. 2005. Adaptation of the translation model for statistical machine translation based on information retrieval. In ... the Association for Computational Linguistics, HLT ’10, pages 975–983, Stroudsburg, PA, USA. ACL. Matthias Eck, Stephan Vogel, and Alex Waibel. 2004. Language model adaptation for statistical machine translation ... and Alfons Juan. 2007. Domain adaptation in statistical machine translation with mixture modelling. In Proceedings of the Second Workshop on Statistical Machine Translation, StatMT ’07, pages 177–180,...
  • 10
  • 456
  • 0
Scientific report: "Pivot Language Approach for Phrase-Based Statistical Machine Translation" pot

Uploaded: 08/03/2014, 02:21
... phrase-based SMT by using a pivot language. To perform translation between languages L_f and L_e, we bring in a pivot language L_p, for which there exist large bilingual corpora for language ... Europarl: A Parallel Corpus for Statistical Machine Translation. In Proc. of MT Summit X, pages 79-86. Philipp Koehn and Christof Monz. 2006. Manual and Automatic Evaluation of Machine Translation ... method is easy to be adapted to any language pair where a pivot language and corresponding large bilingual corpora are available. 3 Phrase-Based SMT According to the translation model presented...
  • 8
  • 205
  • 0
Scientific report: "Fast and Scalable Decoding with Language Model Look-Ahead for Phrase-based Statistical Machine Translation" potx

Uploaded: 23/03/2014, 14:20
... if we allow for a small degradation in translation performance, our approaches clearly outperform Moses in terms of translation speed. With phrase-only LM look-ahead, our decoder is faster by a ... Linguistics Fast and Scalable Decoding with Language Model Look-Ahead for Phrase-based Statistical Machine Translation Joern Wuebker, Hermann Ney Human Language Technology and Pattern Recognition Group Computer ... accelerating translation, yielding identical performance at 16 words/sec as Moses at 1.8 words/sec. Application of first-word LM look-ahead shifts the graph to the right, now reaching the same performance...
  • 5
  • 246
  • 0
Document: Scientific report: "The impact of language models and loss functions on repair disfluency detection" pptx

Uploaded: 20/02/2014, 04:20
... that language models trained on large amounts of non-speech data improve performance more than a language model trained on a more modest amount of speech data, and that optimising f-score rather ... incorporate information from the external language models by defining a reranker feature for each external language model. The value of this feature is the log probability assigned by the language ... 49th Annual Meeting of the Association for Computational Linguistics, pages 703–711, Portland, Oregon, June 19-24, 2011. © 2011 Association for Computational Linguistics The impact of language models...
  • 9
  • 609
  • 0
Document: Scientific report: "Improved Smoothing for N-gram Language Models Based on Ordinary Counts" doc

Uploaded: 20/02/2014, 09:20
... useful for any language technology task that produces natural-language text as a final (or intermediate) output. In particular, they are extensively used in speech recognition and machine translation. ... quantization error introduced by each N-gram count. We use a single value of δ for all contexts and all N-gram lengths. As an a priori “theory”-based estimate, we assume that, since the distance ... discount parameters D, with the intention that the α and β parameters correct for quantization error, and the D parameters correct for overestimation error. This is accomplished by relaxing the...
  • 4
  • 365
  • 0
Document: Scientific report: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

Uploaded: 22/02/2014, 02:20
... the web data were selected for each language. The adaptation was thought to take place off-line on a server. 3.2.1 Data sets For each language, the adaptation takes place on two baseline models, ... Acoustics, Speech, and Signal Processing (ICASSP ’05), volume I, pages 573–576. Abhinav Sethy, Shrikanth Narayanan, and Bhuvana Ramabhadran. 2007. Data driven approach for language model adaptation using ... degrading the WER only marginally. The current paper describes a new method for query selection and its applications in LM augmentation and adaptation using web data. The language models are part...
  • 9
  • 301
  • 0