0

luanvansieucap

Nạp tiền Tải lên

Đăng ký Đăng nhập

Đăng ký

Đăng nhập

0

constructing statistical language models

Báo cáo khoa học:

Báo cáo khoa học: "Generalized Algorithms for Constructing Statistical Language Models" pdf

Danh mục: Báo cáo khoa học

... .Class-based models. In many applications, it is nat-ural and convenient to construct class-based language models, that is models based on classes of words (Brownet al., 1992). Such models are ... construc-tion of language models found in new language process-ing applications and reported experimental results show-ing their practicality for constructing very large models. These algorithms ... by as-signing them some probabilities. There are classicaltechniques for constructing language models such as-gram models with various smoothing techniques (seeChen and Goodman (1998) and...

8
389
0

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Reading Level Assessment Using Support Vector Machines and Statistical Language Models" pdf

Danh mục: Báo cáo khoa học

... using statistical language models. In this paper, we also use support vectormachines to combine features from tradi-tional reading level measures, statistical language models, and other language ... that categoryor not, rather than constructing a classiﬁer whichranks documents into different categories relative toeach other.4.1 Statistical Language Models Statistical LMs predict the probability ... of syntax. Our approach uses n-gram language models as a low-cost automatic ap-proximation of both syntactic and semantic analy-sis. Statistical language models (LMs) are used suc-cessfully...

8
446
0

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Generating statistical language models from interpretation grammars in dialogue systems" potx

Danh mục: Báo cáo khoa học

... decades of statistical language modeling: Where do we go from here? In Proceed-ings of IEEE:88(8).Rosenfeld R. 2000. Incorporating Linguistic Structureinto Statistical Language Models. In ... comparison of in-grammar recognition performance.3 Language modellingTo generate the different trigram language models we used the SRI language modelling toolkit (Stol-cke, 2002) with Good-Turing ... movespeciﬁc statistical language models (DM-SLMs)by using GF to generate all utterances that arespeciﬁc to certain dialogue moves from our in-terpretation grammar. In this way we can pro-duce models...

8
381
0

Báo cáo khoa học:

Báo cáo khoa học: "Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers" ppt

Danh mục: Báo cáo khoa học

... of statistical machine translation: Parameter estimation. Computa-tional Linguistics, 19(2):263–311.Eugene Charniak, Kevin Knight, and Kenji Yamada.2003. Syntax-based language models for statistical machine ... as language models for statistical machine translation. In Proceed-ings of AMTA.Sylvain Raybaud, Caroline Lavecchia, David Langlois,and Kamel Sma¨ıli. 2009. New conﬁdence measuresfor statistical ... Computational LinguisticsEnhancing Language Models in Statistical Machine Translationwith Backward N-grams and Mutual Information TriggersDeyi Xiong, Min Zhang, Haizhou LiHuman Language TechnologyInstitute...

10
415
0

Báo cáo khoa học:

Báo cáo khoa học: "Phrase-based Statistical Language Generation using Graphical Models and Active Learning" potx

Danh mục: Báo cáo khoa học

10
382
0

Báo cáo khoa học:

Báo cáo khoa học: "Continuous Space Language Models for Statistical Machine Translation" pdf

Danh mục: Báo cáo khoa học

8
345
0

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Incremental Syntactic Language Models for Phrase-based Translation" pptx

Danh mục: Báo cáo khoa học

... research in statistical machine trans-lation has effectively used n-gram word sequence models as language models. Modern phrase-based translation using large scalen-gram language models generally ... to incorporate large-scale n-gram language models in conjunction withincremental syntactic language models. The added decoding time cost of our syntactic language model is very high. By increasing ... translation model. Instead, we incor-porate syntax into the language model.Traditional approaches to language models inspeech recognition and statistical machine transla-tion focus on the use of...

12
510
0

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "The impact of language models and loss functions on repair disﬂuency detection" pptx

Danh mục: Báo cáo khoa học

... language models trained from text or speech corpora of vari-ous genres and sizes. The largest available language models are based on written text: we investigate theeffect of written text language models ... dif-ferences among the different language models whenextended features are present are relatively small.We assume that much of the information expressedin the language models overlaps with the lexical ... information fromthe external language models by deﬁning a rerankerfeature for each external language model. The valueof this feature is the log probability assigned by the language model to the candidate...

9
609
0

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "An Empirical Investigation of Discounting in Cross-Domain Language Models" ppt

Danh mục: Báo cáo khoa học

... 2006. MAP adaptation of stochasticgrammars. Computer Speech & Language, 20(1):41 –68.Jerome R. Bellegarda. 2004. Statistical language modeladaptation: review and perspectives. Speech Commu-nication, ... of EnglishBigrams. Computer Speech & Language, 5(1):19–54.Joshua Goodman. 2001. A Bit of Progress in Language Modeling. Computer Speech & Language, 15(4):403–434.Bo-June (Paul) Hsu ... N-gram Language Models Based onOrdinary Counts. In Proceedings of the ACL-IJCNLP2009 Conference Short Papers, pages 349–352.Ronald Rosenfeld. 1996. A Maximum Entropy Ap-proach to Adaptive Statistical...

6
444
0

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Improved Smoothing for N-gram Language Models Based on Ordinary Counts" doc

Danh mục: Báo cáo khoa học

... Kneser-Ney andthose methods.1 Introduction Statistical language models are potentially usefulfor any language technology task that producesnatural -language text as a ﬁnal (or intermediate)output. ... perplexity of any known methodfor estimating N-gram language models. Kneser-Ney smoothing, however, requiresnonstandard N-gram counts for the lower-order models used to smooth the highest-order model. ... best approach when language models based on ordinary counts are desired.ReferencesChen, Stanley F., and Joshua Goodman. 1998.An empirical study of smoothing techniques for language modeling....

4
365
0

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Japanese OCR Error Correction using Character Shape Similarity and Statistical Language Model " pptx

Danh mục: Báo cáo khoa học

... distance (Wagner and Fischer, 1974) and ngram distance (Angell et al., 1983). Recently, statistical language models and feature- based method have been used for context-sensitive spelling correction, ... times is c/(n + r). 923 Japanese OCR Error Correction using Character Shape Similarity and Statistical Language Model Masaaki NAGATA NTT Information and Communication Systems Laboratories 1-1 ... novel OCR error correction method for languages without word delimiters that have a large character set, such as Japanese and Chinese. It consists of a statistical OCR model, an approxi- mate...

7
472
0

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Web augmentation of language models for continuous speech recognition of SMS text messages" docx

Danh mục: Báo cáo khoa học

... 2007. Large language models in machine translation. In Proceedingsof the 2007 Joint Conference on Empirical Meth-ods in Natural Language Processing and Com-putational Natural Language Learning ... Kneser-Ney smoothed n-gram models. IEEE Transac-tions on Audio, Speech and Language Processing,15(5):1617–1624.A. Stolcke. 1998. Entropy-based pruning of backoff language models. In Proc. DARPA ... wereselected for each language. The adaptation wasthought to take place off-line on a server.3.2.1 Data setsFor each language, the adaptation takes place ontwo baseline models, which are the...

9
301
0

Báo cáo khoa học:

Báo cáo khoa học: "The use of formal language models in the typology of the morphology of Amerindian languages" potx

Danh mục: Báo cáo khoa học

... grammars for modeling agglutinationin this language, but first we will present the for-mer class of languages and its acceptor automata.3.1 Linear context free languages andtwo-taped nondeterministic ... 2010.c2010 Association for Computational LinguisticsThe use of formal language models in the typology of the morphology ofAmerindian languagesAndrés Osvaldo PortaUniversidad de Buenos Aireshugporta@yahoo.com.arAbstractThe ... natural representa-tion in terms of linear context-free languages.2 Quichua SantiagueñoThe quichua santiagueño is a language of theQuechua language family. It is spoken in the San-tiago del...

6
439
0

Báo cáo khoa học:

Báo cáo khoa học: "Faster and Smaller N -Gram Language Models" pptx

Danh mục: Báo cáo khoa học

... novel language modelcaching technique that improves the queryspeed of our language models (and SRILM)by up to 300%.1 IntroductionFor modern statistical machine translation systems, language models ... with two different language models. Our ﬁrst language model, WMT2010, was a 5-gram Kneser-Ney language model which storesprobability/back-off pairs as values. We trained this language model on ... and Smaller N -Gram Language Models Adam Pauls Dan KleinComputer Science DivisionUniversity of California, Berkeley{adpauls,klein}@cs.berkeley.eduAbstractN-gram language models are a major...

10
463
0

Báo cáo khoa học:

Báo cáo khoa học: "Randomized Language Models via Perfect Hash Functions" pptx

Danh mục: Báo cáo khoa học

... 2007.Compressing trigram language models with golombcoding. In Proceedings of EMNLP-CoNLL 2007,Prague, Czech Republic, June.P. Clarkson and R. Rosenfeld. 1997 . Statistical language modeling using ... 2007a. Randomised language modelling for statistical machine translation. In 45thAnnual Meeting of the ACL 2007, Prague.D. Talbot and M. Osborne. 2007b. Smoothed Bloomﬁlter language models: Tera-scale ... alignmenttemplate approach to statistical machine translation.Computational Linguistics, 30(4):417–449.Andreas Stolcke. 1998. Entropy-based pruning of back-off language models. In Proc. DARPA Broadcast...

9
273
0

Báo cáo khoa học:

Báo cáo khoa học: "Segmented and unsegmented dialogue-act annotation with statistical dialogue models∗" ppt

Danh mục: Báo cáo khoa học

... the possibility of applying statistical models to the annotation problem is really inter-esting. Moreover, it gives the possibility of evalu-ating the statistical models. The evaluation of theperformance ... or statistical ma-chine translation), an alternative data-based ap-proach has been developed in the last decade (Stol-cke et al., 2000; Young, 2000). This approach re-lies on statistical models ... Pr(Wsk−dsk−(d+1)+1|Uk)This model can be easily implemented usingsimple statistical models (N-grams and HiddenMarkov Models) . The decoding (segmentationand DA assignation) was implemented using...

8
387
0

Báo cáo khoa học:

Báo cáo khoa học: "Combining a Statistical Language Model with Logistic Regression to Predict the Lexical and Syntactic Difﬁculty of Texts for FFL" potx

Danh mục: Báo cáo khoa học

... use of language models in-stead of word lists to measure lexical complex-ity. Schwarm and Ostendorf (2005) developeda SVM categoriser combining a classiﬁer basedon trigram language models ... measures forﬁrst and second language texts. In Proceedings ofNAACL HLT, pages 460–467.M. Heilman, K. Collins-Thompson, and M. Eskenazi.2008. An analysis of statistical models and fea-tures for ... Methods in Language Process-ing, volume 12. Manchester, UK.S.E. Schwarm and M. Ostendorf. 2005. Reading levelassessment using support vector machines and sta-tistical language models. Proceedings...

9
514
0

Báo cáo khoa học:

Báo cáo khoa học: "Cutting the Long Tail: Hybrid Language Models for Translation Style Adaptation" doc

Danh mục: Báo cáo khoa học

10
335
0

Báo cáo khoa học:

Báo cáo khoa học: "Deciphering Foreign Language by Combining Language Models and Context Vectors" pdf

Danh mục: Báo cáo khoa học

9
352
0

Báo cáo khoa học:

Báo cáo khoa học: "Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining" ppt

Danh mục: Báo cáo khoa học

9
319
0

Bạn có muốn tìm thêm với từ khóa:

Tìm thêm: hệ việt nam nhật bản và sức hấp dẫn của tiếng nhật tại việt nam xác định các mục tiêu của chương trình xác định các nguyên tắc biên soạn khảo sát các chuẩn giảng dạy tiếng nhật từ góc độ lí thuyết và thực tiễn xác định thời lượng học về mặt lí thuyết và thực tế khảo sát thực tế giảng dạy tiếng nhật không chuyên ngữ tại việt nam xác định mức độ đáp ứng về văn hoá và chuyên môn trong ct phát huy những thành tựu công nghệ mới nhất được áp dụng vào công tác dạy và học ngoại ngữ mở máy động cơ lồng sóc mở máy động cơ rôto dây quấn các đặc tính của động cơ điện không đồng bộ hệ số công suất cosp fi p2 đặc tuyến hiệu suất h fi p2 đặc tuyến mômen quay m fi p2 đặc tuyến tốc độ rôto n fi p2 động cơ điện không đồng bộ một pha sự cần thiết phải đầu tư xây dựng nhà máy phần 3 giới thiệu nguyên liệu từ bảng 3 1 ta thấy ngoài hai thành phần chủ yếu và chiếm tỷ lệ cao nhất là tinh bột và cacbonhydrat trong hạt gạo tẻ còn chứa đường cellulose hemicellulose chỉ tiêu chất lượng 9 tr 25