0

combining a statistical language model

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Phonotactic Language Model for Spoken Language Identification" pptx

Báo cáo khoa học

... NIST Language Recognition Evaluation database. 1 Introduction Spoken language and written language are similar in many ways. Therefore, much of the research in spoken language identification, ... Recognition Evaluation (LRE) data. The database was intended to establish a baseline of performance capability for language recognition of conversational tele-phone speech. The database contains recorded ... by a chan-nel noise. The n-gram language model has achieved equal amounts of success in both tasks, e.g. n-character slice for text categorization by lan-guage (Cavnar and Trenkle, 1994) and...
  • 8
  • 436
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Reading Level Assessment Using Support Vector Machines and Statistical Language Models" pdf

Báo cáo khoa học

... measures are inadequate dueto their reliance on vocabulary lists and/or a superfi-cial representation of syntax. Our approach uses n-gram language models as a low-cost automatic ap-proximation of ... syntactic and semantic analy-sis. Statistical language models (LMs) are used suc-cessfully in this way in other areas of NLP such asspeech recognition and machine translation. We alsouse a ... categories relative toeach other.4.1 Statistical Language Models Statistical LMs predict the probability that a partic-ular word sequence will occur. The most commonlyused statistical language...
  • 8
  • 446
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Japanese OCR Error Correction using Character Shape Similarity and Statistical Language Model " pptx

Báo cáo khoa học

... Statistical Language Model Masaaki NAGATA NTT Information and Communication Systems Laboratories 1-1 Hikari-no-oka Yokosuka-Shi Kanagawa, 239-0847 Japan nagata@nttnly, isl. ntt. co. jp Abstract ... approxi- mate word matching method using character shape similarity, and a word segmentation algorithm us- ing a statistical language model. By using a sta- tistical OCR model and character shape ... present a novel OCR error correction method for languages without word delimiters that have a large character set, such as Japanese and Chinese. It consists of a statistical OCR model, an approxi-...
  • 7
  • 472
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Generating statistical language models from interpretation grammars in dialogue systems" potx

Báo cáo khoa học

... Gram-matical Framework (GF) (Ranta, 2004).We create a statistical language model (SLM) directly from our interpretationgrammar and compare recognition per-formance of this model against a ... ofFunctional Programming., Vol. 14, No. 2, pp. 145–189.Ranta A. Grammatical Framework Homepagehttp://www.cs.chalmers.se/˜aarne/GF, as of May2005.Raux A. , Langner B., Black A. and Eskenazi M. ... Structureinto Statistical Language Models. In PhilosophicalTransactions of the Royal Society of London A, 358.Solsona R., Fosler-Lussier E., Kuo H.J., Potamianos A. and Zitouni I. 2002. Adaptive Language...
  • 8
  • 381
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Structured Language Model" ppt

Báo cáo khoa học

... Proceedings of the Human Language Technology Workshop, 272-277. ARPA. Raymond Lau, Ronald Rosenfeld, and Salim Roukos. 1993. Trigger-based language models: a maximum entropy approach. In Proceedings ... University, Baltimore, MD. Frederick Jelinek, John Lafferty, David M. Mager- man, Robert Mercer, Adwait Ratnaparkhi, Salim Roukos. 1994. Decision Tree Parsing using a Hid- den Derivational Model. ... those assigned man- ually in the Penn Treebank (Marcus95) after under- going headword percolation and binarization. All four LMs predict a word wk and they were implemented using the Maximum...
  • 3
  • 342
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Discriminative Language Model with Pseudo-Negative Samples" pptx

Báo cáo khoa học

... that they have the dis-advantage of being computationally expensive, andnot all relevant features can be included. A discriminative language model (DLM) assigns a scoreto a sentence , measuring ... spe-cific applications and therefore were able to obtainreal negative examples easily. For example, Roark(2007) proposed a discriminative language model, inwhich a model is trained so that a correct ... June.Brian Roark, Murat Saraclar, and Michael Collins. 2007.Discriminative n-gram language modeling. computerspeech and language. Computer Speech and Lan-guage, 21(2):373–392.Roni Rosenfeld, Stanley...
  • 8
  • 315
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Statistical Model for Unsupervised and Semi-supervised Transliteration Mining" pptx

Báo cáo khoa học

... International Language Resources and Evaluation (LREC’10), Val-letta, Malta.Sittichai Jiampojamarn, Kenneth Dwyer, Shane Bergsma,Aditya Bhargava, Qing Dou, Mi-Young Kim, andGrzegorz Kondrak. ... systemlearns this as a non-transliteration but it is wronglyannotated as a transliteration in the gold standard.Arabic nouns have an article “al” attached to themwhich is translated in English as ... usesHidden Markov Models (Nabende, 2010; Darwish,2010; Jiampojamarn et al., 2010), Finite State Au-tomata (Noeman and Madkour, 2010) and Bayesianlearning (Kahki et al., 2011) to learn transliterationpairs...
  • 9
  • 521
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation" doc

Báo cáo khoa học

... signif-icantly. Bear in mind that Charniak et al. (2003) in-tegrated Charniak’s language model with the syntax-based translation model Yamada and Knight pro-posed (2001) to rescore a tree-to-string ... Stochastic analysis of lexical andsemantic enhanced structural language model. The 8thInternational Colloquium on Grammatical Inference(ICGI), 97-111.K. Yamada and K. Knight. 2001. A syntax-based ... (EMNLP),858-867.E. Charniak. 2001. Immediate-head parsing for language models. The 39th Annual Conference on Associationof Computational Linguistics (ACL), 124-131.E. Charniak, K. Knight and K. Yamada. 2003....
  • 10
  • 567
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Discriminative Lexicon Adaptation for Improved Character Accuracy – A New Direction in Chinese Language Modeling" pptx

Báo cáo khoa học

... parts randomly: 5K as the adaptation corpusand 5K as the testing set. We show the ASR char-acter accuracy results after lexicon adaptation bythe proposed approach in Table 3.LAICA-1 LAICA-2 A ... replaced by characters, we cantreat words as a means to enhance character recog-nition accuracy. Such arguments stand at least forChinese ASR since they evaluate on character errorrate and ... total path probability mass. This can beamended by involving the discriminative language model adaptation in the iteration, which results in a unified language model and lexicon adaptationframework....
  • 9
  • 466
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Smoothing a Tera-word Language Model" doc

Báo cáo khoa học

... and Linda C. Bauman Peto. 1995. A hierarchical Dirichlet language model. Natural Lan-guage Engineering, 1(3):1–19.Y.W. Teh. 2006. A hierarchical Bayesian language model based on Pitman-Yor processes. ... n-grams:C(ab) − C(ab∗). A( ab) = max(1, K(C(ab) − C(ab∗))) A different K constant is chosen for each n-gramorder. Using this formulation as an interpolated 5-gram language model gives a cross ... Speech and Language. R. Kneser and H. Ney. 1995. Improved backing-off form-gram language modeling. In International Confer-ence on Acoustics, Speech, and Signal Processing.David J. C. Mackay and...
  • 4
  • 425
  • 1
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Succinct N-gram Language Model" ppt

Báo cáo khoa học

... com-pression tasks achieved a significant com-pression rate without any loss.1 IntroductionThere has been an increase in available N -gramdata and a large amount of web-scaled N-gramdata has been ... the ACL-IJCNLP 2009 Conference Short Papers, pages 341–344,Suntec, Singapore, 4 August 2009.c2009 ACL and AFNLP A Succinct N-gram Language Model Taro Watanabe Hajime Tsukada Hideki IsozakiNTT ... Communication Science Laboratories2-4 Hikaridai Seika-cho Soraku-gun Kyoto 619-0237 Japan{taro,tsukada,isozaki}@cslab.kecl.ntt.co.jpAbstractEfficient processing of tera-scale text datais an important...
  • 4
  • 457
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Localized Prediction Model for Statistical Machine Translation" ppt

Báo cáo khoa học

... set of candidates. This computational advantageis the main reason that we adopt the local model in thispaper.3.3 Global versus Local ModelsBoth the global and the localized log-linear models ... paper, we present a block-based model for statis-tical machine translation. A block is a pair of phraseswhich are translations of each other. For example, Fig. 1shows an Arabic-English translation ... Boston, MA, May.Christoph Tillmann and Fei Xia. 2003. A Phrase-basedUnigram Model for Statistical Machine Translation. InCompanian Vol. of the Joint HLT and NAACL Confer-ence (HLT 03), pages...
  • 8
  • 578
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "SIMULATING CHILDREN''''S NULL SUBJECTS: A NEARLY LANGUAGE GENERATION MODEL" ppt

Báo cáo khoa học

... Universal Grammar and American Sign Language: Setting the Null Argument Parameters. Dordrecht: Kluwer Academic Publishers. MacWhinney, B., & Snow, C. (1985). The Child Language Data Exchange ... form a 'maximal' phrase or XP. Lexical items are inserted as soon as the appropriate X ° heads (or XPs, for pro-forms) become available. Each time a structural unit is built, and each ... while leaving the NPL and NPI parameters set at the default (negative) values. FELICITY can also be used to address theories pertaining to other aspects of language acquisition that appear slightly...
  • 3
  • 372
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Enhancing Language Models in Statistical Machine Translation with Backward N-grams and Mutual Information Triggers" ppt

Báo cáo khoa học

... Philadelphia, Pennsylva-nia, USA, July.Matt Post and Daniel Gildea. 2008. Parsers as language models for statistical machine translation. In Proceed-ings of AMTA.Sylvain Raybaud, Caroline Lavecchia, ... prediction ability, we present two ex-tensions to standard n-gram language mod-els in statistical machine translation: a back-ward language model that augments the con-ventional forward language model, ... that a language model that embraces a larger context provides better pre-diction ability, we learn additional information fromtraining data to enhance conventional n-gram lan-guage models and...
  • 10
  • 415
  • 0

Xem thêm