ngram language model for spoken language processing

Tài liệu Báo cáo khoa học: "A Phonotactic Language Model for Spoken Language Identification" pptx

Tài liệu Báo cáo khoa học: "A Phonotactic Language Model for Spoken Language Identification" pptx

Ngày tải lên : 20/02/2014, 15:20
... June 2005. c 2005 Association for Computational Linguistics A Phonotactic Language Model for Spoken Language Identification Haizhou Li and Bin Ma Institute for Infocomm Research Singapore ... the 1996 NIST Language Recognition Evaluation database. 1 Introduction Spoken language and written language are similar in many ways. Therefore, much of the research in spoken language identification, ... Chen of Institute for Info- comm Research for insightful discussions. References Jerome R. Bellegarda. 2000. Exploiting latent semantic information in statistical language modeling , In Proc....
  • 8
  • 436
  • 0
Tài liệu Báo cáo khoa học: "GEMINI: A NATURAL LANGUAGE SYSTEM FOR SPOKEN-LANGUAGE UNDERSTANDING*" doc

Tài liệu Báo cáo khoa học: "GEMINI: A NATURAL LANGUAGE SYSTEM FOR SPOKEN-LANGUAGE UNDERSTANDING*" doc

Ngày tải lên : 20/02/2014, 21:20
... Language& quot;, Cog- nition, Vol. 2, No. 1, pp. 15-47. MADCOW (1992). "Multi-site Data Collection for a Spoken Language Corpus", in Proceedings of the DARPA Speech and Natural Language ... could also be enforced by the parse preference component de- 57 GEMINI: A NATURAL LANGUAGE SYSTEM FOR SPOKEN -LANGUAGE UNDERSTANDING* John Dowding, Jean Mark Gawron, Doug Appelt, John Bear, ... Internet: dowding@ai.sri.com 1. INTRODUCTION Gemini is a natural language (NL) under- standing system developed for spoken language applications. This paper describes the details of the system,...
  • 8
  • 376
  • 0
Tài liệu Báo cáo khoa học: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation" doc

Tài liệu Báo cáo khoa học: "A Large Scale Distributed Syntactic, Semantic and Lexical Language Model for Machine Translation" doc

Ngày tải lên : 20/02/2014, 04:20
... Dis- tributed language modeling for N-best list re-ranking. The 2006 Conference on Empirical Methods in Natu- ral Language Processing (EMNLP), 216-223. Y. Zhang, 2008. Structured language models for statisti- cal ... Syntax- based language models for statistical machine transla- tion. MT Summit IX., Intl. Assoc. for Machine Trans- lation. C. Chelba and F. Jelinek. 1998. Exploiting syntactic structure for language modeling. ... Large language models in ma- chine translation. The 2007 Conference on Empirical Methods in Natural Language Processing (EMNLP), 858-867. E. Charniak. 2001. Immediate-head parsing for language models....
  • 10
  • 567
  • 0
Tài liệu Báo cáo khoa học: "Exploiting Non-local Features for Spoken Language Understanding" pptx

Tài liệu Báo cáo khoa học: "Exploiting Non-local Features for Spoken Language Understanding" pptx

Ngày tải lên : 20/02/2014, 12:20
... performance on the statistical spoken language understanding (SLU) problem. The statistical natural language parsers trained on text perform unreliably to encode non-local informa- tion on spoken ... selec- tion algorithm is very efficient for both perfor- mance and time complexity. 2 Spoken Language Understanding as a Sequential Labeling Problem 2.1 Spoken Language Understanding The goal of SLU ... (the unstructured model) is bet- ter than CRF (the structured model) not only for time cost but also for the performance on our ex- periment 4 . This result shows that local informa- tion provides...
  • 8
  • 396
  • 0
Báo cáo khoa học: "An exponential translation model for target language morphology" pptx

Báo cáo khoa học: "An exponential translation model for target language morphology" pptx

Ngày tải lên : 07/03/2014, 22:20
... number. • For verbs, generated forms had to match the original form for tense and negation. • For adjectives, generated forms had to match the original form for degree of comparison and negation. • For ... exponen- tial models with surface and lemma features can be straightforwardly trained for all of them. For the ex- periments described below we trained an exponen- tial model for the p(Y |X) lexical model. ... generated forms had to match the original form for number, case, and gender. • Non-standard inflection forms for all POS were excluded. The following criteria were used to select rules for which...
  • 9
  • 426
  • 0
Báo cáo khoa học: "A Preference-first Language Processor Integrating the Unification Grammar and Markov Language Model for Speech Recognition-ApplicationS" potx

Báo cáo khoa học: "A Preference-first Language Processor Integrating the Unification Grammar and Markov Language Model for Speech Recognition-ApplicationS" potx

Ngày tải lên : 08/03/2014, 07:20
... Markov language model, and a simple set of unification grammar rules for the Chinese language, although the present model is in fact language independent. The system is written in C language ... signal preprocessor is included to form a complete speech recognition system. The language processor consists of a language model and a parser. The language model properly integrates the unification ... summarized. The Laneua~e Model The goal of the language model is to participate in the selection of candidate constituents for a sentence to be identified. The proposed language model is composed...
  • 6
  • 392
  • 0
Báo cáo khoa học: "Re-Ranking Models For Spoken Language Understanding Marco Dinarelli University of Trento Italy" potx

Báo cáo khoa học: "Re-Ranking Models For Spoken Language Understanding Marco Dinarelli University of Trento Italy" potx

Ngày tải lên : 08/03/2014, 21:20
... probability evaluated by the Conceptual Language Model, described in the next section. 2.1 Stochastic Conceptual Language Model (SCLM) An SCLM is an n-gram language model built on semantic tags. Using ... 202–210, Athens, Greece, 30 March – 3 April 2009. c 2009 Association for Computational Linguistics Re-Ranking Models For Spoken Language Understanding Marco Dinarelli University of Trento Italy dinarelli@disi.unitn.it Alessandro ... convenient way could improve the SLU performance. The best choice in this case is a discriminative model, since it allows for the use of informative features, which, in turn, can model easily feature dependen- cies...
  • 9
  • 330
  • 0
Tài liệu Báo cáo khoa học: "Minimum Cut Model for Spoken Lecture Segmentation" ppt

Tài liệu Báo cáo khoa học: "Minimum Cut Model for Spoken Lecture Segmentation" ppt

Ngày tải lên : 20/02/2014, 11:21
... con- sistently outperforms the similarity-based baseline on all the lecture datasets. We attribute this gain to the presence of more attenuated topic transi- tions in spoken language. Since spoken language is ... did not try to ad- just our model to optimize its performance on the synthetic data. The smoothing method developed for lecture segmentation may not be appropriate for short segments ranging from ... increase for the UI system. We attribute this feature to the fact that the model is less dependent on individual recognition errors, which have a detrimental effect on the local seg- ment language modeling...
  • 8
  • 495
  • 0
Tài liệu Báo cáo khoa học: "Combining Functionality and Object Orientedness for Natural Language Processing" ppt

Tài liệu Báo cáo khoa học: "Combining Functionality and Object Orientedness for Natural Language Processing" ppt

Ngày tải lên : 21/02/2014, 20:20
... Domain A class is defined for each constant of PAL. A class object for a lexical item contains linguistic knowledge in a procedural form. In other words, a class contains information as to how a ... answered based on a predetermined set theoretical model. For example, a noun is interpreted as a set of entities; the noun "penguin", for instance, is interpreted as a set of all penguins. ... to share methods for these cases. Any exceptional method can be attached to lower level items. For example, we can define a class "action verb" which has methods for instrumental...
  • 4
  • 422
  • 0
Báo cáo khoa học: "Grammar Approximation by Representative Sublanguage: A New Model for Language Learning" potx

Báo cáo khoa học: "Grammar Approximation by Representative Sublanguage: A New Model for Language Learning" potx

Ngày tải lên : 08/03/2014, 02:21
... Computational Linguistics Grammar Approximation by Representative Sublanguage: A New Model for Language Learning Smaranda Muresan Institute for Advanced Computer Studies University of Maryland College ... have formally defined the ILP-learning problem as the tu- ple , where is the provability re- lation (also called the generalization model) , is the language of the background knowledge, is the language ... only for very limited subclasses of first-order logic (Kietz and Dˇzeroski, 1994; Cohen, 1995), which are not appropriate to model natural language grammars. Our grammar induction problem can be formu- lated...
  • 8
  • 402
  • 0
Báo cáo khoa học: "A Flexible Stand-Off Data Model with Query Language for Multi-Level Annotation" ppt

Báo cáo khoa học: "A Flexible Stand-Off Data Model with Query Language for Multi-Level Annotation" ppt

Ngày tải lên : 08/03/2014, 04:22
... Sessions, pages 109–112, Ann Arbor, June 2005. c 2005 Association for Computational Linguistics A Flexible Stand-Off Data Model with Query Language for Multi-Level Annotation Christoph M ¨ uller EML Research ... Germany mueller@eml-research.de Abstract We present an implemented XML data model and a new, simplified query language for multi-level an- notated corpora. The new query language involves automatic conversion of queries ... language. It offers a simpler and more con- cise way to formulate certain types of queries for multi-level annotated corpora. Queries are automat- ically converted into the underlying query language and...
  • 4
  • 348
  • 0
Báo cáo khoa học: "Splitting Long or Ill-formed Input for Robust Spoken-language Translation" docx

Báo cáo khoa học: "Splitting Long or Ill-formed Input for Robust Spoken-language Translation" docx

Ngày tải lên : 08/03/2014, 05:21
... sir for how many people please" Figure 3: Structure for (1) 3.2 Splitting input into well-formed parts and ill-formed parts Item (C) splits input into well-formed parts and ill-formed ... Introduction A spoken -language translation system requires the ability to treat long or ill-formed input. An utterance as input of a spoken -language trans- lation system, is not always one well-formed ... Since our splitting method is performed under left-to-right parsing, translation efficiency is not 426 Splitting Long or Ill-formed Input for Robust Spoken -language Translation Osamu FURUSE...
  • 7
  • 359
  • 0