improving automatic speech recognition for lectures

Scientific report: "Improving Automatic Speech Recognition for Lectures through Transformation-based Rules Learned from Minimal Data"

... 764–772, Suntec, Singapore, 2-7 August 2009. ©2009 ACL and AFNLP. Improving Automatic Speech Recognition for Lectures through Transformation-based Rules Learned from Minimal Data. Cosmin Munteanu∗† ∗National ... scoring function. 1 Introduction. Improving access to archives of recorded lectures is a task that, by its very nature, requires research efforts common to both Automatic Speech Recognition (ASR) ... Workshop on Automatic Speech Recognition and Understanding (ASRU), pages 347–354. C. Fügen, M. Kolss, D. Bernreuther, M. Paulik, S. Stüker, S. Vogel, and A. Waibel. 2006. Open domain speech recognition...

Scientific report: "WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings"

... this paper the best performing measures from (Pucher, 2005), which outperform baseline models on word prediction for conversational telephone speech are used for Automatic Speech Recognition ... 129–132, Prague, June 2007. ©2007 Association for Computational Linguistics. WordNet-based Semantic Relatedness Measures in Automatic Speech Recognition for Meetings. Michael Pucher, Telecommunications ... conversational speech. The JCN (Section 2.1) measure performs best for nouns using the noun-context. The LESK (Section 2.1) measure performs best for verbs and adjectives using a mixed word-context. Text-based...

Scientific report: "Grounded Language Modeling for Automatic Speech Recognition of Sports Video"

... transcription. For example, if the ASR output contains the term sequence “… and farther home run for David forty says…” and the closed captioning contains the sequence “…another home run for David ... evaluate the performance of our grounded language model on a speech recognition task using video highlights from Major League Baseball games. Results indicate improved performance using three ... captioning transcriptions (i.e., no ASR). 5 Conclusions. We have described a method for improving speech recognition in video. The method uses grounded language modeling, an extension of tradition...

Scientific report: "Automatic Speech Recognition and Its Application to Information Extraction"

... for spontaneous speech recognition. One of the most important issues for speech recognition is how to create language models (rules) for spontaneous speech. When recognizing spontaneous speech ... travel information system at LIMSI. The ARISE (Automatic Railway Information Systems for Europe) project aims at developing prototype telephone information services for train travel information ... ROBUST SPEECH RECOGNITION. 4.1 Automatic adaptation. Ultimately, speech recognition systems should be capable of robust, speaker-independent or speaker-adaptive, continuous speech recognition ...

Scientific report: "Distributed Listening: A Parallel Processing Approach to Automatic Speech Recognition"

... multiple speech recognizers in an effort to improve speech recognition, as discussed next. 2.1 Enhanced Majority Rules. Barry et al. (1994) took three different Automatic Speech Recognition ... the same input, performed speech recognition, and sent the result to the master system. The EMR resolved inconsistencies by looking for agreement from the individual systems for the recognized ... & Gauvain, J., Improved ROVER using Language Model Information, In ISCA ITRW Workshop on Automatic Speech Recognition: Challenges for the new Millenium, Paris, pp. 47–52, 2000. Young,...

Scientific report: "Learning Sub-Word Units for Open Vocabulary Speech Recognition"

... hybrid system for open vocabulary speech recognition. Rather than relying on the text alone, we also utilize side information: a mapping of words to classes so we can optimize learning for a specific ... and MIT Lectures task respectively. 1 Introduction. Most automatic speech recognition systems operate with a large but limited vocabulary, finding the most likely words in the vocabulary for the ... b ax, d ae n. The latter is more useful for automatically recovering the word’s orthographic form, identifying that an OOV was spoken, or improving performance of a spoken term detection system...

Scientific report: "Discriminative Syntactic Language Modeling for Speech Recognition"

... Previous Work. Techniques for exploiting stochastic context-free grammars for language modeling have been explored for more than a decade. Early approaches included algorithms for efficiently calculating ... are a first step in examining the potential utility of syntactic features for discriminative language modeling for speech recognition. We tried two possible sets of features derived from the full ... Using a stochastic context-free grammar as a language model for speech recognition. In Proceedings of the IEEE Conference on Acoustics, Speech, and Signal Processing, pages 189–192. John Lafferty,...

Scientific report: "A Method for Correcting Errors in Speech Recognition Using the Statistical Features of Character Co-occurrence"

... to correct the errors in the results of speech recognition to increase the performance of a speech translation system. This paper proposes a method for correcting errors using the statistical ... integrating recognition and translation into a speech translation system, the development of the following processes is therefore important: (1) detection of errors in speech recognition results; ... correct string for the string between A and B in the Error-String (see figure 2-3). 3. Evaluation 3.1 Data Condition for Experiments Results of Speech Recognition: We used 4806 recognition...

Scientific report: "A Finite-State Parser for Use in Speech Recognition"

... invariants; allophonic variation is traditionally seen as problematic for recognition. (I) "In most systems for sentence recognition, such modifications must be viewed as a kind of 'noise' ... before the labial stop /p/, the coronal nasal /n/ before the coronal stop /t/, and the velar nasal /ŋ/ before the velar stop /k/. This constraint, like subject-verb agreement, poses a problem for ... This view of allophonic variation is representative of much of the speech recognition literature, especially during the ARPA speech project. One can find similar statements by Cole and Jakimik...

Scientific report: "Web augmentation of language models for continuous speech recognition of SMS text messages"

... rates were 17.0 for English, 18.7 for Spanish, and 22.5 for French. For English, we also created web mixture models with KN smoothing. The error rates were 16.5, 15.9 and 15.7 for the 20 MB, ... 2.2.1) for the same number of queries. Also results from language modeling and speech recognition experiments favored statistical querying. 2.3 Web collections obtained. For the speech recognition ... room for improvement. 3.1.2 Word error rates. Speech recognition results for the different LMs are given in Table 2. The results are consistent in the sense that the web mixture models outperform the...

Scientific report: "Vocabulary Decomposition for Estonian Open Vocabulary Speech Recognition"

... sizes. 1 Introduction. 1.1 OOV problem. Open vocabulary speech recognition refers to automatic speech recognition (ASR) of continuous speech, or “speech-to-text” of spoken language, where the recognizer ... Computational Model for Word-Form Recognition and Production. University of Helsinki, Helsinki, Finland. Tanel Alumäe. 2006. Methods for Estonian Large Vocabulary Speech Recognition. PhD Thesis. ... Vocabulary Speech Recognition with Flat Hybrid Models. INTERSPEECH-2005, 725–728. Janne Pylkkönen. 2005. An Efficient One-pass Decoder for Finnish Large Vocabulary Continuous Speech Recognition. ...

Scientific report: "A Preference-first Language Processor Integrating the Unification Grammar and Markov Language Model for Speech Recognition Applications"

... developed for the preference-first parsing algorithm for different applications. For example, there can be various construction principles to determine the order of constituent construction for ... Unification Grammar and Markov Language Model for Continuous Speech Recognition. Proceedings of the IEEE 1990 International Conference on Acoustics, Speech and Signal Processing, Albuquerque, NM, ... (1986). An Efficient Word Lattice Parsing Algorithm for Continuous Speech Recognition. Proceedings of the 1986 International Conference on Acoustic, Speech and Signal Processing, pp. 1569-1572....

Scientific report: "Using Chunk Based Partial Parsing of Spontaneous Speech in Unrestricted Domains for Reducing Word Error Rate in Speech Recognition"

... Spontaneous Speech in Unrestricted Domains for Reducing Word Error Rate in Speech Recognition. Klaus Zechner and Alex Waibel, Language Technologies Institute, Carnegie Mellon University, 5000 Forbes ... strong rationale for following this simple approach is the nature of the ill-formed input due to (i) spontaneous speech dysfluencies, and (ii) errors in the hypotheses of the speech recognizer. ... lattices. The hypotheses from the Nbest lists are tagged for part of speech, "cleaned up" by a preprocessing pipe, parsed by a part of speech based chunk parser, and rescored using a backpropagation...