noisy parallel software corpora

Scientific report: "Extracting Paraphrases of Technical Terms from Noisy Parallel Software Corpora"

Scientific report

... desired. Paraphrases can be extracted from non-parallel corpora using contextual similarity (Lin, 1998). They can also be obtained from parallel corpora if such data is available (Barzilay and ... 4 August 2009. © 2009 ACL and AFNLP. Extracting Paraphrases of Technical Terms from Noisy Parallel Software Corpora. Xiaoyin Wang1,2, David Lo1, Jing Jiang1, Lu Zhang2, Hong Mei2. 1School ... multilingual corpora (Bannard and Callison-Burch, 2005; Zhao et al., 2008). The approach in (Barzilay and McKeown, 2001) does not use deep linguistic analysis and therefore is suitable to noisy corpora...
Scientific report: "A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora"

Scientific report

... dim(V2) ... A Pattern Matching Method for Finding Noun and Proper Noun Translations from Noisy Parallel Corpora. Pascale Fung, Computer Science Department, Columbia University, New York, NY 10027 ... aligned, parallel texts. However, sentence alignment (Brown et al. 1991; Kay & Röscheisen 1993; Gale & Church 1993; Church 1993; Chen 1993; Wu 1994) is not always practical when corpora ... small Chinese/English parallel corpus of approximately 5760 unique English words. The outline of the algorithm is as follows: 1. Tag the English half of the parallel text. In the first...
A parallel implementation on modern hardware for geo-electrical tomographical software

Information technology

... three groups [1]: Instruction-Level Parallel (Fine-Grained Control Parallelism); Process-Level Parallel (Coarse-Grained Control Parallelism). OpenCL's data parallelism is quite like CUDA. There ... A PARALLEL IMPLEMENTATION ON MODERN HARDWARE FOR GEO-ELECTRICAL TOMOGRAPHICAL SOFTWARE. NGUYỄN HOÀNG VŨ ... 1.1 An overview of modern parallel architectures; 1.1.1 Instruction-Level Parallel Architectures; 1.1.2 Process-Level Parallel Architectures; 1.1.3 Data parallel architectures; 1.1.4...
Scientific report: "ParaSense or How to Use Parallel Corpora for Word Sense Disambiguation"

Scientific report

... as there are parallel corpora available for the targeted languages. Although large multilingual corpora are still rather scarce, we strongly believe there will be more parallel corpora available in ... to WSD that directly incorporates evidence from four other languages. To this end, we build further on two well-known research ideas: (1) the possibility to use parallel corpora to extract translation ... of large quantities of parallel text, internet corpora such as the ever growing Wikipedia corpus, etc.). Another line of research could be the exploitation of comparable corpora to acquire additional...
Scientific report: "ALIGNING SENTENCES IN PARALLEL CORPORA"

Scientific report

... sentences of each length less than 8]. We estimated the probabilities ... ALIGNING SENTENCES IN PARALLEL CORPORA. Peter F. Brown, Jennifer C. Lai, and Robert L. Mercer, IBM Thomas J. Watson Research ... describe a statistical technique for aligning sentences with their translations in two parallel corpora. In addition to certain anchor points that are available in our data, the only information ... sentence, or even a whole passage, may be missing from one or the other of the corpora. If a person is given two parallel texts and asked to match up the sentences in them, it is natural for...
Scientific report: "Extracting Parallel Sub-Sentential Fragments from Non-Parallel Corpora"

Scientific report

... for parallel data acquisition is highly beneficial for the SMT field. Comparable corpora exhibit various degrees of parallelism. Fung and Cheung (2004a) describe corpora ranging from noisy parallel, ... comparable or non-parallel sentence pairs. Since our approach can extract parallel data from texts which contain few or no parallel sentences, it greatly expands the range of corpora which can ... the relevant information concerning these corpora. 3.2 Extraction Experiments. On each of our comparable corpora, and using each of our initial parallel corpora, we apply both the fragment extraction...
Scientific report: "Paraphrasing with Bilingual Parallel Corpora"

Scientific report

... probability to include multiple corpora, as follows: $\hat{e}_2 = \arg\max_{e_2 \neq e_1} \sum_{C} \sum_{f \text{ in } C} p(f|e_1)\,p(e_2|f)$ (5), where C is a parallel corpus from a set of parallel corpora. For this condition we ... word- and sentence-aligned parallel corpora. In Proceedings of ACL. Mona Diab and Philip Resnik. 2002. An unsupervised method for word sense tagging using parallel corpora. In Proceedings of ... and comfort as console. While monolingual parallel corpora often have identical contexts that can be used for identifying paraphrases, bilingual parallel corpora do not. Instead, we use phrases...
Scientific report: "AUTOMATIC ALIGNMENT IN PARALLEL CORPORA"

Scientific report

... (associated with content words) reduces the number of parameters ... AUTOMATIC ALIGNMENT IN PARALLEL CORPORA. Harris Papageorgiou, Lambros Cranias, Stelios Piperidis, Institute for Language ... the optimum alignment of units. The proposed scheme has been tested at sentence level on parallel corpora of the CELEX database. The success rate exceeded 99%. The next steps of the work ... paper addresses the alignment issue in the framework of exploitation of large bi- multilingual corpora for translation purposes. A generic alignment scheme is proposed that can meet varying...
Software Testing course textbook

Programming techniques

... Table of Contents: 1 INTRODUCTION TO SOFTWARE; 1.1 EVOLUTION OF THE SOFTWARE TESTING DISCIPLINE; 1.2 THE TESTING PROCESS AND THE SOFTWARE TESTING LIFE CYCLE; 1.3 BROAD CATEGORIES ... workings of the software more visible. These contrast with black box techniques that simply look at the official outputs of a program. White box testing is concerned only with testing the software ... FOR SYSTEM TESTING. Software Testing. Cognizant Technology Solutions. Performance Testing Process & Methodology. Proprietary & Confidential ... software development life...