flow network models for word alignment

Scientific report: "Flow Network Models for Word Alignment and Terminology Extraction from Bilingual Corpora" docx

Uploaded: 17/03/2014, 07:20
... French ones, to a sink, to control the flow quantity we want to go through the words. 2.1 Flow networks We meet here the notion of flow networks that we can formalise in the following way (we ... reader to (Ford and Fulkerson, 1962; Klein, 1967). 2.2 Alignment models Flows and networks define a general framework in which it is possible to model alignments between words, and to ... translations for the English term. 4 Conclusion We presented a new model for word alignment based on flow networks. This model allows us to integrate different types of constraints in the search for...
Scientific report: "Bilingual Topic AdMixture Models for Word Alignment" ppt

Uploaded: 17/03/2014, 04:20
... Retrieving Word Alignments Two word-alignment retrieval schemes are designed for BiTAMs: the uni-direction alignment (UDA) and the bi-direction alignment (BDA). Both use the posterior mean of the alignment ... Null word and Laplace smoothing for the BiTAM models. We train, for comparison, IBM-1&4 and HMM models with 8 iterations of IBM-1, 7 for HMM and 3 for IBM-4 (1^8 h^7 4^3) with Null word ... Incorporation of Word “Null” Similar to IBM models, “Null” word is used for the source words which have no translation counterparts in the target language. For example, Chinese words “de” ()...
Scientific report: "Data Cleaning for Word Alignment" pdf

Uploaded: 08/03/2014, 01:20
... are less than 20 percent. 2 1:n Word Alignment Our discussion of uni-directional alignments of word alignment is limited to IBM Model 4. Definition 1 (Word alignment task) Let e_i be the i-th ... two word alignments as an alignment point, 2) add new alignment points that exist in the union with the constraint that a new alignment point connects at least one previously unaligned word, ... purpose of the word alignment task is to obtain a lexical translation probability p(f̄_i | ē_i), which is a 1:n uni-directional word alignment. The initial idea underlying the IBM Models, consisting...
Scientific report: "Soft Syntactic Constraints for Word Alignment through Discriminative Training" pot

Uploaded: 08/03/2014, 02:21
... bilingual word alignment finds word-to-word connections across languages. Originally introduced as a byproduct of training statistical translation models in (Brown et al., 1993), word alignment ... Log-linear models for word alignment. In Meeting of the Association for Computational Linguistics, pages 459–466, Ann Arbor, USA. I. D. Melamed. 2000. Models of translational equivalence among words. ... new information resulting in improved alignments. 2 Constrained Alignment Let an alignment be the complete structure that connects two parallel sentences, and a link be one of the word-to-word...
Scientific report: "Learning Expressive Models for Word Sense Disambiguation" pot

Uploaded: 08/03/2014, 02:21
... words to the right and left of the verb, identified using POS tags, represented by has_narrow(snt, word_position, word): has_narrow(snt_1, 1st_word_left, mind). has_narrow(snt_1, 1st_word_right, ... the positions of the words, represented by has_narrow_trns(snt, word_position, portuguese_word): has_narrow_trns(snt_1, 1st_word_right, como). has_narrow_trns(snt_1, 2nd_word_right, um). … ... for the disambiguation of verbs. We plan to further evaluate our approach for other sets of words, including other parts-of-speech to allow further comparisons with other approaches. For...
Scientific report: "Using Similarity Scoring To Improve the Bilingual Dictionary for Word Alignment" doc

Uploaded: 08/03/2014, 07:20
... Automatically-extracted thesauri for cross-language IR: When better is worse. In Proceedings of COMPUTERM’98. Eric Gaussier. 1998. Flow network models for word alignment and terminology extraction ... language word. is expressed as follows: a word qualifies for clustering if As before, are all the target language words that cooccur with source language word . Similarly to the most frequent words, ... clustering. Those words that are considered for clustering should account for more than of the cooccurrences of the source language word with any target language word. If a word falls below...
Scientific report: "Combining Clues for Word Alignment" pdf

Uploaded: 08/03/2014, 21:20
... short words. 340 Combining Clues for Word Alignment Jörg Tiedemann Department of Linguistics Uppsala University Box 527 SE-751 20 Uppsala, Sweden joerg@stp.ling.uu.se Abstract In this paper, a word ... bilingual lexical information. Word alignment approaches focus on the automatic identification of translation relations in translated texts. Alignments are usually represented as a set of links between words and ... an alignment clue for the corresponding word pairs. The likelihood of each translation alternative can be weighted, e.g., by frequency (if available). 2.3 Clue Combinations So far, word alignment...
Scientific report: "Confidence Measure for Word Alignment" potx

Uploaded: 17/03/2014, 01:20
... based on word alignment. In this paper we introduce a confidence measure for word alignment, which is robust to extra or missing words in the bilingual sentence pairs, as well as word alignment ... confidence sentence alignments and alignment links from multiple word alignments of the same sentence pair. Additionally, we remove low confidence alignment links from the word alignment of a bilingual ... the same word does increase the confusion for word alignment and reduce the link confidence. On the other hand, additional information (such as the distance of the word pair, the alignment...
Document: Scientific report: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx

Uploaded: 20/02/2014, 12:20
... two parameters for the distortion probability: one for head words and the other for non-head words. Distortion Probability for Head Words The distortion probability for head words represents ... two word alignment models for language pairs L1-L3 and L2-L3, respectively. And then, with L3 as a pivot language, we can build a word alignment model for L1 and L2 based on the above two models. ... language word similarity sim(c, f; e) of the Chinese word c and the Japanese word f given the English word e. [Figure 1. Similarity Calculation] For the ambiguous English word e,...
Document: Scientific report: "Guiding Statistical Word Alignment Models With Prior Knowledge" pdf

Uploaded: 20/02/2014, 12:20
... as 1. In building word alignment models, a special “NULL” word is usually introduced to address target words that align to no source words. Since this physically non-existing word is not in the ... computational 4 2 Constrained Word Alignment Models The framework that we propose to incorporate statistical constraints into word alignment models is generic. It can be applied to complicated models such as IBM ... candidate. This information is derived before word alignment model training and will act as soft constraints that need to be respected during training and alignments. For a given word pair, the...
Scientific report: "The S-Space Package: An Open Source Package for Word Space Models" pdf

Uploaded: 17/03/2014, 00:20
... with all word space models, which facilitates word space based applications. The package is written in Java and defines a standardized Java interface for word space algorithms. While other word ... July 2010. © 2010 Association for Computational Linguistics The S-Space Package: An Open Source Package for Word Space Models David Jurgens University of California, Los Angeles, 4732 Boelter ... algorithms, code documentation and mailing list archives. 2 Word Space Models Word space models are based on the contextual distribution in which a word occurs. This approach has a long history in linguistics,...
Scientific report: "Better Word Alignments with Supervised ITG Models" pdf

Uploaded: 17/03/2014, 01:20
... reasonable alignments, word alignment models must constrain the set of alignments considered. In this section, we discuss and compare alignment families used to train our discriminative models. Initially, ... many-to-one block alignment potential, and efficient pruning, ITG models can yield state-of-the-art word alignments, even when the underlying gold alignments are highly non-ITG. Our models yielded ... across alignments. Specifically, for each alignment cell (i, j) which is not a possible alignment in a*, we incur a loss of 1 when a_ij = a*_ij; note that if (i, j) is a possible alignment, ...
Scientific report: "Optimizing Word Alignment Combination For Phrase Table Training" pptx

Uploaded: 17/03/2014, 02:20
... syntax-based translation framework. Most word alignment models distinguish translation direction in deriving word alignment matrix. Given a parallel sentence, word alignments in two directions are ... different word alignment combination methods 4 Conclusions We presented a simple yet effective method for word alignment symmetrization and combination in general. The problem is formulated ... DARPA TransTac program for funding and the anonymous reviewers for their constructive suggestions. References N. F. Ayan. 2005. Combining Linguistic and Machine Learning Techniques for Word Alignment Improvement....
Scientific report: "Discriminative Pruning of Language Models for Chinese Word Segmentation" ppt

Uploaded: 17/03/2014, 04:20
... model for Chinese word segmentation was proposed. Gao et al. (2005) further developed it to a linear mixture model. In these statistical models, language models are essential for word segmentation ... bigram probability P_m(w_y | w_x) for seen bigram w_x w_y in training corpus, unigram probability P_m(w) and backoff coefficient α_m(w) for any word w. For any w_x and w_y in the vocabulary, ... pages 1001–1008, Sydney, July 2006. © 2006 Association for Computational Linguistics Discriminative Pruning of Language Models for Chinese Word Segmentation Jianfeng Li Haifeng Wang Dengjun...
