0

detecting highly confident word translations

Báo cáo khoa học:

Báo cáo khoa học: "Detecting Highly Confident Word Translations from Comparable Corpora without Any Prior Knowledge" doc

Báo cáo khoa học

... 449–459,Avignon, France, April 23 - 27 2012.c2012 Association for Computational Linguistics Detecting Highly Confident Word Translations from ComparableCorpora without Any Prior KnowledgeIvan Vuli´c and ... In other words, if the most prob-able translation candidate for a source word wS1isa target word wT2and, vice versa, the most prob-able translation candidate of the target word wT2451Proceedings ... the list. In otherwords, if the first translation candidate for the source word isola is the target word island, and, vice versa, the firsttranslation candidate for the target word island is isola,...
  • 11
  • 290
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Identification of Word Translations from Unrelated English and German Corpora" pot

Báo cáo khoa học

... two words ahead of another word B, a second vector for the case that word A is one word ahead of word B, a third vector for A directly following B, and a fourth vector for A following two words ... frequencies: kl~ = frequency of common occurrence of word A and word B kl2 = corpus frequency of word A - kll k21 = corpus frequency of word B - kll k22 = size of corpus (no. of tokens) - ... of test words. 4 This means that alternative translations of a word were not considered. Another approach, as conducted by Fung & Yee (1998), would be to consider all possible translations...
  • 8
  • 438
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Identifying Word Translations in Non-Parallel Texts" potx

Báo cáo khoa học

... algorithms for sentence and word- alignment allow the automatic iden- tification of word translations from paxalhl texts. This study suggests that the identi- fication of word translations should also ... determine the translations of words from comparable or even unrelated texts. 2 Approach It is assumed that there is a correlation between the co-occurrences of words which are translations ... co-occurrences of German word pairs in the German corpus. As a starting point, word order in the two matrices was chosen such that word n in the German matrix was the translation of word n in the English...
  • 3
  • 219
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Identifying Word Translations from Comparable Corpora Using Latent Topic Models" potx

Báo cáo khoa học

... Italian word vectors and English word vectors with TF-IDF scores in the original word- document space (Cos), with aligned documents.Table 1 shows the Precision@1 scores (the per-centage of words ... These methods needan initial lexicon of translations, cognates or simi-lar words which are then used to acquire additional translations of the context words. In contrast, ourmethod does not ... TF-IDF scores for the orig-inal word- document space (Manning and Sch¨utze,1999). If we are given a source word wi, n(wi)k,Sde-notes the number of times the word wiis associatedwith...
  • 6
  • 449
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Detecting Compositionality in Multi-Word Expressions" doc

Báo cáo khoa học

... sequences ofwords that tend to cooccur more frequently thanchance and are either idiosyncratic or decompos-able into multiple simple words (Baldwin, 2006).Deciding idiomaticity of MWEs is highly ... evaluationset is derived from WordNet in a semi-supervised way. Graph connectivity mea-sures are employed for unsupervised pa-rameter tuning.1 Introduction and related workMulti -word expressions (MWEs) ... Papers, pages 65–68,Suntec, Singapore, 4 August 2009.c2009 ACL and AFNLP Detecting Compositionality in Multi -Word ExpressionsIoannis KorkontzelosDepartment of Computer ScienceThe University...
  • 4
  • 278
  • 0
Tài liệu

Tài liệu "Word of Mouth": con dao 2 lưỡi docx

Tiếp thị - Bán hàng

... trong cũng bỏ ra bán nốt. 5. Thực hiện như một công nghệ " ;Word of Mouth": con dao 2 lưỡi Phương pháp Marketing Word of Mouth (WOMM) chính là một hình thức của chiến dịch quảng ... sản phẩm hay dịch vụ của bạn có chất lượng thực sự “đỉnh” thì nỗ lực Marketing bằng phương pháp Word of Mouth này sẽ được tự động thực hiện bởi chính những khách hàng. Lúc ấy, bạn chỉ cần nỗ ... nghĩ và cứ thế mà bấm Copy, gửi đi hàng loạt thì sẽ gây hậu quả thế nào? Hay một ví dụ khác mà Word of Mouth” làm điêu đứng một thương hiệu nước ngọt khi mọi người rỉ tai nhau: “Có ai nghe chuyện...
  • 8
  • 300
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Confidence Measure for Word Alignment" potx

Báo cáo khoa học

... English words following each Chi-nese word is its literal translation. We find untrans-lated Chinese and English words (marked withunderlines). These spurious words cause signifi-cant word alignment ... lexical translation probability of thealigned word pair with the translation probabilitiesof all the target words given the source word. If a word t occurs N times in the target sentence, forany ... learned based on word alignment.In this paper we introduce a confidence mea-sure for word alignment, which is robust to extraor missing words in the bilingual sentence pairs,as well as word alignment...
  • 9
  • 317
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Mining Parenthetical Translations from the Web by Word Alignment" potx

Báo cáo khoa học

... between words, we also compute the φ2 scores of prefixes and suffixes of Chinese and English words. For both languages, the prefix of a word is defined as the first three bytes of the word ... Competitive Linking to deal with multi -word alignments and takes advantage of word- internal correspondences between transliter-ated words or morphologically composed words. Finally, through our discussion ... the alignments are restricted word- to -word align-ments, which implies that multi -word expressions can only be partially linked at best. 4.1 Dealing with multi -word alignment We made a small...
  • 9
  • 612
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Subword-based Tagging for Confidence-dependent Chinese Word Segmentation" pdf

Báo cáo khoa học

... characters in a word. The length of a Chi-nese word has discriminative roles for word composition. For example, single-characterwords are more apt to form new words thanare multiple-character words. ... stands for word and t, for IOB tag.The subscripts are position indicators, where0 means the current word/ tag; −1, −2, the firstor second word/ tag to the left; 1, 2, the first orsecond word/ tag ... parts: a dictionary-based N-gram word segmentation for segmenting IVwords, a maximum entropy subword-based taggerfor recognizing OOVs, and a confidence-dependent word disambiguation used for merging...
  • 8
  • 348
  • 0

Xem thêm