
word alignment for languages with scarce resources

Scientific report document: "Word Alignment for Languages with Scarce Resources Using Bilingual Corpora of Other Language Pairs" pptx

Scientific reports

... proposes an approach to improve word alignment for languages with scarce resources using bilingual corpora of other language pairs. To perform word alignment between languages L1 and L2, we introduce ... two parameters for the distortion probability: one for head words and the other for non-head words. Distortion Probability for Head Words. The distortion probability for head words represents ... Class-based Approach to Word Alignment. Computational Linguistics, 23(2): 313-343. Adam Lopez and Philip Resnik. 2005. Improved HMM Alignment Models for Languages with Scarce Resources. In Proc....
570 ACADEMIC WORD FAMILIES FOR IELTS (WITH VIETNAMESE MEANING)

General English

... chính, tiền bạc Finance for education comes from taxpayers • financial • financially • financier Formula: Công thức • formulate / • reformulate • formulation / • reformulation Function: ... energetically Enforce: Ép buộc. (obey: tuân theo) The legislation will be difficult to enforce. It is the duty of the police to enforce the law. United Nations troops enforced a ceasefire ... database Integrate into/with something: These programs will integrate with your existing software. Integrate A (into/with B) | integrate A and B: These programs can be integrated with your existing...
Scientific report: "A Novel Word Segmentation Approach for Written Languages with Word Boundary Markers" pptx

Scientific reports

... 29–32, Suntec, Singapore, 4 August 2009. ©2009 ACL and AFNLP. A Novel Word Segmentation Approach for Written Languages with Word Boundary Markers. Han-Cheol Cho, Do-Gil Lee, Jung-Tae Lee, ... module. 1 Introduction. Word segmentation (WS) has been a fundamental research issue for languages that do not have word boundary markers (WBMs); on the contrary, other languages that do have ... information of the user input, for languages with WBMs. By utilizing the user input, the proposed method effectively refines the output of the baseline WS model and improves the overall performance. The...
Scientific report document: "Word Alignment with Synonym Regularization" doc

Scientific reports

... framework for word alignment that incorporates synonym knowledge collected from monolingual linguistic resources in a bilingual probabilistic model. Synonym information is helpful for word alignment ... monolingual resources in a bilingual word alignment model. We formulate a synonym pair generative model with a topic variable and use this model as a regularization term with a bilingual word alignment ... the different words coupled with the same word in the synonym pairs as synonyms. For instance, the words ‘head’, ‘chief’ and ‘forefront’ in the bilingual sentences are replaced with ‘chief’, since...
Scientific report document: "Discriminative Word Alignment with Conditional Random Fields" ppt

Scientific reports

... many-to-one word alignments, where each source word is aligned with zero or one target words, and therefore each target word can be aligned with many source words. Each source word is labelled with ... algorithms for maximum entropy parameter estimation. In Proceedings of CoNLL, pages 49–55. J. Martin, R. Mihalcea, and T. Pedersen. 2005. Word alignment for languages with scarce resources. ... alignments, where each target word is aligned with zero or more source words. Many-to-many alignments are recoverable using the standard techniques for superimposing predicted alignments in both translation...
Scientific report document: "Guiding Statistical Word Alignment Models With Prior Knowledge" pdf

Scientific reports

... candidate. This information is derived before word alignment model training and will act as soft constraints that need to be respected during training and alignments. For a given word pair, the ... are used to guide word alignment model training for each iteration. The BLEU score and TER with this constraint are shown in the line “BiLSA-1” of Table 1. To exploit word alignment statistics ... as 1. In building word alignment models, a special “NULL” word is usually introduced to address target words that align to no source words. Since this physically non-existing word is not in the...
Scientific report: "Dealing with Spurious Ambiguity in Learning ITG-based Word Alignment" pdf

Scientific reports

... a family of word alignment. Definition 1. The ITG alignment family is a set of word alignments that has at least one BTG derivation. The ITG alignment family is only a subset of word alignments because ... ambiguity in word alignment is the case where two or more derivations d1, d2, dk of G have the same underlying word alignment A. A grammar G is non-spurious if for any given word alignment, ... PA, USA. Association for Computational Linguistics. Aria Haghighi, John Blitzer, and Dan Klein. 2009. Better word alignments with supervised ITG models. In Association for Computational Linguistics,...
Scientific report: "Data Cleaning for Word Alignment" pdf

Scientific reports

... in its word alignment. Secondly, the aforementioned phrase alignment (Marcu and Wong, 02) considers the n : m mapping directly bilingually generated by some concepts without word alignment. ... two word alignments as an alignment point, 2) add new alignment points that exist in the union with the constraint that a new alignment point connects at least one previously unaligned word, ... are less than 20 percent. 2 1 : n Word Alignment. Our discussion of uni-directional alignments of word alignment is limited to IBM Model 4. Definition 1 (Word alignment task) Let ei be the i-th...
Scientific report: "Soft Syntactic Constraints for Word Alignment through Discriminative Training" pot

Scientific reports

... bilingual word alignment finds word-to-word connections across languages. Originally introduced as a byproduct of training statistical translation models in (Brown et al., 1993), word alignment ... this formalism: A → [AA] | ⟨AA⟩ | e/f. This grammar enforces its own weak cohesion constraint: for every possible alignment, a corresponding binary constituency tree must exist for which the alignment ... new information resulting in improved alignments. 2 Constrained Alignment. Let an alignment be the complete structure that connects two parallel sentences, and a link be one of the word-to-word...
Scientific report: "Using Similarity Scoring To Improve the Bilingual Dictionary for Word Alignment" doc

Scientific reports

... clustering. Those words that are considered for clustering should account for more than of the cooccurrences of the source language word with any target language word. If a word falls below ... the target language words that cooccur with source language word . Similarly to the most frequent words, dictionary scores for word pairs that are too rare for clustering remain unchanged. ... threshold . enforces that all words within one cluster must have an average similarity score of at least . The second threshold, , enforces that only certain words are considered for clustering....
Scientific report: "Combining Clues for Word Alignment" pdf

Scientific reports

... 0.720 opened 0.2 0.860 The matrix is simply filled with all values of combined clues for each word pair. For example, the total clue value for the word pairs s = "baggage" and t = "handbagaget" ... find the word-to-word relation with the highest value in the matrix to zero. 2. Check for overlaps: If the link overlaps with other links from more than one accepted link cluster continue with ... bilingual lexical information. Word alignment approaches focus on the automatic identification of translation relations in translated texts. Alignments are usually represented as a set of links between words and...
Scientific report: "Unsupervised Word Alignment with Arbitrary Features" potx

Scientific reports

... evaluation exercise for word alignment. In Proc. of the Workshop on Building and Using Parallel Texts. R. C. Moore. 2005. A discriminative framework for bilingual word alignment. In Proc. of ... Czech-English poses problems for word alignment models since, unlike English, Czech words have a complex inflectional morphology, and the syntax permits relatively free word order. For this language ... their performance in a translation system. Since we only have gold alignments for Czech-English (Bojar and Prokopová, 2006), we can report alignment error rate (AER; Och and Ney, 2003) only for...
Scientific report: "Confidence Measure for Word Alignment" potx

Scientific reports

... based on word alignment. In this paper we introduce a confidence measure for word alignment, which is robust to extra or missing words in the bilingual sentence pairs, as well as word alignment ... the same word does increase the confusion for word alignment and reduce the link confidence. On the other hand, additional information (such as the distance of the word pair, the alignment ... confidence sentence alignments and alignment links from multiple word alignments of the same sentence pair. Additionally, we remove low confidence alignment links from the word alignment of a bilingual...
Scientific report: "Optimizing Word Alignment Combination For Phrase Table Training" pptx

Scientific reports

... Translation results (BLEU score) with phrase tables trained with different word alignment combination methods. 4 Conclusions. We presented a simple yet effective method for word alignment symmetrization ... syntax-based translation framework. Most word alignment models distinguish translation direction in deriving word alignment matrix. Given a parallel sentence, word alignments in two directions are ... Statistics of word alignment set and the resulting phrase table size (number of entries in thousand (K)) with different combination methods. 3.2 Translation Results. The ultimate goal of word alignment...
Scientific report: "Bilingual Topic AdMixture Models for Word Alignment" ppt

Scientific reports

... and document-pair. Formally, we define the following terms: • A word-pair (fj, ei) is the basic unit for word alignment, where fj is a French word and ei is an English word; j and i are ... corpus-level word-correlation and contextual-level topical information may help to disambiguate translation candidates and word-alignment choices. For example, the most frequent source words (e.g., ... improve alignment. 3.3 BiTAM-3: Word-level Admixture. It is straightforward to extend the sentence-level BiTAM-1 to a word-level admixture model, by sampling topic indicator zn,j for each word-pair (fj,...
