bilingual topic admixture models

Báo cáo khoa học: "Bilingual Topic AdMixture Models for Word Alignment" ppt

Báo cáo khoa học: "Bilingual Topic AdMixture Models for Word Alignment" ppt

Ngày tải lên : 17/03/2014, 04:20
... We propose a new statistical formalism: Bilingual Topic AdMixture model, or BiTAM, to facilitate topic- based word alignment in SMT. Variants of admixture models have appeared in population genetics ... assumed to constitute a mixture of hidden topics; each word-pair follows a topic- specific bilingual translation model. Three BiTAMmodels are proposed to cap- ture topic sharing at different levels of ... higher-order alignment models can be embedded similarly within the pro- posed framework. 3 Bilingual Topic AdMixture Model Now we describe the BiTAM formalism that captures the latent topical structure...
  • 8
  • 354
  • 0
Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

Tài liệu Báo cáo khoa học: "Topic Models for Dynamic Translation Model Adaptation" pptx

Ngày tải lên : 19/02/2014, 19:20
... the probability under topic 1, topic 2, etc., or F 2 : What is the probability under the most probable topic, second most, etc. A model using F 1 learns whether a specific topic is useful for translation, ... machine translation systems toward relevant transla- tions based on topic- specific contexts, where topics are induced in an unsupervised way using topic models; this can be thought of as inducing subcorpora ... probabilistic member- ship. This topic model infers the topic distribution of a test set and biases sentence translations to ap- propriate topics. We accomplish this by introduc- ing topic dependent lexical...
  • 5
  • 532
  • 0
Báo cáo khoa học: "Employing Topic Models for Pattern-based Semantic Class Discovery" doc

Báo cáo khoa học: "Employing Topic Models for Pattern-based Semantic Class Discovery" doc

Ngày tải lên : 08/03/2014, 00:20
... details) before build- ing topic models for C R (q), where some low- frequency items are removed. Determine the number of topics: Most topic models require the number of topics to be known beforehand 1 . ... be related to multiple topics in some topic models (e.g., pLSI and LDA). Topic modeling Semantic class construction word item (word or phrase) document RASC topic semantic class Table ... top- ic models here. To employ topic models, we treat RASCs as “documents”, items as “words”, and the final semantic classes as “topics”. There are, however, several challenges in ap- plying topic...
  • 9
  • 398
  • 0
Báo cáo khoa học: "Unsupervised Sense Disambiguation Using Bilingual Probabilistic Models" pdf

Báo cáo khoa học: "Unsupervised Sense Disambiguation Using Bilingual Probabilistic Models" pdf

Ngày tải lên : 08/03/2014, 04:22
... Work We have presented two novel probabilistic models for unsupervised word sense disambiguation using parallel corpora and have shown that both models outperform existing unsupervised approaches. ... significant difference in model performance. 5 Experimental Evaluation Both the models are generative probabilistic models learned from parallel corpora and are expected to fit the training and subsequent ... English and Spanish senses to build the concepts. Unsupervised Sense Disambiguation Using Bilingual Probabilistic Models Indrajit Bhattacharya Dept. of Computer Science University of Maryland College...
  • 8
  • 361
  • 0
Báo cáo khoa học: "Unsupervised Bilingual Morpheme Segmentation and Alignment with Context-rich Hidden Semi-Markov Models" pdf

Báo cáo khoa học: "Unsupervised Bilingual Morpheme Segmentation and Alignment with Context-rich Hidden Semi-Markov Models" pdf

Ngày tải lên : 17/03/2014, 00:20
... we can recover several previously used models for monolingual segmentation and bilingual joint seg- mentation and alignment. We discuss the relation- ship ... model configuration but a simpler uniform transition distortion distribution. Note that the bilingual models perform worse than the monolingual ones in segmentation F1. This finding is in line with ... subset of variables to use in each of three component sub -models. This might in part explain their advantage over previous- state-of-the-art models, which might use fewer (e.g. (Poon et al., 2009)...
  • 10
  • 273
  • 0
Báo cáo khoa học: "Using Bilingual Comparable Corpora and Semi-supervised Clustering for Topic Tracking" ppt

Báo cáo khoa học: "Using Bilingual Comparable Corpora and Semi-supervised Clustering for Topic Tracking" ppt

Ngày tải lên : 17/03/2014, 04:20
... the 60 topics. Each story was labelled according to whether the story dis- cussed the topic or not. Not all the topics were present in the Japanese corpora. We therefore col- lected 1 topic from ... evaluation data, TDT1, 2, or 3. ‘ID’ denotes topic number de- fined by the TDT. ‘OnT.’(On -Topic) refers to the number of stories discussing the topic. Bold font stands for the topic which happened in Japan. ... calcu- 1 m  refers to the difference of dates between English and 232 Table 2: Topic Name TDT ID Topic name OnT. TDT ID Topic name OnT. 1 15 Kobe Japan quake 9,912 2 31015 Japan Apology to Korea...
  • 8
  • 254
  • 0
Báo cáo khoa học: "Flow Network Models for Word Alignment and Terminology Extraction from Bilingual Corpora" docx

Báo cáo khoa học: "Flow Network Models for Word Alignment and Terminology Extraction from Bilingual Corpora" docx

Ngày tải lên : 17/03/2014, 07:20
... account. We can envisage more complex models, in- cluding distortion parameters, multiword no- tions, or information on part-of-speech, infor- mation derived from bilingual dictionaries or from ... candidate terms are extracted in one language, 449 Flow Network Models for Word Alignment and Terminology Extraction from Bilingual Corpora l~ric Gaussier Xerox Research Centre Europe 6, ... solved alignment 4. 3.2 Experiment In order to test the previous model, we se- lected a small bilingual corpus consisting of 1000 aligned sentences, from a corpus on satel- lite telecommunications....
  • 7
  • 379
  • 0
Báo cáo khoa học: "Authorship Attribution with Author-aware Topic Models" pptx

Báo cáo khoa học: "Authorship Attribution with Author-aware Topic Models" pptx

Ngày tải lên : 23/03/2014, 14:20
... in DADT author topics are disjoint from document topics, with different priors for each topic set. Thus, the number of author topics can be differ- ent from the number of document topics, enabling us ... χ. To fairly compare the topic- based methods, we used the same overall number of topics for all the topic models. We present only the results obtained with the best topic settings: 100 for PAN’11 ... number of document topics. It is worth not- ing that AT can be seen as an extreme version of DADT, where all the topics are author topics. A fu- ture extension is to learn the topic balance automat- ically,...
  • 6
  • 230
  • 0
System models

System models

Ngày tải lên : 14/09/2012, 11:26
... Various object models may be produced • Inheritance models; • Aggregation models; • Interaction models. ©Ian Sommerville 2004 Software Engineering, 7th edition. Chapter 8 Slide 26 Object models  Natural ... 2004 Software Engineering, 7th edition. Chapter 8 Slide 3 Topics covered  Context models  Behavioural models  Data models  Object models  CASE workbenches ©Ian Sommerville 2004 Software Engineering, ... Aggregation models are similar to the part-of relationship in semantic data models. ©Ian Sommerville 2004 Software Engineering, 7th edition. Chapter 8 Slide 10 Behavioural models  Behavioural models...
  • 15
  • 510
  • 0
Writing Topic

Writing Topic

Ngày tải lên : 04/10/2012, 09:39
... Which would you choose to buy? Give specific reasons to explain your choice. Writing Topics (continued) 27 WRITING TOPICS Topics in the following list may appear in your actual test. You should become ... TOEFL test. Remember that when you take the test you will not have a choice of topics. You must write only on the topic that is assigned to you. People attend college or university for many different ... support or oppose this plan? Why? Use specific reasons and details in your answer. Writing Topics 29 WRITING TOPICS Should a city try to preserve its old, historic buildings or destroy them and replace...
  • 6
  • 1.6K
  • 5
101 ielts speaking part two topic cards that tie in with ielts speaking part one

101 ielts speaking part two topic cards that tie in with ielts speaking part one

Ngày tải lên : 04/10/2012, 11:45
... future is unlikely to come up, this section is arranged with the topics which are most common in the Speaking Part One “future plans” topic nearest the top. Talk about one thing you are going ... jump. Luckily, the 6 to 8 most popular topics in IELTS Speaking Part One (e.g. Friends and Families) can also come up in Part Two (although the same topic is never used twice in the same test). ... presentations. It is not possible to say which topics are most likely to come up in the IELTS Speaking Test Part Two, but the most typical questions for each of the topics that are the same as IELTS Speaking...
  • 25
  • 2.7K
  • 19

Xem thêm