Báo cáo khoa học: "Learning Common Grammar from Multilingual

Báo cáo khoa học: "Learning Common Grammar from Multilingual Corpus" potx

... syntax across languages, and try to extract a common grammar from non-parallel multilingual corpora. For this purpose, we propose a generative model for multilingual grammars that is learned in an unsupervised ... corpora, where each sentence is generated from a language dependent probabilistic context- free grammar (PCFG), and these PCFGs are generated from a prior grammar...

Ngày tải lên : 07/03/2014, 22:20

5
326
0

Báo cáo khoa học: "Learning Semantic Links from a Corpus of Parallel Temporal and Causal Relations" doc

... null label is NO-REL. train/test split from Table 1 and the feature sets: Syntactic The syntactic features from Section 4. Semantic The semantic features from Section 4. All Both syntactic and ... took, take, VBD and began, to, trade, begin, trade, VBD,TO,VB. • The syntactic paths from the ﬁrst event to the common ancestor to the second event, e.g. VBD>VP, VP and VP<VBD. 1 Tra...

Ngày tải lên : 08/03/2014, 01:20

4
363
0

Tài liệu Báo cáo khoa học: "Learning Event Durations from Event Descriptions" docx

... instances), from the TimeBank corpus annotated in TimeML (Pustejovky et al., 2003). The non- WSJ articles (mainly political and disaster news) include both print and broadcast news that are from ... two peaks in this distribution. One is from 5 to 7 in the natural logarithmic scale, which corresponds to about 1.5 minutes to 30 minutes. The other is from 14 to 17 in the natural l...

Ngày tải lên : 20/02/2014, 12:20

8
381
0

Báo cáo khoa học: "Learning Semantic Categories from Clickthrough Logs" pdf

... both precision and recall. We cast semantic category acquisition from search logs as the task of learning labeled instances from few labeled seeds. To our knowledge this is the ﬁrst study that ... different from ours. An- other line of new research is to combine various re- sources such as web documents with search query logs (Pas¸ca and Durme, 2008; Talukdar et al., 2008). We differ...

Ngày tải lên : 08/03/2014, 01:20

4
316
0

Báo cáo khoa học: "Learning Constraint Grammar-style disambiguation rules using Inductive Logic Programming" potx

... of the Constraint Grammar (CG) (Karlsson et al., 1995) approach to part of speech tagging and surface syntactic depen- dency parsing is due to the minutely hand- crafted grammar and two-level ... Progol machine learning system will be presented very briefly. 1.1 Constraint Grammar POS tagging Constraint Grammar is a system for part of speech tagging and (shallow) syntactic dep...

Ngày tải lên : 17/03/2014, 07:20

5
244
0

Báo cáo khoa học: "Learning Bilingual Lexicons from Monolingual Corpora" pot

... improvement over the 92.3 using only the Wiktionary lexicon. Of the true errors, the most common arose from semantically related words which had strong context feature correlations (see table ... aria42,pliang,tberg,klein }@cs.berkeley.edu Abstract We present a method for learning bilingual translation lexicons from monolingual corpora. Word types in each language are charac- terize...

Ngày tải lên : 31/03/2014, 00:20

9
300
0

Báo cáo khoa học: "Learning Transliteration Lexicons from the Web" pptx

... from corpora. The EX approach aims to construct a large and up-to- date transliteration lexicon from live corpora. Towards this objective, some have proposed extracting translation pairs from ... extraction of transliteration pairs (EX) from corpora. The TM approach models phoneme-based or grapheme-based mapping rules using a generative model that is trained from a large bi...

Ngày tải lên : 31/03/2014, 01:20

8
341
0

Báo cáo khoa học: "Learning Tense Translation from Bilingual Corpora" docx

... called me up. The following two grammar fragments describe the relevant CVP syntax for English and Ger- man. Every auxiliary verb governs only one verb, so the CVP grammar is basically 2 regu- ... Fortunately, the task can be (partly) automated if the tables associating words with biases are learned from a corpus. Statistical approaches also support empirical evaluation of diffe...

Ngày tải lên : 31/03/2014, 04:20

5
279
0

Báo cáo khoa học: "Acquiring a Lexicon from Unsegmented Speech" potx

... a word grammar could be learned in conjunction with this acquisition process, and used as a disambiguation step. 3 Tests and Results To test the algorithm, we used 34438 utterances from the ... cgdemarc@ai.mit.edu Abstract We present work-in-progress on the machine acquisition of a lexicon from sen- tences that are each an unsegmented phone sequence paired with a primitive r...

Ngày tải lên : 08/03/2014, 07:20

3
315
0

Báo cáo khoa học: "Learning Semantic Correspondences with Less Supervision" potx

... mapping which does not arise from string operations but must instead be learned. We used the dataset created by Chen and Mooney (2008), which contains 1919 scenarios from the 2001–2004 Robocup ... 3,753 cities in the US (those with population at least 10,000) over three days (February 7–9, 2009) from www.weather.gov. For each city and date, we created two scenarios, one for the day fore...

Ngày tải lên : 17/03/2014, 01:20

9
330
0