Scientific report: "Improving the Use of Pseudo-Words for Evaluating Selectional Preferences" docx

... implement. The Backoff Erk model is the best, using the Baseline for the majority of decisions and backing off to the Erk smoothing model when the Baseline cannot answer. Figure 5 (shown on the next ... which of two verbs was the correct predicate for a given noun object. One verb v was the original from the source document, and the other v′ was randomly gener...

Uploaded: 07/03/2014, 22:20

Document: Scientific report: "Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition" pdf

... and the former was used as the training data and the latter as the development data. For semi-CRFs, we used amis for training the semi-CRF with feature-forest. We used GENIA tagger for POS-tagging ... than the system without it (the p-value is less than 1.0 × 10^-4). The result of the preceding entity information improves the performance. On the other han...

Uploaded: 20/02/2014, 12:20

Document: Scientific report: "Exploring the Use of Linguistic Features in Domain and Genre Classification" potx

... vectors: Another lesson of Tab. 3 is that the effect of the composition of the feature vectors can vary depending both on the task and on the size of the feature vector. The dramatic ... because some classes, such as L, are very small. This problem was not so grave for the LPE experiments because of the ceiling effect and the small size of the c...

Uploaded: 22/02/2014, 03:20

Scientific report: "Improving the Interpretation of Noun Phrases with Cross-linguistic Information" doc

... the contribution the features exemplified in one baseline and six versions of the SVM model. The baseline is defined only for the English part of the NP feature set and measures the contribution of the ... EXPERIENCER, THEME, BENEFICIARY. Out of these instances, 74.81% use the preposition of. In CLUVI, 11.71% of the examples were verbal, from which the...

Uploaded: 17/03/2014, 04:20

Scientific report: "Improving the Accuracy of Subcategorizations Acquired from Corpora" pdf

... lexicon of the target grammar (footnote 4: When the lexicon is less accurate, I can determine the number of clusters using other algorithms (Hamerly, 2003).), and make use of the existing sets of SCFs for the ... vectors from the training SCFs and the acquired SCFs for the words in the testing SCFs. The number of the resulting data objects was 8,679 for XTAG an...

Uploaded: 17/03/2014, 06:20

Scientific report: "Improving the Performance of the Random Walk Model for Answering Complex Questions" pptx

... cases where the query is composed of two or more sentences, we compute the similarity between the document sentence (s) and each of the query-sentences (q_i) then we take the average of the scores. 3 ... but at the same time makes it not well suited for the semantic trees (ST) defined in Section 3. For instance, although the two STs of Figure 1 share most of...

Uploaded: 23/03/2014, 17:20

Scientific report: "On the use of Comparable Corpora to improve SMT performance" ppt

... on the English part of the bitexts and the Gigaword corpus of about 3.2 billion words. Therefore, it is likely that the target language model includes at least some of the translations of the ... amounts of parallel texts to translate the source side of the non-parallel corpus. The target side texts are used, along with other corpora, in the language model...

Uploaded: 31/03/2014, 20:20

Scientific report: "Comparing the Accuracy of CCG and Penn Treebank Parsers" docx

... is the number of sentences in the sample, and the % column gives the sample size as a percentage of the whole section. We compare the CCG parser to the Berkeley parser using the accurate mode of ... closer to the PTB than CCGbank, or due to their conversion method. We leave the application of their method to the CCG parser for future work. to use the comple...

Uploaded: 17/03/2014, 02:20

Scientific report: "ON THE INDEPENDENCE OF DISCOURSE STRUCTURE AND SEMANTIC DOMAIN" docx

... layouts, there is a minority strategy, used by 4% of the speakers (3 out of 72 cases of the data of Linde (1974)) describing the layout in the form of a map. The speaker first describes the outside ... describe the layout of their apartment. The vast majority of speakers used a "tour strategy," which takes the hearer on an imaginary tour of th...

Uploaded: 17/03/2014, 19:20

Document: Scientific report: Seeking the determinants of the elusive functions of Sco proteins pptx

... function of the CXXXC motif of human Sco proteins could therefore be implicated not only in the maturation of the CuA site of Cox2 but also in the maintenance of cellular copper homeostasis. The ... assembly of cbb3 oxidase, but rather is required for the maturation of the CuA-containing COX which is predominant for aerobic growth, thus leaving open the...

Uploaded: 14/02/2014, 18:20
