domain similarity for parsing

Tài liệu Báo cáo khoa học: "Bilingual Sense Similarity for Statistical Machine Translation" ppt

Tài liệu Báo cáo khoa học: "Bilingual Sense Similarity for Statistical Machine Translation" ppt

Ngày tải lên : 20/02/2014, 04:20
... analogous similarity me- tric for γ . They range from 0 to 1. These two metrics both evaluate the similarity for two vec- tors in the same language, so using cosine dis- tance to compute the similarity ... performance over the baseline. The Alg2 cosine similarity function got 0.7 BLEU- score (p<0.01) improvement over the baseline for NIST 2006 test set, and a 0.5 BLEU-score (p<0.05) for ... of the Association for Computational Linguistics, pages 834–843, Uppsala, Sweden, 11-16 July 2010. c 2010 Association for Computational Linguistics Bilingual Sense Similarity for Statistical Machine...
  • 10
  • 594
  • 0
Tài liệu Báo cáo khoa học: "A Unified Syntactic Model for Parsing Fluent and Disfluent Speech∗" ppt

Tài liệu Báo cáo khoa học: "A Unified Syntactic Model for Parsing Fluent and Disfluent Speech∗" ppt

Ngày tải lên : 20/02/2014, 09:20
... any information about repair is stripped from the input, including partial words, repair sym- bols 3 , and interruption point information. While an integrated system for processing and parsing ... results from Hale et al. RCT results are on the right-corner transformed grammar (transformed back to flat treebank-style trees for scoring purposes). CYK and TAG lines show relevant results from ... syntactic information to find repairs, and thus may have access to some of this information about where interruptions occur, this experiment is intended to evaluate the use of the right corner transform...
  • 4
  • 581
  • 0
Tài liệu Báo cáo khoa học: "Keyword Extraction using Term-Domain Interdependence for Dictation of Radio News" ppt

Tài liệu Báo cáo khoa học: "Keyword Extraction using Term-Domain Interdependence for Dictation of Radio News" ppt

Ngày tải lên : 20/02/2014, 18:20
... any other domains, domainj seems to be the domain of unit~. The system se- lects the domain which is the largest of all sim- ilarities in N of domains as the domain of the unit (formula (6)) ... vectors. 5.3 Domain identification experiment The system selects suitable domain of each unit for keyword extraction. Table I shows the results of domain identification. We con- ducted domain identification ... kinds of domains, i.e. 141 domains and 9 large domains. We also compared the results and the result us- ing previous method (Suzuki et al., 1997). For comparison, we selected 5 domains which...
  • 5
  • 414
  • 1
Tài liệu Báo cáo khoa học: "Statistical Decision-Tree Models for Parsing*" ppt

Tài liệu Báo cáo khoa học: "Statistical Decision-Tree Models for Parsing*" ppt

Ngày tải lên : 20/02/2014, 22:20
... Statistical Pattern Recognition. Doctoral dissertation. Stanford University, Stanford, Cali- fornia. 283 Statistical Decision-Tree Models for Parsing* David M. Magerman Bolt Beranek and Newman ... sentence length for Wall Street Journal experiments. 5 Conclusion Regardless of what techniques are used for parsing disambiguation, one thing is clear: if a particular piece of information is ... and 7 illustrate the performance of SPATTER as a function of sentence length. SPAT- TER's performance degrades slowly for sentences up to around 28 words, and performs more poorly and more...
  • 8
  • 389
  • 0
Tài liệu Báo cáo khoa học: "Text-to-text Semantic Similarity for Automatic Short Answer Grading" pdf

Tài liệu Báo cáo khoa học: "Text-to-text Semantic Similarity for Automatic Short Answer Grading" pdf

Ngày tải lên : 22/02/2014, 02:20
... corpus-based measures of similarity perform comparably when used for the task of short answer grading. However, since the corpus- based measures can be improved by account- ing for domain and corpus ... unsupervised techniques for the task of automatic short answer grading. We compare a number of knowledge-based and corpus-based mea- sures of text similarity, evaluate the effect of domain and size on ... improve the performance of the system by integrating automatic feedback from the student answers. Overall, our system significantly and consistently out- performs other unsupervised methods for short...
  • 9
  • 577
  • 0
Báo cáo khoa học: "Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation" pdf

Báo cáo khoa học: "Estimating Class Priors in Domain Adaptation for Word Sense Disambiguation" pdf

Ngày tải lên : 08/03/2014, 02:21
... gathered from parallel texts and the evaluation data for the two SENSEVAL tasks. This gave a set of 6 nouns for SENSEVAL-2 and 9 nouns for SENSEVAL- 3. For each noun, we gathered a maximum of 500 parallel ... we use for our experiments before presenting our experimental results. Next, we propose using the well calibrated probabilities of logistic regression to estimate the sense priors, and perform ... improves performance. For example, row 1 of Table 4 shows that adjusting the pre- dictions of multiclass naive Bayes classifiers by sense priors estimated by logistic regression (NB- EM ) performs significantly...
  • 8
  • 268
  • 0
Báo cáo khoa học: "Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification" potx

Báo cáo khoa học: "Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification" potx

Ngày tải lên : 08/03/2014, 02:21
... target domain labeled instances. We chose this number since we believe it to be a reasonable amount for a single engineer to label with minimal effort. For reasons of space, for each target domain dom ... different domains, and annotating corpora for every possible domain of interest is impractical. We investigate domain adap- tation for sentiment classifiers, focusing on online reviews for different ... domains of discourse makes it an ideal candidate for domain adaptation. This work addressed two important questions of domain adaptation. First, we showed that for a given source and target domain, ...
  • 8
  • 425
  • 0
Báo cáo khoa học: "Syntactic Features and Word Similarity for Supervised Metonymy Resolution" pot

Báo cáo khoa học: "Syntactic Features and Word Similarity for Supervised Metonymy Resolution" pot

Ngày tải lên : 08/03/2014, 04:22
... reduction Pakistan Scotland-subj-of-losePakistan-subj-of-win similarity semantic class head similarity role similarity Pakistan had won the World Cup lost in the semi-finalScotland Figure 1: Context reduction and similarity levels draw ... 1997; Stern, 1931)). In a place -for- people pattern, a place stands for any persons/organisations associ- ated with it, e.g., for sports teams in (2), (3), and (4), and for the government in (7). 4 (7) ... a place -for- event pattern, a location name refers to an event that occurred there (e.g., us- ing the word Vietnam for the Vietnam war). In a place -for- product pattern a place stands for a product...
  • 8
  • 603
  • 0
Báo cáo Y học: Identification of residues in the PXR ligand binding domain critical for species specific and constitutive activation docx

Báo cáo Y học: Identification of residues in the PXR ligand binding domain critical for species specific and constitutive activation docx

Ngày tải lên : 08/03/2014, 16:20
... designed: 5¢-TGAGATGTGCCAGCTGAGGTTCA-3¢ for I282Q (forward), 5¢-CAACGCCCAGCATACCCAGCAGT-3¢ for Q404H (forward), 5¢-CAACGCCCAGGCAACCCAG CAGT-3¢ for Q404A (forward), 5¢-TGAACCTCAGCT GGCACATCTA-3¢ for I282Q (reverse), ... were obtained by Transformer Site-directed mutagenesis Kit (Clontech). The following primers were used: 5¢-TCGAGCTGTGTATACTGAGATTCA-3¢ for Q285I, 5¢-TCAATGCTCAGCAGACCCAGCGGC-3¢ for H407Q, 5¢-TCAATGCTCAGGCCACCCAGCG GC-3¢ ... luciferase activity. All experiments were performed at least three times in duplicates and luciferase activity was normalized for alkaline phosphatase activity. For curve fitting and EC50 calculations, XLFIT version...
  • 9
  • 552
  • 0
Báo cáo khoa học: "Aligning Medical Domain Ontologies for Clinical Query Extraction" potx

Báo cáo khoa học: "Aligning Medical Domain Ontologies for Clinical Query Extraction" potx

Ngày tải lên : 08/03/2014, 21:20
... in a specific domain (medicine) and as we are not domain experts, we are in lack of domain knowl- edge. This missing domain knowledge shall be acquired from external resources, for example UMLS. ... (c) is it normal or is it ab- normal? Therefore, when a radiologist looks for information, his search queries most likely con- tain terms from various information sources that provide this kind ... noun) Using a transformation rule of the form, 82 Ontologies (OBO) 5 framework. The OBO con- sortium establishes a set of principles to which the biomedical ontologies shall conform to for purposes...
  • 9
  • 384
  • 0
Báo cáo khoa học: "Exploiting Multiple Treebanks for Parsing with Quasi-synchronous Grammars" doc

Báo cáo khoa học: "Exploiting Multiple Treebanks for Parsing with Quasi-synchronous Grammars" doc

Ngày tải lên : 16/03/2014, 19:20
... Association for Computational Linguistics Exploiting Multiple Treebanks for Parsing with Quasi-synchronous Grammars Zhenghua Li, Ting Liu ∗ , Wanxiang Che Research Center for Social Computing and Information ... punctuation. We adopt Dan Bikel’s randomized parsing evaluation comparator for significance test (Noreen, 1989). 7 For all models used in current work (POS tagging and parsing) , we adopt averaged perceptron ... algorithm for pro- jective dependency parsing. In Proceedings of the 8th International Workshop on Parsing Technologies (IWPT), pages 149–160. Eric W. Noreen. 1989. Computer-intensive methods for testing...
  • 10
  • 245
  • 0
Báo cáo khoa học: "Domain Adaptation for Machine Translation by Mining Unseen Words" doc

Báo cáo khoa học: "Domain Adaptation for Machine Translation by Mining Unseen Words" doc

Ngày tải lên : 17/03/2014, 00:20
... gains in performance when moving from Parliament domain to News domain. 3 Data Our source domain is European Parliament proceedings (http://www.statmt.org/ europarl/). We use three target domains: ... methods for mining dic- tionaries from comparable corpora to the domain adaptation setting, by “bootstrapping” them based on known translations from the source domain. (3) Develop methods for integrating ... establish baseline performance for the domains. In these ex- periments, we built a translation model based only on the Parliament proceedings. We then tune it us- ing the small amount of target -domain tuning...
  • 6
  • 349
  • 0
Báo cáo khoa học: "Exploiting Heterogeneous Treebanks for Parsing" pptx

Báo cáo khoa học: "Exploiting Heterogeneous Treebanks for Parsing" pptx

Ngày tải lên : 17/03/2014, 01:20
... that the use of probability information from the parser for tree conversion helps target grammar parsing. 4.3 Using Unlabeled Data for Parsing Recent studies on parsing indicate that the use ... treebanks with same grammar for- malism for domain adaptation of parsers. Roark and Bachiani (2003) presented count merging and model interpolation techniques for domain adap- tation of parsers. ... Parsing Through grammar formalism conversion, we have successfully turned the problem of using hetero- geneous treebanks for parsing into the problem of parsing on homogeneous treebanks. Before using converted...
  • 9
  • 289
  • 0
Báo cáo khoa học: "Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation" potx

Báo cáo khoa học: "Demonstration of Joshua: An Open Source Toolkit for Parsing-based Machine Translation" potx

Ngày tải lên : 17/03/2014, 02:20
... (Fig- ure 1): one for the live “instant translation” user interface, one for demonstrating the different com- ponents of the system and algorithmic visualiza- tions, and one designated for technical ... and David Chiang. 2005. Better k-best parsing. In Proceedings of the International Work- shop on Parsing Technologies. 27 We will rely on 3 workstations: one for the instant translation demo, ... Open source toolkit for statistical machine translation. In Proceedings of the ACL-2007 Demo and Poster Ses- sions. Zhifei Li and Sanjeev Khudanpur. 2008. A scalable decoder for parsing- based machine...
  • 4
  • 275
  • 0
Báo cáo khoa học: "Construction of Domain Dictionary for Fundamental Vocabulary" pdf

Báo cáo khoa học: "Construction of Domain Dictionary for Fundamental Vocabulary" pdf

Ngày tải lên : 17/03/2014, 04:20
... Dong, 2006) and WordNet pro- vide domain information for Chinese and English, but there has been no domain resource for Japanese that are publicly available. 8 Domain dictionary construction methods ... Preparing key- words for each domain (§3.1). 2 Associating JFWs with domains (§3.2). 3 Reassociating JFWs with NODOMAIN (§3.3). 4 Manual correction (§3.5). 3.1 Preparing Keywords for each Domain About ... NODOMAIN was prepared for those words that do not belong to any particular domain. As for the latter issue, you might use keyword ex- traction techniques; identifying words that represent a domain...
  • 4
  • 353
  • 0