0

an unsupervised model for joint phrase alignment and extraction

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "An Unsupervised Model for Joint Phrase Alignment and Extraction" ppt

Báo cáo khoa học

... 5-gram model. For GIZA ++, we use the standard training reg-imen up to Model 4, and combine alignmentswith grow-diag-final -and. For the proposedmodels, we train for 100 iterations, and use the ... Japan2National Institute of Information and Communication Technology3-5 Hikari-dai, Seika-cho, Soraku-gun, Kyoto, JapanAbstractWe present an unsupervised model for joint phrase alignment and ... removed for TMtraining. For both tasks, we perform weight tuning and testing on specified development and test sets.We compare the accuracy of our proposed methodof joint phrase alignment and extraction...
  • 10
  • 641
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Discriminative Model for Joint Morphological Disambiguation and Dependency Parsing" ppt

Báo cáo khoa học

... definitearticle; Hungarian has both a definite and an indefi-nite article. In both languages (Tables 5 and 6), noun and adjective gender, number, and case are moreaccurately predicted than in Czech and Latin. ... authors’ and do not necessarily reflect those ofthe sponsors.ReferencesDavid Bamman and Gregory Crane. 2006. The Design and Use of a Latin Dependency Treebank. Proc. Work-shop on Treebanks and ... ResultsWe compare the performance of the pipeline model (§4) and the joint model (§3) on morphological dis-ambiguation and unlabeled dependency parsing. Model Tagger Joint Tagger Joint Attr. ↓ all...
  • 10
  • 411
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "An Error-Driven Word-Character Hybrid Model for Joint Chinese Word Segmentation and POS Tagging" docx

Báo cáo khoa học

... Dong and Qiang Dong. 2006. Hownet and the Computation of Meaning. World Scientific.Kuzman Ganchev, Koby Crammer, Fernando Pereira,Gideon Mann, Kedar Bellare, Andrew McCallum,Steven Carroll, Yang ... Lafferty, Andrew McCallum, and FernandoPereira. 2001. Conditional random fields: Prob-abilistic models for segmenting and labeling se-quence data. In Proceedings of ICML, pages 282–289.Ryan McDonald, ... discriminativeword-character hybrid model for joint Chi-nese word segmentation and POS tagging.Our word-character hybrid model offershigh performance since it can handle bothknown and unknown words. We...
  • 9
  • 338
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Joint Statistical Model for Simultaneous Word Spacing and Spelling Error Correction for Korean" pdf

Báo cáo khoa học

... correction candidates. Candidates are increased in number by inserting the blank cha-racters on the created candidates, which cover the spacing error correction candidates. We find the best candidate ... Jianfeng Gao, Mu Li and Chang-Ning Huang. 2003. Improved Source-Channel Models for Chinese Word Segmentation. Proceedings of the 41st Annual Meet-ing of the ACL, pp. 272-279 Seung-Shik Kang ... word spacing error and spelling error simulta-neously for Korean. This algorithm is based on noisy-channel model, which uses Jaso3 transition probabilities and Eojeol transition probabilities...
  • 4
  • 523
  • 0
An Equilibrium Model of Rare-Event Premia and Its Implication for Option Smirks potx

An Equilibrium Model of Rare-Event Premia and Its Implication for Option Smirks potx

Tổ chức sự kiện

... include Gilboa and Schmeidler (1989), Epstein and Wang(1994), Anderson, Hansen, and Sargent (2000), Chen and Epstein (2002),Hansen and Sargent (2001), Epstein and Miao (2003), Routledge and Zin(2002), ... derivatives are examined by Liu and Pan (2003), Liu, Longstaff, and Pan (2003) and Das and Uppal (2001). Dufresne and Hugonnier (2001) study the impact of event risk on pricing and hedging ofcontingent ... 2005134 An Equilibrium Model of Rare-EventPremia and Its Implication for Option SmirksJun LiuAnderson School at UCLAJun PanMIT Sloan School of Management, CCFR and NBERTan WangSauder School...
  • 34
  • 500
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" pdf

Báo cáo khoa học

... seg-mentation only and joint segmentation and part-of-speech tagging. On the Penn ChineseTreebank 5.0, we obtain an error reduction of18.5% on segmentation and 12% on joint seg-mentation and part-of-speech ... outside-layerlinear model. We used SRI Language ModellingToolkit (Stolcke and Andreas, 2002) to train a 3-gram word LM with modified Kneser-Ney smooth-ing (Chen and Goodman, 1998), and a 4-gram POSFeatures ... 2002), Chinese word seg-mentation (Ng and Low, 2004; Zhang and Clark,2007) and so on. We trained a character-based per-ceptron for Chinese Joint S&T, and found that theperceptron itself...
  • 8
  • 445
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" potx

Báo cáo khoa học

... explored in previouswork (Zhang and Clark, 2010; Jiang et al., 2008b).In this paper, we present an effective and effi-cient solution for joint Chinese word segmentation and POS tagging. Our work ... (Ng and Low, 2004; Jiang et al., 2008a; Zhang and Clark,2008).2.2 Character-Based and Word-BasedMethodsTwo kinds of approaches are popular for joint wordsegmentation and POS tagging. The ... June. Association for Computational Lin-guistics.Yue Zhang and Stephen Clark. 2010. A fast decoder for joint word segmentation and POS-tagging using a sin-gle discriminative model. In Proceedings...
  • 10
  • 412
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A global model for joint lemmatization and part-of-speech prediction" doc

Báo cáo khoa học

... log-linear model capturing such dependen-cies, and demonstrated its effectiveness on English and three Slavic languages.AcknowledgementsWe would like to thank Galen Andrew and Lucy Vander-wende for ... tag pre-diction and lemmatization are strongly dependent and that by building state-of-the art models for the two subtasks and performing joint inferencewe can improve performance on both tasks. ... the annotations.491lemmatization subtasks, which a joint model couldexploit.6.3 Evaluation of joint modelsSince our joint model re-ranks candidates pro-duced by the component tagger and...
  • 9
  • 430
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Joint Rule Selection Model for Hierarchical Phrase-based Translation" pptx

Báo cáo khoa học

... proposed joint probability model is factored into four sub-models that canbe further classified into source-side and target-side rule selection models or context-based and context-free selection models. ... same. As Rule (1) cannot be applied to Fig-ure 1(b) for the translation and Rule (2) cannotbe applied to Figure 1(a) for the translation either,υ = 1, C(ras), C(rat) and υ = 1, C(rbs), ... context infor-mation.3 Model Training of CBSM and CBTM3.1 The acquisition of training instancesCBSM and CBTM are trained by ME approach for the binary classification, where a training instanceconsists...
  • 6
  • 314
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "An Unsupervised Approach to Prepositional Phrase Attachment using Contextually Similar Words" potx

Báo cáo khoa học

... 7.1. Table 8 presents theprecision and recall of our algorithm and Table 9presents a performance comparison between oursystem and previous supervised and unsupervised approaches using the same ... PGSB207797.ReferencesAltmann, G. and Steedman, M. 1988. Interaction with ContextDuring Human Sentence Processing. Cognition, 30:191-238.Brill, E. 1995. Transformation-based Error-driven Learning and Natural Language ... classifier outperforms all previous unsupervised techniques and approaches theperformance of supervised algorithm.We reconstructed the two earlier unsupervised classifiers clHR and clR2. Table...
  • 8
  • 376
  • 0
Báo cáo khoa học: Functional analysis of cell-free-produced human endothelin B receptor reveals transmembrane segment 1 as an essential area for ET-1 binding and homodimer formation pptx

Báo cáo khoa học: Functional analysis of cell-free-produced human endothelin B receptor reveals transmembrane segment 1 as an essential area for ET-1 binding and homodimer formation pptx

Báo cáo khoa học

... incubated for 1 h at 25 °C, and purified on Strep-Tactin columns as described above.AcknowledgementsWe are grateful to Clemens Glaubitz and AndreasEngel for valuable discussions, and we thank WalterRosenthal ... 3263Functional analysis of cell-free-produced humanendothelin B receptor reveals transmembrane segment 1as an essential area for ET-1 binding and homodimerformationChristian Klammt1, Ankita Srivastava2, ... Brij78,1%; and digitonin, 0.4%.Cloning procedures and protein analysisCoding regions of full-length ETB and its derivatives wereamplified from cDNA by standard PCR techniques, and the fragments...
  • 13
  • 433
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Toolkit for Multi-Level Alignment and Information Extraction from Comparable Corpora" pptx

Báo cáo khoa học

... term-tagging tools for English, Latvian, Lithuanian, and Romanian, but can be easily extended for other languages if a POS-tagger, a phrase pattern list, a stop-word list, and an inverse document ... Levenshtein distance between term candidates. For evaluation, Eurovoc (Steinberger et al., 2002) was used. Tables 4 and 5 show the performance figures of the mapper for English-Romanian and English-Latvian. ... performance for English-Latvian. 3 Conclusions and Related Information This demonstration paper describes the ACCURAT toolkit containing tools for multi-level alignment and information extraction...
  • 6
  • 289
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Language-Independent Unsupervised Model for Morphological Segmentation" pot

Báo cáo khoa học

... syntac-tic and semantic information from the context theword occurs (Schone and Jurafsky, 2000; Bordag,2006; Yarowski and Wicentowski, 2000; Jacquemin,1997). Exploiting semantic and syntactic informa-tion ... thank Emily Pitler and Samarth Ke-shava for making available the code of the RePortSalgorithm, and Stefan Bordag and Delphine Bern-hard for running their algorithms on the Germandata. Many ... stem candidate auff¨uhr is thenstored together with the suffix candidates {ender,ung, en, t, laune}.Step 2: Ranking candidate stemsThere are two types of affix candidates: type-1 affixcandidates...
  • 8
  • 288
  • 0
Báo cáo Y học: An alternative model for photosystem II/light harvesting complex II in grana membranes based on cryo-electron microscopy studies pptx

Báo cáo Y học: An alternative model for photosystem II/light harvesting complex II in grana membranes based on cryo-electron microscopy studies pptx

Báo cáo khoa học

... withsoftware and Dr S. Prince, Dr S. V. Rue and Prof. G. Garab for useful suggestions and debate. T. D. Flint is thanked f or plant g rowth and specimen p reparation as well as L. Child and P. McPhie for ... Tris/urea-treatedmembranes as determined by SDS/PAGE and Coomassie staining. The left lane shows banda and the right lane band c. Molecular massmarkers are ind icated o n t he left of the panel.Ó FEBS ... adjacent membrane. This ®ts thestructural and biochemical data, w here PSII core complexescan be observed in one discrete plane and membranefraction, and LHCII complexes can be observed in anothermembrane...
  • 11
  • 455
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Topic Similarity Model for Hierarchical Phrase-based Translation" ppt

Báo cáo khoa học

... liketo thank Yun Huang, Zhengxian Gong, WenliangChen, Jun lang, Xiangyu Duan, Jun Sun, JinsongSu and the anonymous reviewers for their insightfulcomments.ReferencesNicola Bertoldi and Marcello ... Hanna M. Wallach, Jason Naradowsky,David A. Smith, and Andrew McCallum. 2009.Polylingual topic models. In Proc. of EMNLP 2009.Franz J. Och and Hermann Ney. 2002. Discriminativetraining and ... 2009.Andreas Stolcke. 2002. Srilm – an extensible languagemodeling toolkit. In Proc. ICSLP 2002.Yik-Cheung Tam, Ian R. Lane, and Tanja Schultz. 2007.Bilingual lsa-based adaptation for statistical...
  • 9
  • 399
  • 0

Xem thêm