automatic extraction of lexicosyntactic patterns

Tài liệu Báo cáo khoa học: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf

Tài liệu Báo cáo khoa học: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf

Ngày tải lên : 20/02/2014, 04:20
... Statistics of the BioScope corpus. The 2nd and 3d columns show the total number of cues within the datasets; the 4th and 5th columns show the percentage of negated and spec- ulative sentences. 70% of ... systems. In future work, we will explore the use of statisti- cally based methods for the creation of an optimum set of lexico-syntactic tree patterns and will evalu- ate the system performance ... neighboring identical siblings of type *scope* or * are replaced by a single node of the corresponding type. Figure 3 shows an example of this transformation. (a) The children of nodes JJ/NN/NN are pruned...
  • 5
  • 543
  • 1
Web Mining and Knowledge Discovery of Usage Patterns

Web Mining and Knowledge Discovery of Usage Patterns

Ngày tải lên : 31/08/2012, 16:46
... the irrelative rules or patterns and to extract the interesting rules or patterns from the output of the pattern discovery process. The output of Web mining algorithms is often not in the form ... such as the problems of the profile data being subjective, as well getting out of date as user preferences change over time. 5.2 User Navigation Pattern The research of user navigation pattern ... Analysis process and makes use of the preprocessed content and structure information to automatically filter the results of the knowledge discovery algorithms for patterns that are potentially...
  • 25
  • 630
  • 3
The Gang of Four patterns_01

The Gang of Four patterns_01

Ngày tải lên : 19/10/2013, 01:20
... patterns Below is a list of the 23 Gang of Four patterns presented in this document: Creational Patterns Abstract Factory Creates an instance of several families of classes Builder Separates ... implementations of the Factory design pattern. Copyright © 2006, Data & Object Factory. All rights reserved. Page 10 of 87 Design Pattern Framework™ 2.0 3. The Gang of Four patterns Below ... accomplished. Builders often encapsulate construction of Composite objects (another design pattern, see Composite pattern) because construction of these structures are often repetitive and complex....
  • 18
  • 352
  • 0
Tài liệu Automatic Management of Network Security Policy pptx

Tài liệu Automatic Management of Network Security Policy pptx

Ngày tải lên : 14/02/2014, 16:20
... actual enforcement of these policies. Many of our design decisions were influenced by the lack of verifiable enforcement mechanisms for certain security phenomena. An example of this is Denial of Service ... provide required by policies. Often it suffices to reason about connectivity to analyze availability of services to groups of users. For instance, instead of modeling all the details of a file server, we ... environment. This generally leads either to over or under management of resourses. One of the specific goals of this work is management of security configurations in networks that span multiple administrative domains...
  • 15
  • 467
  • 0
Tài liệu Báo cáo khoa học: "Cross-Domain Co-Extraction of Sentiment and Topic Lexicons" pdf

Tài liệu Báo cáo khoa học: "Cross-Domain Co-Extraction of Sentiment and Topic Lexicons" pdf

Ngày tải lên : 19/02/2014, 19:20
... values of r in the “product vs. movie” task. Observe that for sentiment word extraction, the results of the proposed methods are not sensitive to the values of r. While for the topic word extraction, ... and Ryan McDonald. 2008. A joint model of text and aspect ratings for sentiment summarization. In Proceedings of the 46th Annual Meeting of the As- sociation of Computational Linguistics: Human ... study the effect of different parameter settings. There are several parameters in the framework: the number of generated seeds r, the number of new candidates k 2 and the number of selections k...
  • 10
  • 447
  • 0
Tài liệu Báo cáo khoa học: "Robust Extraction of Named Entity Including Unfamiliar Word" doc

Tài liệu Báo cáo khoa học: "Robust Extraction of Named Entity Including Unfamiliar Word" doc

Ngày tải lên : 20/02/2014, 09:20
... Extraction of Japanese Named Entity 2.1 Task of the IREX Workshop The task of NE extraction of the IREX workshop (Sekine and Eriguchi, 2000) is to recognize eight NE types in Table 1. The organizer of ... features of original morphemes and fea- tures of similar morphemes. The experiments of extracting Japanese NEs from IREX corpus and NHK corpus show the effectiveness of the proposed method. 2 Extraction ... 2003; Nakano and Hirai, 2004) formalized the task of extracting NEs as a chunking problem of a sequence of characters instead of a sequence of morphemes. In this paper, we keep the naive formal- ization,...
  • 4
  • 384
  • 1
Tài liệu Báo cáo khoa học: "Automatic learning of textual entailments with cross-pair similarities" ppt

Tài liệu Báo cáo khoa học: "Automatic learning of textual entailments with cross-pair similarities" ppt

Ngày tải lên : 20/02/2014, 12:20
... ex- amples of the previous section. From the point of view of bag -of- word methods, the pairs (T 1 , H 1 ) and (T 1 , H 2 ) have both the same intra-pair simi- larity since the sentences of T 1 and ... head of constituents. The example of Fig. 1 shows that the placeholder 0 climbs up to the node governing all the NPs. 5.3 Pruning irrelevant information in large text trees Often only a portion of ... t, the set of its nodes N (t), and a set of anchors, we build a tree t  with all the nodes N  that are anchors or ancestors of any anchor. Moreover, we add to t  the leaf nodes of the original...
  • 8
  • 413
  • 0
Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx

Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx

Ngày tải lên : 20/02/2014, 12:20
... the polarity of words There are some works that discuss learning the po- larity of words instead of sentences. Hatzivassiloglou and McKeown proposed a method of learning the polarity of adjectives ... are not in the resources. 458 (1) kono software-no riten-ha hayaku ugoku koto this software-POST advantage-POS T quickly run to The advantage of this software is to run quickly. (2) ketten-ha jikan-ga ... polarity of each sentence. This is simi- lar to the extraction from the itemization. 4.3 Extraction based on linguistic pattern The third method uses linguistic pattern. The char- acteristic of this...
  • 8
  • 409
  • 0
Tài liệu Báo cáo khoa học: "Automatic Identification of Pro and Con Reasons in Online Reviews" ppt

Tài liệu Báo cáo khoa học: "Automatic Identification of Pro and Con Reasons in Online Reviews" ppt

Ngày tải lên : 20/02/2014, 12:20
... examples of sen- tences that our system identified as reasons of complaints. (1) Unfortunately, I find that I am no longer comfortable in your establishment because of the unprofessional, ... Sources of Opinions with Conditional Random Fields and Extraction Pat- terns. Proceedings of HLT/EMNLP-05. Esuli, Andrea and Fabrizio Sebastiani. 2005. De- termining the semantic orientation of ... Orientation of Adjectives. Proceedings of 35th Annual Meet- ing of the Assoc. for Computational Linguistics (ACL-97): 174-181 Hatzivassiloglou, Vasileios and Janyce Wiebe. 2000. Effects of Adjective...
  • 8
  • 461
  • 1
Tài liệu Báo cáo khoa học: "Automatic Evaluation of Sentence-Level Fluency Andrew Mutton∗" pdf

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Sentence-Level Fluency Andrew Mutton∗" pdf

Ngày tải lên : 20/02/2014, 12:20
... grammar, rhythm and flow, appropriateness of tone, and several other specific characteristics of good text. In terms of automatic evaluation, we are not aware of any technique that measures only fluency ... Methods PoStag In the first of these, we constructed a rough approximation of typical sentence grammar structure by taking bigrams over part -of- speech tags. 6 Then, given a string of PoS tags of length n, t 1 . ... can be fooled by the method of sentence generation; GLEU, how- ever, gives a consistent estimate of fluency regard- less of generation type; and, across all types of gen- erated sentences examined...
  • 8
  • 507
  • 0
Tài liệu Báo cáo khoa học: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc

Ngày tải lên : 20/02/2014, 16:20
... construction of N-best translation lexicons from parallel text. Melamed (1995) used the ratio (LCSR) between the length of the LCS of two words and the length of the longer word of the two ... present the evaluations of ROUGE-L, ROUGE-S, and compare their per- formance with other automatic evaluation meas- ures. 5 Evaluations One of the goals of developing automatic evalua- tion ... Proceedings of COLING-92, Nantes, France. Thompson, H. S. 1991. Automatic Evaluation of Translation Quality: Outline of Methodology and Report on Pilot Experiment. In Proceedings of the Evaluator’s...
  • 8
  • 442
  • 0
Tài liệu Báo cáo khoa học: "Automatic clustering of collocation for detecting practical sense boundary" ppt

Tài liệu Báo cáo khoa học: "Automatic clustering of collocation for detecting practical sense boundary" ppt

Ngày tải lên : 20/02/2014, 16:20
... 1 shows the average number of clusters with each clustering method shown chapter 3 by the part of speech. WC and WF are the average number of senses by the part of speech. In Table 1 and ... the word senses numbered i of the word x. I x is the word sense indexing function of x that gives an index to each sense of the word x. All contextual words x i ±j of a central word x have ... is like this: the contextual words used in the same sense of the central word show the similar pattern of context. If collocation patterns between contextual words are similar, it means that...
  • 4
  • 425
  • 0
Tài liệu Báo cáo khoa học: "Counter-Training in Discovery of Semantic Patterns" doc

Tài liệu Báo cáo khoa học: "Counter-Training in Discovery of Semantic Patterns" doc

Ngày tải lên : 20/02/2014, 16:20
... semantic class. (Riloff and Jones, 1999; Riloff, 1996; Yangarber et al., 2000) present different combinations of learners of patterns and concept classes specifically for IE. In (Riloff, 1996) the ... as in (Yangarber et al., 2000) (3) where is the set of accepted patterns that match ; this is a rough estimate of the likelihood of relevance of , based on the pattern accuracy mea- sure. Pattern ... capitalization rules of conventional proper names. 7 The two papers appeared within two months of each other. 8 A view, in the sense of relational algebra, is a sub-set of features of the data-points....
  • 8
  • 423
  • 0
Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

Ngày tải lên : 20/02/2014, 16:20
... (20%) out of 210 terms were col- lected by the system. This low recall primarily comes from the failure of automatic term recogni- tion (case A in the above classification). Improve- ment of this ... collected 610 terms in total; the average number of output terms per input is 12.2 terms. We checked whether each of the 610 terms is a correct related term of the original seed term by hand. The result ... issue: Japanese term extraction. Terminolgy, 6(2). Kyo Kageura and Bin Umino. 1996. Methods of au- tomatic term recognition: A review. Terminology, 3(2):259–289. Hiroshi Nakagawa. 2000. Automatic term...
  • 4
  • 437
  • 0
Tài liệu Báo cáo khoa học: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc

Tài liệu Báo cáo khoa học: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc

Ngày tải lên : 20/02/2014, 21:20
... The completeness of the output list increases monotonically with the total number of occurrences of each verb in the corpus. False positive rates are one to three percent of observa- tions. ... architecture of the system, and that of this pa- per, directly reflects the three challenges described above. The system consists of three modules: 1. Verb detection: Finds some occurrences of verbs ... is evaluated in terms of efficiency and accuracy. The most useful estimate of effi- ciency is simply the density of observations in the corpus, shown in the first column of Table 3. The SF...
  • 6
  • 416
  • 0