learning to extract relations from the web using minimal supervision

Tài liệu Báo cáo khoa học: "Coreference for Learning to Extract Relations: Yes, Virginia, Coreference Matters" ppt

Tài liệu Báo cáo khoa học: "Coreference for Learning to Extract Relations: Yes, Virginia, Coreference Matters" ppt

Ngày tải lên : 20/02/2014, 04:20
... ability to learn the relations. 5.2 Precision Results on precision are mixed. While for 4 of the relations +Coref is higher, for the 6 others the addition of coreference reduces precision. The ... match the mentions X=her and Y=son, and build the relation instance hasChild(Ethel Kennedy, Robert F. Kennedy Jr.). During assessment, the annotator is asked whether, in the context of the ... names, -Coref may need to use more specific, complex patterns to learn the instance (e.g. “Sue asked her son, Bob, to set the table”). We expect the ability to run using a ‘denser,’ more local...
  • 6
  • 381
  • 0
Báo cáo khoa học: "Semantic Class Learning from the Web with Hyponym Pattern Linkage Graphs" pdf

Báo cáo khoa học: "Semantic Class Learning from the Web with Hyponym Pattern Linkage Graphs" pdf

Ngày tải lên : 17/03/2014, 02:20
... used to instantiate the pattern. On the first iteration, the pattern is given to Google as a web query, and new class members are extracted from the retrieved text snippets. We wanted the system to ... 2005) also uses hyponym patterns to extract class instances from the web and then evalu- ates them further by computing mutual information scores based on web queries. The work by (Widdows and Dorow, ... seed class member, issuing a web query to generate new candidate instances, and adding these new instances to the graph. A score is then assigned to every node in the graph, using one of several different...
  • 9
  • 340
  • 0
Tài liệu Module 1: Introduction to Exchange 2000 and the Web Storage System pdf

Tài liệu Module 1: Introduction to Exchange 2000 and the Web Storage System pdf

Ngày tải lên : 11/12/2013, 14:15
... Stores Storage Group A Store Store Store Store Store Store Store Store Store Store Transaction Log Transaction Log Transaction Log Storage Group B Store Store Store Store Store Store Store Store Store Store Transaction ... items in the Web Storage System, you cannot use the protocols to view items and their properties. In these circumstances, you may want to use Web forms. Web Form Functionality By using Web forms, ... while they are streaming to the client instead of waiting for the entire file to download. Topic Objective To list the database features of the Web Storage System Lead-in Each Exchange store...
  • 50
  • 671
  • 0
Tài liệu How To Acquire Customers On The Web pptx

Tài liệu How To Acquire Customers On The Web pptx

Ngày tải lên : 13/12/2013, 14:15
... CDnow Web site and actually bought a CD, CDnow gave 3% of the revenue from the sale back to the af- filiate. That gave member Web sites the inducement they needed to join the program and provided them ... marketing expenditures to set themselves out from the crowd, inspire Web shop- pers to visit their sites, and then get them to actually make a purchase. Many e-tailers, in fact, are averag- ing more than $100 to ... various compact discs to their cyber- browsers that they could then pur- chase at CDnow’s site. The links that such sites placed next to their music reviews gave their visitors the option to effortlessly...
  • 8
  • 568
  • 0
Tài liệu Báo cáo khoa học: "Extraction and Approximation of Numerical Attributes from the Web" pdf

Tài liệu Báo cáo khoa học: "Extraction and Approximation of Numerical Attributes from the Web" pdf

Ngày tải lên : 20/02/2014, 04:20
... also extract bounds and comparison informa- tion in order to verify the extracted values and to approximate the missing ones. To allow us to extract attribute-specific informa- tion, we provided the ... width 1.695m]’). We then extract new pat- terns from the retrieved search engine snippets and re-query the Web with the new patterns to obtain more attribute values. We provided the framework with ... with the ob- ject (this can be estimated using any fixed corpus). However, this is not essential. We then extract new terms from the retrieved web snippets and use these terms iteratively to re- trieve...
  • 10
  • 465
  • 0
Tài liệu Báo cáo khoa học: "A Composite Kernel to Extract Relations between Entities with both Flat and Structured Features" ppt

Tài liệu Báo cáo khoa học: "A Composite Kernel to Extract Relations between Entities with both Flat and Structured Features" ppt

Ngày tải lên : 20/02/2014, 12:20
... trees. Their tree kernels require the match- able nodes to be at the same layer counting from the root and to have an identical path of ascend- ing nodes from the roots to the current nodes. The ... other feature-based kernels. We can also benefit from machine learning algorithms to study how to solve the data imbalance and sparseness issues from the learning algorithm viewpoint. In the ... the occurrence of each sub-tree without considering the layer and the ancestors of the root node of the sub-tree, our method is not limited by the constraints (identi- cal layer and ancestors...
  • 8
  • 467
  • 0
Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

Ngày tải lên : 20/02/2014, 16:20
... query. In case the query is a term, its hit is the number of pages that contain the term on the Web. We use the following notation. H(x)= the number of pages that contain the term x” The number H ... that the system can be used as a tool that helps us compile a glossary. Second, we tried to examine the recall of the system. It is impossible to calculate the actual re- call value, because the ... defined. To estimate the recall, we first prepared three to five target terms that should be collected from each seed word, and then checked whether each of the target terms was included in the system...
  • 4
  • 437
  • 0
Báo cáo khoa học: "A DOM Tree Alignment Model for Mining Parallel Data from the Web" doc

Báo cáo khoa học: "A DOM Tree Alignment Model for Mining Parallel Data from the Web" doc

Ngày tải lên : 08/03/2014, 02:21
... that, using the new web mining scheme, the web mining throughput is increased by 32%; (ii) The quality of the mined data is improved. By lever- aging the web pages’ HTML structures, the sen- tence ... downloaded from the Department of Justice of the Hong Kong Special Administrative Region website. Recently, web mining systems have been built to automatically acquire parallel data from the web. ... parallel data from the web. The mining procedure is initiated by acquiring Chinese website list. We have downloaded about 300,000 URLs of Chinese websites from the web directories at cn.yahoo.com,...
  • 8
  • 435
  • 0
Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx

Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx

Ngày tải lên : 08/03/2014, 02:21
... approach to automatically learning qualia structures from the Web. Such an approach is especially interesting either for lexicog- 894 matched. On the basis of these, we then calculate the probability ... appropriate queries to the web search engine and choosing the article leading to the highest number of results. The corresponding patterns are then matched in the 50 snippets returned by the search engine ... (Web- Jac) measure relies on the web search engine to calculate the number of documents in which x and y co-occur close to each other, divided by the number of documents each one occurs, i.e. Web- Jac(x,...
  • 8
  • 378
  • 0
Báo cáo khoa học: "Question Answering System in the Web using the Web" doc

Báo cáo khoa học: "Question Answering System in the Web using the Web" doc

Ngày tải lên : 08/03/2014, 21:20
... meaning a phrase pattern. Then, these patterns are searched in the Web (using Google at the moment) and the system extracts the first 100 document snippets created by the search engine. Some ... system. It tries to take advantage of the great amount of in- formation existent in the World Wide Web. Since Portuguese is one of the most used languages in the web and the web itself is a ... * L), through the first 100 snippets resulting from the web search; where F is the n-gram frequency, S is the score of the search pattern that recovered the document and L is the n-gram length....
  • 4
  • 454
  • 0
A Complete Guide for All Ages: Easy to understand information from the nation’s leaders in women’s health doc

A Complete Guide for All Ages: Easy to understand information from the nation’s leaders in women’s health doc

Ngày tải lên : 14/03/2014, 12:20
... Straight-ahead information on the things we can do to stay healthy, tests we should get to monitor our health, how to cope with disease, and how to talk with our doctors. Simply put, how to take charge of ... hormone therapy is started may be the key to whether this therapy reduces your chanc- es of getting heart disease. Most of the women in the NIH study did not start menopausal hormone therapy ... attack, the injured area of the heart muscle is re- placed by scar tissue. is weakens the pumping action of the heart. carry blood to the heart. Over time, this buildup causes the arteries to...
  • 177
  • 560
  • 0
Báo cáo khoa học: "Mining Parenthetical Translations from the Web by Word Alignment" potx

Báo cáo khoa học: "Mining Parenthetical Translations from the Web by Word Alignment" potx

Ngày tải lên : 17/03/2014, 02:20
... is too short 2 the extracted translation is too long 3 the extracted translation contains only the last name * the extracted term is completely wrong. Note that Exact Match is a rather ... with top φ 2 In our modified version of the competitive link- ing algorithm, the link score of a pair of words is the sum of the φ 2 scores of the words themselves, their prefixes and their ... pairs, where the translation of the in-parenthesis terms is a suffix of the pre-parenthesis text. The lengths and frequency counts of the suffixes have been used to determine what is the translation...
  • 9
  • 612
  • 0
Báo cáo khoa học: "Extracting Hypernym Pairs from the Web" potx

Báo cáo khoa học: "Extracting Hypernym Pairs from the Web" potx

Ngày tải lên : 17/03/2014, 04:20
... hypernym relations from the web. We compare our approach with hypernym ex- traction from morphological clues and from large text corpora. We show that the abun- dance of available data on the web ... ap- plied to large and very large text corpora. Today, the web contains more data than the largest available text corpus. For this reason, we are interested in em- ploying the web for the extraction ... about whether the size of the web allows to achieve meaningful results with basic extraction techniques. In section two we introduce the task, hypernym extraction. Section three presents the results...
  • 4
  • 395
  • 0
Báo cáo khoa học: "Compiling French-Japanese Terminologies from the Web" pptx

Báo cáo khoa học: "Compiling French-Japanese Terminologies from the Web" pptx

Ngày tải lên : 17/03/2014, 22:20
... candidate related terms from the corpus. Because the sentences compos- ing the corpus are related to the seed, the same should be true for the terms they contain. The process of extracting terms is ... translation. They use a compositional method to generate a set of translation candidates from which they select the most likely translation by using empirical evidence from the web. The method ... precedence to the alignments obtained with the more accurate methods. Con- sequently, we start by adding the alignments in FJ to the output set. Then, we augment it with the alignments from FJJ...
  • 8
  • 372
  • 0
Báo cáo khoa học: "Extracting Sequences from the Web" pptx

Báo cáo khoa học: "Extracting Sequences from the Web" pptx

Ngày tải lên : 23/03/2014, 16:20
... very first the JJS the best the RB JJS the very best the ORD JJS the third biggest the RBS JJ the most popular the ORD RBS JJ the second least likely Table 2: The patterns used by SEQ to detect ... both the numeric form of the ordinal and the number spelled out (e.g the 2nd ” and the second ”). We took up to 100 results per query. 288 Pattern Example the ORD the fifth the RB ORD the very ... features: the total con- fidence totalConf(x, k, s|C) and the same total confidence normalized to sum to 1 over all x, hold- ing k and s constant. To train the classifier, we use a set of extractions...
  • 5
  • 309
  • 0

Xem thêm