0

collecting thai unknown words from the web

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Extraction and Approximation of Numerical Attributes from the Web" pdf

Báo cáo khoa học

... each kind. These patterns are the onlyattribute-specific resource in our framework.Value extraction. The first pattern group,Pvalues, allows extraction of the attribute values from the Web. All ... width 1.695m]’). We then extract new pat-terns from the retrieved search engine snippets andre-query the Web with the new patterns to obtainmore attribute values.We provided the framework with ... value for the givenobject. During the first stage it is possible thatwe directly extract from the text a set of valuesfor the requested object. The bounds processingstep rejects some of these...
  • 10
  • 465
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

Báo cáo khoa học

... query is a term, its hitis the number of pages that contain the term on the Web. We use the following notation.H(x)= the number of pages that contain the term x” The number H (x) can be used ... in the compiled corpus.R: the target term did not exist on the collected web pages.Only 43 terms (20%) out of 210 terms were col-lected by the system. This low recall primarilycomes from the ... explanation of the term.4. There are several technical terms that are re-lated to the term.We have implemented the checking program of the first two conditions in the system: the thirdconditioncan...
  • 4
  • 437
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A DOM Tree Alignment Model for Mining Parallel Data from the Web" doc

Báo cáo khoa học

... Given a web site, the root page and web pages directly linked from the root page are downloaded. Then for each of the downloaded web page, all of its anchor texts (i.e. the hyperlinked words ... English-Chinese parallel data from the web. The mining procedure is initiated by acquiring Chinese website list. We have downloaded about 300,000 URLs of Chinese websites from the web directories at ... that, using the new web mining scheme, the web mining throughput is increased by 32%; (ii) The quality of the mined data is improved. By lever-aging the web pages’ HTML structures, the sen-tence...
  • 8
  • 435
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx

Báo cáo khoa học

... (not calculated over the Web) as well as the conditional probability cal-culated over the Web (Web- P) delivered the best re-sults, while the PMI-based ranking measure yielded the worst results. ... coefficient (Web- Jac), the PointwiseMutual Information (Web- PMI) and the conditionalprobability (Web- P). We also present a version of the conditional probability which does not use the Web but merely ... improved the resultsof the Jaccard measure by about 15%.6We determine this number experimentally as the number of web pages containing the words the and ’and’.891Proceedings of the 45th...
  • 8
  • 378
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Mining Parenthetical Translations from the Web by Word Alignment" potx

Báo cáo khoa học

... our modified version of the competitive link-ing algorithm, the link score of a pair of words is the sum of the φ2 scores of the words themselves, their prefixes and their suffixes. In addition ... of their scores, selecting pairs based on the resultant order. A pair of words is linked if none of the two words were previously linked to any other words. The algorithm terminates when there ... pairs, where the translation of the in-parenthesis terms is a suffix of the pre-parenthesis text. The lengths and frequency counts of the suffixes have been used to determine what is the translation...
  • 9
  • 612
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Semantic Class Learning from the Web with Hyponym Pattern Linkage Graphs" pdf

Báo cáo khoa học

... hyponym patterns toextract class instances from the web and then evalu-ates them further by computing mutual informationscores based on web queries. The work by (Widdows and Dorow, 2002) on lex-ical ... to instantiate the pattern. On the first iteration, the pattern is given to Google as a web query, and new class members are extracted from the retrieved text snippets. We wanted the system to ... progresses. Initially, the seed is the onlytrusted class member and the only vertex in the graph. The bootstrapping process begins by instan-tiating the doubly-anchored pattern with the seedclass...
  • 9
  • 340
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Extracting Hypernym Pairs from the Web" potx

Báo cáo khoa học

... relations from the web. Wecompare our approach with hypernym ex-traction from morphological clues and from large text corpora. We show that the abun-dance of available data on the web enablesobtaining ... A, B and C are siblings of each otherHere, sibling refers to the relative position of the words in the hypernymy tree. Two words are sib-lings of each other if they share a parent.We compute ... the two web ex-periments and a combination of the best web ap-proach with the morphological approach. The con-junctive web pattern N en N rates best, because of itshigh frequency. The recall...
  • 4
  • 395
  • 0
The meaning of tingo and other extraordinary words from around the world

The meaning of tingo and other extraordinary words from around the world

Anh ngữ phổ thông

... bare the fangs like a dog laglerolarpok (Inuit) the gnashing of teeth kashr (Persian) displaying the teeth in laughter zhaghzhagh (Persian) the chattering of the teeth from the cold or from ... rights reserved THE LIBRARY OF CONGRESS HAS CATALOGED THE HARDCOVER EDITION AS FOLLOWS: Jacot de Boinod, Adam. The meaning of tingo and other extraordinary words from around the world / Adam ... hands pressed together in salutation Legging it Undue attention is put on their shapeliness but the bottom line is it’s good to have two of them and they should, ideally, be the same length:...
  • 223
  • 671
  • 3
Báo cáo khoa học: Subunit sequences of the 4 · 6-mer hemocyanin from the golden orb-web spider, Nephila inaurata Intramolecular evolution of the chelicerate hemocyanin subunits pot

Báo cáo khoa học: Subunit sequences of the 4 · 6-mer hemocyanin from the golden orb-web spider, Nephila inaurata Intramolecular evolution of the chelicerate hemocyanin subunits pot

Báo cáo khoa học

... assuming that the LpoHc2 and the a-subunits ofN. inaurata and E. californicum on the one hand, andTtrHcA and the arachnid g-subunits on the other hand areorthologous proteins (see above). The fossil ... allows the unambiguous assignment todistinct subunit types. The orthologous subunits of thesespecies share 69.1–76.2% of their amino acids, with the asubunits being the most conserved and the ... studies The web- based tools provided by the ExPASy MolecularBiology Server of the Swiss Institute of Bioinformatics(http://www.expasy.org) and the programGENEDOC2.6[25] were used for the analyses...
  • 8
  • 415
  • 0
Expert Service Oriented Architecture in C Sharp  Using the Web Services Enhancements

Expert Service Oriented Architecture in C Sharp Using the Web Services Enhancements

Kỹ thuật lập trình

... codedirectly in the code-behind file of the .asmx Web service. But in a service-orientedarchitecture, it is important to design the Web service components themselves sothat they truly act as ... to understanding the material in the second half of the book. The remaining chapters of the book cover all of the WS-Specifications that are imple-mented by WSE 2.0. Finally, the book closes ... and the characteristics of a Web service from the perspective of SOA. This chap-ter reviews the following topics:• SOA concepts and application architecture• The WS-I Basic Profile• The WS-Specifications•...
  • 336
  • 841
  • 2

Xem thêm