0

terms from the web

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

Báo cáo khoa học

... indidates that the term passed the test. Twenty terms out of the thrity candidate terms passed the first techinical-term test (Tech.) and six-teen terms out of the twenty terms passed the secondrelation ... target terms that should becollected from each seed word, and then checkedwhether each of the target terms was included in the system output. We counted the number of tar-get terms in the following ... examined whether each of the 210 target terms passes the filtering step. The result was that133 (63%) terms passed; 44 terms did not satisfy the condition H(x) ≥ 100; 15 terms did not satisfy the condition...
  • 4
  • 437
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Extraction and Approximation of Numerical Attributes from the Web" pdf

Báo cáo khoa học

... strong PMI with the ob-ject (this can be estimated using any fixed corpus).However, this is not essential.We then extract new terms from the retrieved web snippets and use these terms iteratively ... width 1.695m]’). We then extract new pat-terns from the retrieved search engine snippets andre-query the Web with the new patterns to obtainmore attribute values.We provided the framework with ... value for the givenobject. During the first stage it is possible thatwe directly extract from the text a set of valuesfor the requested object. The bounds processingstep rejects some of these...
  • 10
  • 465
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A DOM Tree Alignment Model for Mining Parallel Data from the Web" doc

Báo cáo khoa học

... that, using the new web mining scheme, the web mining throughput is increased by 32%; (ii) The quality of the mined data is improved. By lever-aging the web pages’ HTML structures, the sen-tence ... English-Chinese parallel data from the web. The mining procedure is initiated by acquiring Chinese website list. We have downloaded about 300,000 URLs of Chinese websites from the web directories at ... (1) Given a web site, the root page and web pages directly linked from the root page are downloaded. Then for each of the downloaded web page, all of its anchor texts (i.e. the hyperlinked...
  • 8
  • 435
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Acquisition of Ranked Qualia Structures from the Web" potx

Báo cáo khoa học

... (not calculated over the Web) as well as the conditional probability cal-culated over the Web (Web- P) delivered the best re-sults, while the PMI-based ranking measure yielded the worst results. ... coefficient (Web- Jac), the PointwiseMutual Information (Web- PMI) and the conditionalprobability (Web- P). We also present a version of the conditional probability which does not use the Web but merely ... appropriatequeries to the web search engine and choosing the article leading to the highest number of results. The corresponding patterns are then matched in the 50snippets returned by the search engine...
  • 8
  • 378
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Mining Parenthetical Translations from the Web by Word Alignment" potx

Báo cáo khoa học

... pairs, where the translation of the in-parenthesis terms is a suffix of the pre-parenthesis text. The lengths and frequency counts of the suffixes have been used to determine what is the translation ... our modified version of the competitive link-ing algorithm, the link score of a pair of words is the sum of the φ2 scores of the words themselves, their prefixes and their suffixes. In addition ... English word in the Wikipedia data, we first find whether there is a translation for the word in the extracted translation pairs. The Coverage of the Wikipedia data is measured by the percentage...
  • 9
  • 612
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Semantic Class Learning from the Web with Hyponym Pattern Linkage Graphs" pdf

Báo cáo khoa học

... hyponym patterns toextract class instances from the web and then evalu-ates them further by computing mutual informationscores based on web queries. The work by (Widdows and Dorow, 2002) on lex-ical ... to instantiate the pattern. On the first iteration, the pattern is given to Google as a web query, and new class members are extracted from the retrieved text snippets. We wanted the system to ... progresses. Initially, the seed is the onlytrusted class member and the only vertex in the graph. The bootstrapping process begins by instan-tiating the doubly-anchored pattern with the seedclass...
  • 9
  • 340
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Extracting Hypernym Pairs from the Web" potx

Báo cáo khoa học

... relations from the web. Wecompare our approach with hypernym ex-traction from morphological clues and from large text corpora. We show that the abun-dance of available data on the web enablesobtaining ... reason, we are interested in em-ploying the web for the extraction of hypernym re-lations. We are especially curious about whether the size of the web allows to achieve meaningful resultswith ... the two web ex-periments and a combination of the best web ap-proach with the morphological approach. The con-junctive web pattern N en N rates best, because of itshigh frequency. The recall...
  • 4
  • 395
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Compiling French-Japanese Terminologies from the Web" pptx

Báo cáo khoa học

... considered related terms of the seed. 3 Term Alignment Once we have collected related terms in both French and Japanese, we must link the terms in the source language to the terms in the target language. ... around the seed. 2.2 Automatic Term Recognition The next step is to extract candidate related terms from the corpus. Because the sentences compos-ing the corpus are related to the seed, the ... use bilingual corpora collected from the web and tailor made for the seed terms. For each language, we extract from the corpus a set of MWTs pertaining to the seed’s semantic domain, and...
  • 8
  • 372
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Extracting Sequences from the Web" pptx

Báo cáo khoa học

... Example the ORD the fifth the RB ORD the very first the JJS the best the RB JJS the very best the ORD JJS the third biggest the RBS JJ the most popular the ORD RBS JJ the second least likelyTable 2: The ... some cases, the ordering rela-tion of the sequence name was ambiguous (e.g.,2We queried for both the numeric form of the ordinal and the number spelled out (e.g the 2nd ” and the second ”).We ... extractions.We then randomly sampled and manually la-beled 2, 000 of these extractions for evaluation.We did a Web search to verify the correctness of the sequence name s and that x is the kth item inthe...
  • 5
  • 309
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Unsupervised Relation Extraction by Mining Wikipedia Texts Using Information from the Web" pdf

Báo cáo khoa học

... leveraging the vast size of the Web. Our hypothesis is that there exist some key terms and patterns that provide clues to the rela-tions between pairs. From the snippets retrievedby the search ... heterogeneous text on the Web. Therefore, we do not parse informa-tion from the Web corpus, but from well writtentexts. Particularly, we specifically examine unsu-pervised relation extraction from existing ... sequence of words. The distanceof two surface patterns is defined as the fraction of the LD value to the length of the longer sequence.For estimating the number of clusters k, we ap-ply the stability-based...
  • 9
  • 345
  • 0
Báo cáo khoa học: Subunit sequences of the 4 · 6-mer hemocyanin from the golden orb-web spider, Nephila inaurata Intramolecular evolution of the chelicerate hemocyanin subunits pot

Báo cáo khoa học: Subunit sequences of the 4 · 6-mer hemocyanin from the golden orb-web spider, Nephila inaurata Intramolecular evolution of the chelicerate hemocyanin subunits pot

Báo cáo khoa học

... assuming that the LpoHc2 and the a-subunits ofN. inaurata and E. californicum on the one hand, andTtrHcA and the arachnid g-subunits on the other hand areorthologous proteins (see above). The fossil ... allows the unambiguous assignment todistinct subunit types. The orthologous subunits of thesespecies share 69.1–76.2% of their amino acids, with the asubunits being the most conserved and the ... studies The web- based tools provided by the ExPASy MolecularBiology Server of the Swiss Institute of Bioinformatics(http://www.expasy.org) and the programGENEDOC2.6[25] were used for the analyses...
  • 8
  • 415
  • 0
A study on the translation of technical terms from English into Vietnamese

A study on the translation of technical terms from English into Vietnamese

Khoa học xã hội

... shift from the concrete to the abstract or from the abstract to the concrete. Metaphor : giving a new meaning by analogy to that of an established terms. Metonymy: taking the part of for the ... reproduces the matter without the manner, or the content with out of the form of the original. The advantage of this type of translation is that the text in TL sounds more natural. On the contrary, the ... and they keep their lives and meaning s only when they serve the system of knowledge that create them Despite the distinction between term and word mentioned above, the boundary between them...
  • 52
  • 811
  • 2
The difficulties and suggested solutions in translating tourism terms from English into Vietnamese

The difficulties and suggested solutions in translating tourism terms from English into Vietnamese

Khoa học xã hội

... công tác: Trường Đại Học Dân Lập Hải Phòng Nội dung hướng dẫn: The difficulties and suggested solutions in translating tourism terms from English into Vietnamese Người hướng dẫn thứ hai: Họ và ... LANGUAGES DEPARTMENT GRADUATION PAFER THE DIFFICULTIES AND SUGGESTED SOLUTIONS IN TRANSLATING TOURISM TERMS FROM ENGLISH INTO VIETNAMESE By: Nguyễn Thị Biên...
  • 73
  • 1,077
  • 3
A study on the translation of technical terms in the interface of common business website and their Vietnamese equivalent

A study on the translation of technical terms in the interface of common business website and their Vietnamese equivalent

Khoa học xã hội

... join them together, “Tool” modifies “bar” and they form a new word – “Toolbar”. Similarly, the second and the third example are in the same rule. 1.2.2. Adverb + Noun In technical terms, another ... study.However,due to the limitation of time and knowledge,my study could not cover all the terms, but only focus on the popular terms in the interface of common business websites. But hopefully these will ... seeks to translate the meaning of the original in such as way that the TL wording will trigger the same impact on the TC audience as the original wording did upon the ST audience. They argue that...
  • 72
  • 743
  • 1

Xem thêm

Tìm thêm: hệ việt nam nhật bản và sức hấp dẫn của tiếng nhật tại việt nam xác định các mục tiêu của chương trình xác định các nguyên tắc biên soạn khảo sát các chuẩn giảng dạy tiếng nhật từ góc độ lí thuyết và thực tiễn khảo sát chương trình đào tạo của các đơn vị đào tạo tại nhật bản khảo sát chương trình đào tạo gắn với các giáo trình cụ thể xác định thời lượng học về mặt lí thuyết và thực tế tiến hành xây dựng chương trình đào tạo dành cho đối tượng không chuyên ngữ tại việt nam điều tra đối với đối tượng giảng viên và đối tượng quản lí khảo sát thực tế giảng dạy tiếng nhật không chuyên ngữ tại việt nam khảo sát các chương trình đào tạo theo những bộ giáo trình tiêu biểu phát huy những thành tựu công nghệ mới nhất được áp dụng vào công tác dạy và học ngoại ngữ hệ số công suất cosp fi p2 đặc tuyến hiệu suất h fi p2 đặc tuyến dòng điện stato i1 fi p2 động cơ điện không đồng bộ một pha thông tin liên lạc và các dịch vụ phần 3 giới thiệu nguyên liệu từ bảng 3 1 ta thấy ngoài hai thành phần chủ yếu và chiếm tỷ lệ cao nhất là tinh bột và cacbonhydrat trong hạt gạo tẻ còn chứa đường cellulose hemicellulose chỉ tiêu chất lượng 9 tr 25