meaningful clustering of senses

Document (scientific report): "Selecting the “Right” Number of Senses Based on Clustering Criterion Functions" pdf

Uploaded: 22/02/2014, 02:20
... distribution of PK2’s predictions were most like those of the actual senses. 4 Conclusions This paper shows how to use clustering criterion functions as a means of automatically selecting the number of senses ... (PK1-3). 112 Selecting the “Right” Number of Senses Based on Clustering Criterion Functions Ted Pedersen and Anagha Kulkarni Department of Computer Science University of Minnesota, Duluth Duluth, MN 55812 ... determining the number of senses in which an ambiguous word is used in a large corpus. It is based on the use of global criterion functions that assess the quality of a clustering solution. 1...
Document (scientific report): "Inducing Gazetteers for Named Entity Recognition by Large-scale Clustering of Dependency Relations" ppt

Uploaded: 20/02/2014, 09:20
... semantics of MNs well, the MN clusters constructed by using dependency relations should serve as a good gazetteer. However, the high level of computational cost has prevented the use of clustering for ... for storing only a part of classes C l , i.e., 1/|P| of the parameter matrix, where P is the number of cluster nodes. This data splitting enables linear scalability of memory sizes. However, ... and, in terms of execution speed, may 4 Acknowledgements: This corpus was provided by Dr. Daisuke Kawahara of NICT. 5 To be precise, we need two copies of these. 6 Each node has a copy of the training...
Document (scientific report): "Automatic clustering of collocation for detecting practical sense boundary" ppt

Uploaded: 20/02/2014, 16:20
... number of clusters with each clustering method shown chapter 3 by the part of speech. WC and WF are the average number of senses by the part of speech. In Table 1 and 2, the most clustering ... • Fuzzy clustering (F1, F2) 2 (Song, Cao and Bruza, 2003) Used clustering methods cover both the popularity and the variety of the algorithms – soft and hard clustering and graph clustering ... gives the word senses numbered i of the word x. I x is the word sense indexing function of x that gives an index to each sense of the word x. All contextual words x i ±j of a central word...
Scientific report: A hybrid clustering of protein binding sites ppt

Uploaded: 15/03/2014, 10:20
... 5. Number of binding sites contained in clusters as a function of the number of clusters allowed to be used (gp = 1 ⁄ 10). The color coding is given in Table 2. A hybrid clustering of protein ... ‘silhouette value’ of a cluster is the smallest possible distance between an element of this cluster and an element of the neighboring clusters. The silhouette coefficient of the overall clustering is ... the distance function and clustering algorithm, three main parameters affected the properties of clustering: OPTICS MINPTS, OPTICS cut-off level and gap penalty (gp) of the distance function....
Scientific report: "Automatic Retrieval and Clustering of Similar Words" potx

Uploaded: 17/03/2014, 07:20
... ||cell, subj-of, adapt||=1 ||cell, subj-of, behave||=1 ||cell, pobj-of, in||=159 ||cell, pobj-of, inside||=16 ||cell, pobj-of, into||=30 ||cell, nmod-of, abnormality||=3 ||cell, nmod-of, anemia||=8 ... ||cell, nmod-of, architecture||=1 ||cell, obj-of, attack||=6 ||cell, obj-of, bludgeon||=1 ||cell, obj-of, call||=11 ||cell, obj-of, come from||=3 ||cell, obj-of, contain||=4 ||cell, obj-of, decorate||=2 ... |R(w1)|+|R(w2)| where S(w) is the set of senses of w in the WordNet, super(c) is the set of (possibly indirect) superclasses of concept c in the WordNet, R(w) is the set of words that belong to a same...
Scientific report: "DISTRIBUTIONAL CLUSTERING OF ENGLISH WORDS" pptx

Uploaded: 17/03/2014, 08:20
... simple tabulation of frequencies of certain words participating in certain configurations, for example of frequencies of pairs of a transitive main verb and the head noun of its direct object, ... some indication of what aspects of distributional relationships may be discovered by clustering. However, we also need to evaluate clustering more rigorously as a basis for models of distributional ... looked at two kinds of measurements of model quality: (i) relative entropy between held-out data and the asymmetric model, and (ii) performance on the task of deciding which of two verbs is...
Document: Technical Overview of Clustering in Windows Server 2003 pdf

Uploaded: 17/12/2013, 13:15
... the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information presented after the date of publication. This document is for informational purposes only. MICROSOFT MAKES ... trademarks of their respective owners. Technical Overview of Clustering in Windows Server 2003 ii Microsoft® Windows® Server 2003 Technical Article • Dynamic Disk Size – If you increase the size of ... in Clustering for Windows Server 2003 9 Technical Overview of Clustering in Windows Server 2003 Microsoft Corporation Published: January 2003 Abstract This white paper summarizes the new clustering...
Document: FUZZY CLUSTERING ALGORITHMS ON LANDSAT IMAGES FOR DETECTION OF WASTE AREAS: A COMPARISON pdf

Uploaded: 16/01/2014, 16:33
... the combination of bands 5, 4 and 1 which is of great efficacy for the aims of our analysis. In Figures 1 and 2 the set of bands 5, 4 and 1 are depicted respectively for the month of May 1994 and ... algorithms for the detection of waste areas using LANDSAT TM images. It is worth noting that the 30 meters spatial resolution of the Landsat-TM sensor makes the process of detecting waste areas ... values of all samples in all clusters; • $m \in (1, \infty)$ is a control parameter of fuzziness. The minimization of $J_m$, under the probabilistic constraint $\sum_{j=1}^{c} u_{jk} = 1$, leads to the iteration of...
Scientific report: "Discriminating image senses by clustering with multimodal features" potx

Uploaded: 08/03/2014, 02:21
... set of outlier images, defined as images whose ratio of distances to the second closest and closest clusters was below a threshold. Word All senses Meta senses Core senses BASS 6 senses 4 senses ... lists the number of senses, median, and range of global cluster purity, followed by the baseline. All senses used the full set of sense labels and 40 clusters. Meta senses merged core senses with ... senses 2 senses Median 0.60 0.73 0.94 Range 0.03 0.02 0.02 Baseline 0.35 0.45 0.55 CRANE 9 senses 6 senses 4 senses Median 0.49 0.65 0.86 Range 0.05 0.07 0.07 Baseline 0.27 0.37 0.50 SQUASH 6 senses...
Scientific report: "Clustering Hungarian Verbs on the Basis of Complementation Patterns" pot

Uploaded: 08/03/2014, 03:20
... another semantic role – cause – in the case of verbs of existence (Table 2). It is important to note that we do not dispose of a preliminary list of semantic roles. To avoid arbitrary 2 Com is ... language analysis. Our further work will emphasize both the refinement of the clustering methods and the linguistic interpretation of the resulting classes. References Anna Babarczy, Bálint Gábor, ... Proceedings of the 3th Hungarian Conference of Computational Linguistics (MSZNY05), pages 20-28, Szeged, Hungary. Michael R. Brent. 1993. From grammar to lexicon: unsupervised learning of lexical...
Scientific report: "SenseClusters: Unsupervised Clustering and Labeling of Similar Contexts" potx

Uploaded: 08/03/2014, 04:22
... portion of the English GigaWord corpus as the source of contexts. While there are many ambiguous names in this data, it is difficult to evaluate the results of our approach given the absence of ... the percentage of the majority class (MAJ.) and count (N) of the total number of contexts for the names or newsgroups. The majority percentage provides a simple baseline for level of performance, ... optimal number of clusters, to avoid setting this value manually. In general all of our results significantly improve upon the majority classifier, which suggests that the clustering of contexts...
Scientific report: "TOWARDS THE AUTOMATIC IDENTIFICATION OF ADJECTIVAL SCALES: CLUSTERING ADJECTIVES ACCORDING TO MEANING" ppt

Uploaded: 08/03/2014, 07:20
... employs two similarity modules, each of which processes a part of the output of stage one and produces a measure of similarity for each possible pair of adjectives. The first module processes ... assessment of the significance of an improvement over the base line of the random algorithm much harder. As a consequence of point (3) made above, we need to understand the significance of the ... in the intensity of temperature of the modified noun (at least when used in their non-metaphorical senses; metaphorical usage of scalar words normally also follows the order of the scale by...
Scientific report: "A STRUCTURED REPRESENTATION OF WORD-SENSES FOR SEMANTIC ANALYSIS" pdf

Uploaded: 09/03/2014, 01:20
... "of" or by the ending "'s", indicates a social relation (SOC_I,~F,|,) as in "the doctor of John" or in "the father of my friend", part-of (PART-OF) ... with. The purpose of this section is to give a brief overview of the text understanding system and its current status of implementation. Figure 1 shows the three modules of the text analyzer. ... study of examples found in the analyzed domain. The final set is a trade-off between two competing requirements: 2. A large number of conceptual relations improves the expressiveness of the...
The Aesthetics of Mixing the Senses potx

Uploaded: 16/03/2014, 18:20
... understanding of the nature of aesthetic experience and in the process elided the potential contribution of the nonvisual senses to the appreciation of pictorial art as of the nonauditory senses to ... activation potential of the senses of sight and smell, and by so doing enriches the field of vision without detracting from the pleasures of olfaction. The modernist monomodal definition of aesthetics ... appreciation of the formal relations intrinsic to a work of art, irrespective of that work’s content. Thus, in one characterization of the proper object of aesthetics, Robert Redfield offered the...
Scientific report: "Unsupervised Part-of-Speech Tagging Employing Efficient Graph Clustering" ppt

Uploaded: 17/03/2014, 04:20
... All of them employ a syntactic version of Harris’ distributional hypothesis: Words of similar parts of speech can be observed in the same syntactic contexts. Contexts in that sense are often ... the immediate neighbourhood of a word. The word’s global context is the sum of all its contexts. For clustering, a similarity measure has to be defined and a clustering algorithm has to be ... the clusters of both partitionings as nodes; weights of edges are the number of common elements, if at least two elements are shared. And again, CW is used to cluster this graph of clusters....
