0

meaningful clustering of senses

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Selecting the “Right” Number of Senses Based on Clustering Criterion Functions" pdf

Báo cáo khoa học

... distribution of P K2’s predictions were most like those of theactual senses. 4 ConclusionsThis paper shows how to use clustering criterionfunctions as a means of automatically selecting thenumber of senses ... (PK1-3).112Selecting the “Right” Number of Senses Based on Clustering Criterion FunctionsTed Pedersen and Anagha KulkarniDepartment of Computer ScienceUniversity of Minnesota, DuluthDuluth, MN 55812 ... determining the number of senses in which an ambiguous word isused in a large corpus. It is based on theuse of global criterion functions that assessthe quality of a clustering solution.1...
  • 4
  • 361
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Inducing Gazetteers for Named Entity Recognition by Large-scale Clustering of Dependency Relations" ppt

Báo cáo khoa học

... semantics of MNswell, the MN clusters constructed by usingdependency relations should serve as a goodgazetteer. However, the high level of computa-tional cost has prevented the use of clustering for ... for storingonly a part of classes Cl, i.e., 1/|P | of the parame-ter matrix, where P is the number of cluster nodes.This data splitting enables linear scalability of mem-ory sizes. However, ... and, in terms of execution speed, may4Acknowledgements: This corpus was provided by Dr.Daisuke Kawahara of NICT.5To be precise, we need two copies of these.6Each node has a copy of the training...
  • 9
  • 428
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Automatic clustering of collocation for detecting practical sense boundary" ppt

Báo cáo khoa học

... number of clusters with each clustering method shown chapter 3 by the part of speech. WC and WF are the average number of senses by the part of speech. In Table 1 and 2, the most clustering ... z Fuzzy clustering (F1, F2)2 (Song, Cao and Bruza, 2003) Used clustering methods cover both the popularity and the variety of the algorithms – soft and hard clustering and graph clustering ... gives the word senses numbered i of the word x. Ix is the word sense indexing function of x that gives an index to each sense of the word x. All contextual words xi±j of a central word...
  • 4
  • 425
  • 0
Báo cáo khoa học: A hybrid clustering of protein binding sites ppt

Báo cáo khoa học: A hybrid clustering of protein binding sites ppt

Báo cáo khoa học

... 5. Number of binding sites contained in clusters as a function of the number of clusters allowed to be used (gp = 1 ⁄ 10). Thecolor coding is given in Table 2.A hybrid clustering of protein ... ‘silhouettevalue’ of a cluster is the smallest possible distancebetween an element of this cluster and an element of the neighboring clusters. The silhouette coefficient of the overall clustering is ... the distance function and clustering algorithm, three main parameters affected theproperties of clustering: OPTICS MINPTS, OPTICScut-off level and gap penalty (gp) of the distance func-tion....
  • 9
  • 229
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Automatic Retrieval and Clustering of Similar Words" potx

Báo cáo khoa học

... Ilcell, subj -of, adapt[l=l Ilcell, subj -of, behavell=l [Icell, pobj -of, in11=159 [[cell, pobj -of, insidell=16 Ilcell, pobj -of, intoll=30 Ilcell, nmod -of, abnormalityll=3 Ilcell, nmod -of, anemiall=8 ... Ilcell, nmod -of, architecturell=l [[cell, obj -of, attackl[=6 [[cell, obj -of, bludgeon[[=l [Icell, obj -of, callll=l 1 Hcell, obj -of, come froml[=3 Ilcell, obj -of, containll 4 Ilcell, obj -of, decoratell=2 ... IR(wx)l+lR(w2)l where S(w) is the set of senses of w in the WordNet, super(c) is the set of (possibly indirect) superclasses of concept c in the WordNet, R(w) is the set of words that belong to a same...
  • 7
  • 322
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "DISTRIBUTIONAL CLUSTERING OF ENGLISH WORDS" pptx

Báo cáo khoa học

... simple tabulation of fre- quencies of certain words participating in certain configurations, for example of frequencies of pairs of a transitive main verb and the head noun of its direct object, ... some indication of what aspects of distributional relationships may be discovered by clustering. However, we also need to evaluate clustering more rigorously as a basis for models of distributional ... looked at two kinds of measurements of model quality: (i) relative en- tropy between held-out data and the asymmetric model, and (ii) performance on the task of decid- ing which of two verbs is...
  • 8
  • 310
  • 0
Tài liệu Technical Overview of Clustering in Windows Server 2003 pdf

Tài liệu Technical Overview of Clustering in Windows Server 2003 pdf

Tin học văn phòng

... the part of Microsoft, and Microsoft cannot guarantee the accuracy of any information presented after the date of publication.This document is for informational purposes only. MICROSOFT MAKES ... trademarks of their respective owners.Technical Overview of Clustering in Windows Server 2003 ii Microsoft® Windows® Server 2003 Technical Article• Dynamic Disk Size – If you increase the size of ... in Clustering for Windows Server 2003 9Technical Overview of Clustering in Windows Server 2003Microsoft CorporationPublished: January 2003AbstractThis white paper summarizes the new clustering...
  • 23
  • 536
  • 1
Tài liệu FUZZY CLUSTERING ALGORITHMS ON LANDSAT IMAGES FOR DETECTION OF WASTE AREAS: A COMPARISON pdf

Tài liệu FUZZY CLUSTERING ALGORITHMS ON LANDSAT IMAGES FOR DETECTION OF WASTE AREAS: A COMPARISON pdf

Tin học văn phòng

... the combination of bands 5, 4 and 1 which is of great efficacy for the aims of our analysis. In Figures 1 and 2 the set of bands 5, 4 and 1 are depicted respectivelyfor the month of May 1994 and ... algorithms for the detection of waste areas using LANDSAT TMimages.It is worth of noting that the 30 meters spatial resolution of the Landsat-TM sensormakes the process of detecting waste areas ... values of all samples in all clusters;• m∈ (1, ∞) is a control parameter of fuzziness.The minimization of Jm, under the probabilistic constraintcj=1ujk= 1, leads to theiteration of...
  • 11
  • 371
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Discriminating image senses by clustering with multimodal features" potx

Báo cáo khoa học

... set of outlier images, de-fined as images whose ratio of distances to the second closestand closest clusters was below a threshold.Word All senses Meta senses Core senses BASS 6 senses 4 senses ... lists the number of senses, median, and range of globalcluster purity, followed by the baseline. All senses used thefull set of sense labels and 40 clusters. Meta senses mergedcore senses with ... senses 2 senses Median 0.60 0.73 0.94Range 0.03 0.02 0.02Baseline 0.35 0.45 0.55CRANE 9 senses 6 senses 4 senses Median 0.49 0.65 0.86Range 0.05 0.07 0.07Baseline 0.27 0.37 0.50SQUASH 6 senses...
  • 8
  • 317
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Clustering Hungarian Verbs on the Basis of Complementation Patterns" pot

Báo cáo khoa học

... an other semantic role – cause –in the case of verbs of existence (Table 2).It is important to note that we do not dispose of apreliminary list of semantic roles. To avoid arbitrary2Com is ... language analysis. Ourfurther work will emphasize both the refinement of the clustering methods and the linguistic interpre-tation of the resulting classes.ReferencesAnna Babarczy, B´alint G´abor, ... Proceed-ings of the 3th Hungarian Conference of Computa-tional Linguistics (MSZNY05), pages 20-28, Szeged,Hungary.Michael R. Brent. 1993. From grammar to lexicon: un-supervised learning of lexical...
  • 6
  • 486
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "SenseClusters: Unsupervised Clustering and Labeling of Similar Contexts" potx

Báo cáo khoa học

... portion of the English Giga-Word corpus as the source of contexts. While thereare many ambiguous names in this data, it is difficultto evaluate the results of our approach given the ab-sence of ... thepercentage of the majority class (MAJ.) and count(N) of the total number of contexts for the namesor newsgroups. The majority percentage provides asimple baseline for level of performance, ... optimalnumber of clusters, to avoid setting this value man-ually.In general all of our results significantly improveupon the majority classifier, which suggests that the clustering of contexts...
  • 4
  • 322
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "TOWARDS THE AUTOMATIC IDENTIFICATION OF ADJECTIVAL SCALES: CLUSTERING ADJECTIVES ACCORDING TO MEANING" ppt

Báo cáo khoa học

... employs two similarity modules, each of which processes a part of the output of stage one and produces a measure of similarity for each possible pair of adjectives. The first module processes ... assessment of the significance of an improvement over the base line of the random algo- rithm much harder. As a consequence of point (3) made above, we need to understand the significance of the ... in the intensity of temperature of the modified noun (at least when used in their non- metaphorical senses; metaphorical usage of scalar words normally also follows the order of the scale by...
  • 11
  • 379
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A STRUCTURED REPRESENTATION OF WORD-SENSESIR OR SEMANTIC ANALYSIS" pdf

Báo cáo khoa học

... " ;of& quot; or by the ending "'s", indicates a social relation (SOC_I,~F,|,) as in "the doctor of John" or in "the father of my friend", part -of (PART -OF) ... with. The purpose of lhis section is to give a brief overview of the text understanding system and its current status of implementatim~. Figure 1 shows the three modules of the text analyzer. ... study of examples found in the analyzed domain. The final set is a trade-off between two competing requirements: 2. A large number of conceptual relations improves the expressiveness of the...
  • 9
  • 358
  • 0
The Aesthetics of Mixing the Senses potx

The Aesthetics of Mixing the Senses potx

Thời trang - Làm đẹp

... understanding of the nature of aesthetic experience and in the process elided the potential contribution of the nonvisual senses to the appreciation of pictorial art as of the nonauditory senses to ... activation potential of the senses of sight and smell, and by so doing enriches the field of vision without detracting from the pleasures of olfaction. The modernist monomodal definition of aesthetics ... appreciation of the formal relations intrinsic to a work of art, irrespective of that work’s content. Thus, in one characterization of the proper object of aesthetics, Robert Redfield offered the...
  • 7
  • 275
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Unsupervised Part-of-Speech Tagging Employing Efficient Graph Clustering" ppt

Báo cáo khoa học

... All of them employ a syntactic version of Harris’ distributional hypothesis: Words of similar parts of speech can be observed in the same syntactic contexts. Contexts in that sense are often ... the immediate neighbourhood of a word. The word’s global context is the sum of all its contexts. For clustering, a similarity measure has to be defined and a clustering algorithm has to be ... the clusters of both partitionings as nodes; weights of edges are the number of common elements, if at least two elements are shared. And again, CW is used to cluster this graph of clusters....
  • 6
  • 352
  • 0

Xem thêm