... portion of the English Giga-Word corpus as the source of contexts. While thereare many ambiguousnames in this data, it is difficultto evaluate the results of our approach given the ab-sence of ... a disambiguated version of the text. Thus,we automatically create ambiguousnames by con-flating the occurrences associated with two or threerelatively unambiguous names into a single obfus-cated ... thepercentage of the majority class (MAJ.) and count(N) of the total number of contexts for the names or newsgroups. The majority percentage provides asimple baseline for level of performance,...
... t)SNPPRPVPVBDNPSBARINSNNPVPVBDNPPPPRP INNPNNGoal SourceTheme TargetNPHe heardthe sound of liquid slurping in a metal containeras approached him from behindFarrell
... the C-C corpus, out of the total of 4,507 characters, only 776 of them are for surnames. It is interesting to find that female given names are represented by a smaller set of characters than ... given names (see Table 5), hard decision of gender had led to deterioration in MRR performance of the male names compared to the case where no prior information was assumed. Soft decision of ... the original pattern ofnames in Eastern Asia such as China, Korea and Vietnam, in which a limited number of characters4 are used for surnames while those for given names are less restrictive....
... first names, which ap-pear before middle names, which in turn appearbefore surnames, etc. Similarly, many company names end in fixed phrases such as Inc. Herewe think of first names as a kind of ... grammar based onthe insights of the PCFG encoding of LDA topicmodels that learns some of the structure of proper names. The key idea is that elements in proper names typically appear in a fixed ... expansion to provideanalyses for proper names that don’t fit either of the first two expansions).We extracted all of the proper names (i.e.,phrases of category NNP and NNPS) in the PennWSJ...
... examined the expression status of several of theidentified proteins using western blotting. These repre-sentative proteins were selected based on changes of more than twofold in in their expression ... extraction with 200 lLof25mmtriethylammonium bicarbonate, 200 lL of 0.1% (v ⁄ v)trifluoroacetic acid in water, 200 lL of 0.1% (v ⁄ v) trifluo-roacetic acid in acetonitrile and 200 lL of 100% acetoni-trile. ... functions of the identified proteins canprovide clues about their roles in the pathogenesis of CRC. In general, factors that contribute to the patho-genesis of CRC include the accumulation of mutationsand...
... high labeling accuracy when em-ploying each of these parsing alternatives? Each of these four questions is addressed in the four subsequent sections of this paper, fol-lowed by a discussion of ... difficult research problems. Just as the Penn Treebank offers the possibility of developing systems capable of accurate syntactic parsing, corpora of semantic role annotations open up new possibilities ... dependency parser (Lin, 1998)? 2. Given that each of these alternatives creates a different formulation of the parse tree of a sen-tence, which of them encodes branches that are easiest to align...
... pair consisting of the left and right neighbor of a particular token is characteristic of the part of speech at this position, and by clustering the neighbor pairs on the basis of their middle ... context of cognition and child lan-guage has been proposed by Mintz (2003), is that words of a particular part of speech often have the same left and right neighbors, i.e. a pair of such neighbors ... instead of global co-occurrence vec-tors. As can be seen from human performance, in almost all cases the local context of a syntactically ambiguous word is sufficient to disambiguate its part of...
... Systematic names are those expressed in terms of the official nomenclature, whereas trivial terms are usual designations for them. Semi-systematic names are a combination of trivial or class names ... Correct names: names that represent theoretically possible chemical compounds written according to the IUPAC Official Nomenclature Rules (IUPAC-ONR); • Inadequate names: names that, in despite of ... or not the name respects the current official nomenclature. This capacity of treating even names which, in spite of do not respect the constraints of the official nomenclatures, correspond to...
... underspecifying names do not need to be re-solved but denote a set of compounds, analogousto class names. The particularities of chemical compound names mentioned above, namely synonymy, class names, ... underspecifying names and interaction be-tween morpheme’s meanings, complicate auto-matic classification and mapping of the names. To achieve mapping of synonymous chemicalcompound names, name ... ProcessingUniversity of StuttgartAzenbergstr. 1270174 Stuttgart, Germanyengelken@eml-research.deAbstractMapping and classification of chemicalcompound names are important aspects of the tasks of BioNLP....
... po-tential applications of role labeling may require cor-rect labelingof all (or at least the core) argumentsin a sentence in order to be effective, and partiallycorrect labelings may not be ... 3.1.The labelingof the tree in Figure 1 is a specificexample of the kind of errors fixed by the joint mod-els. The local classifier labeled the first argument inthe tree as ARG0 instead of ARG1, ... labels of the semantic argu-ment nodes of a verb. A drawback of local modelsis that, when they decide the label of a parse treenode, they cannot use information about the labelsand features of...
... number of features to rep-resent various aspects of the syntactic structure of a pair of arguments. All features are listed in Table1. The Path features are designed as a sequentialcollection of ... aj, such as labeling ai ajas ””, identification of thematic rankcan be formulated as a classification problem. De-lemma, POS Tag, voice, and SCF of predicatecategories, position of two arguments; ... referenced argument of ai, 4) aiRAaj: aiisthe referenced argument of aj, 5) aiACaj: ajisthe continuation argument of ai, 6) aiCAaj: aiisthe continuation argument of aj, 7)...