... words previously seen in the training corpus, and therefore their overall coverage is not 100%. Starting with an annotated corpus consisting of all annotated files in SemCor, a separate training ... Minimally supervised word sense disambiguation for all words in open text. In Proceedings of ACL/SIGLEX Senseval-3, Barcelona, Spain, July. R. Mihalcea and D. Moldovan. 2002. Pattern learning and ... advantage of providing larger coverage. In this paper, we present a method for solving the semantic ambiguity of all content words in a text. The algorithm can be thought of as a minimally supervised word...
... which includes all the coordinated noun phrases (in this case John and Bill). We will detect the coordination of noun phrases from the SS returned by the SUG fact coordinated. In one- 4 In ... principles of reasoning with uncertainty: e.g. Connoly (1994) and Mitkov (1997). Our system can be included into the first approach. In these integrated approaches the semantic and domain ... worked on different texts (Spanish texts). apply a partial parsing and we deal with other kinds of anaphors. As a future aim we will include semantic information in our algorithm in order to check...
... Vector Space Model (VSM) in text information processing, document indexing (term extraction) acts as a prerequisite step in most text information processing tasks such as Information Retrieval ... Intelligent Text Processing (CICLing 2003), 602-614. Yuan Liu, Nanyuan Liang. 1986. Basic Engineering for Chinese Processing – Contemporary Chinese Words Frequency Count, Journal of Chinese Information ... Tsinghua University, Beijing 100084, China lijingyang@gmail.com sms@tsinghua.edu.cn kevinn9@gmail.com Abstract Words and character-bigrams are both used as features in Chinese text...
... the nominal sense of the words is not. For instance, “Lễ xem mặt” is translated into “Looking at the face” as in: “The first would be an introduction ceremony called Lễ xem mặt (Looking at ... environment. All in all, languages tend to have a superordinate but lack hyponyms since each language makes only those distinctions in meaning which seem relevant to its particular environment. In English, ... involving introducing Vietnamese culture to the world, which will bring about the indispensable task of translating Vietnamese authentic words of culture into English. His greatest of all work,...
... but in a new context. Unfamiliar words cause many difficulties in reading comprehension, such as slowing reading speed, interrupting the reading process, and reducing interest in reading. ... participants in a particular communicative situation”. (Roy Harris in Rethinking Writing, 2000) “Reading is asking questions of printed text. And reading with comprehension becomes a matter of getting ... Considering the sentences and surrounding ones, students can know the meaning of the same words presented. In any case, a word’s context is important in terms of understanding its meaning and...
... [figure labels: N = 2048, N = 128, N = 64] ... Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP, pages 324–332, Suntec, Singapore, 2-7 August 2009. © 2009 ACL and AFNLP ...
... available in a straightforward manner the large amount of structured text in Wikipedia (e.g. for building a language model), as well as its rich internal link structure (e.g. the links between ... Evaluating WordNet-based measures of semantic distance. Computational Linguistics, 32(1). Finkelstein, L., E. Gabrilovich, Y. Matias, E. Rivlin, Z. Solan, G. Wolfman & E. Ruppin (2002). Placing ... (1995). Using information content to evaluate semantic similarity in a taxonomy. In Proc. of IJCAI-95, Vol. 1, pp. 448–453. Seco, N., T. Veale & J. Hayes (2004). An intrinsic information content...
... 2000. Mapping wordnets using structural information. In Proceedings of the 38th Annual Meeting of the Association for Computational Linguistics, Hong Kong. Finding Predominant Word Senses in Untagged Text Diana ... BNC text represents imaginative writing, the remaining 80% being classified as informative. sense according to SemCor. This seems intuitive given our expected relative usage of these senses in modern ... baseline for choosing the predominant sense over all these words ( ) is 32%. Both WordNet similarity measures beat this baseline. The random baseline for ( ) is 24%. Again, the automatic ranking...
... Cambridge, MA. ... Computational Linguistics, pages 19-24. Shona Douglas, Matthew Hurst, and David Quinn. 1995. Using natural language processing for identifying and interpreting tables in plain text. In Fourth ... blank lines or delimiters that immediately precede or follow a table within an input text. In this paper, we assume that our input texts are plain texts that do not contain any formatting codes,...
... occurring in their corpus of compounds. For each category pair, they manually examined 20% of the compounds falling under that category pair, paraphrasing the relation between the nouns in that ... Computational Linguistics Using WordNet to Automatically Deduce Relations between Words in Noun-Noun Compounds Fintan J. Costello, School of Computer Science, University College Dublin, Dublin 6, Ireland. fintan.costello@ucd.ie Tony ... disambiguation algorithm, one aim in constructing this corpus was to examine the relations that exist in naturally occurring noun-noun compounds. This follows from existing research on the relations...
... manually annotated markables in four dialogs from the core data set, while testing was performed on the automatically detected chunks in the remaining fifth dialog. For training and testing, ... anaphoric chains in the core data set, broken down according to the type of the chain-initial antecedent. The rare type OTHER mainly contains adjectival antecedents. More than 75% of all chains consist ... features both during training and testing. We first ran a simple baseline system which resolved pronouns to their most recent compatible antecedent, applying the same settings and constraints 9 http://www.cs.waikato.ac.nz/ml/weka/ 10 The...