a wide coverage lexicon

Scientific report: "Enhancing Unlexicalized Parsing Performance using a Wide Coverage Lexicon, Fuzzy Tag-set Mapping, and EM-HMM-based Lexical Probabilities" ppt

Uploaded: 17/03/2014, 22:20
... small annotated corpora, such as the Hebrew treebank (Sima’an et al., 2001). Hebrew has a wide coverage lexicon / morphological-analyzer (henceforth, KC Analyzer) available, but its tagset ... exist a small treebank and a wide-coverage lexical resource. For example, parsing Arabic using the Arabic Treebank and the Buckwalter analyzer, or parsing English biomedical text using a biomedical ... quantities of training data than treebanks can provide, and lexicon-based unsupervised approaches to POS tagging are practically unlimited in the amount of training data they can use. POS taggers...
Scientific report: "Creating a CCGbank and a wide-coverage CCG lexicon for German" pdf

Uploaded: 08/03/2014, 02:21
... on Language Resources and Evaluation (LREC), pages 1974–1981, Las Palmas, Spain, May. Julia Hockenmaier and Mark Steedman. 2002b. Generative models for statistical parsing with Combinatory Categorial Grammar. ... graphs to CCG derivations are extraposed relative clauses, which CCG treats as sentential modifiers with an anaphoric dependency. Arguments that are moved up are marked as extracted, and an additional ... 1994. Formal and Computational Aspects of Natural Language Syntax. Ph.D. thesis, University of Pennsylvania, Philadelphia PA. Libin Shen and Aravind K. Joshi. 2005. Incremental LTAG parsing. In...
Scientific report: "It Makes Sense: A Wide-Coverage Word Sense Disambiguation System for Free Text" docx

Uploaded: 23/03/2014, 16:20
... in a given context. As a fundamental task in natural language processing (NLP), WSD can benefit applications such as machine translation (Chan et al., 2007a; Carpuat and Wu, 2007) and information ... SemEval tasks, the benchmark data sets for WSD. The evaluation on both lexical-sample and all-words tasks measures the accuracy of our IMS system as well as the quality of the training data we have ... English-Chinese parallel corpora: Hong Kong Hansards, Hong Kong News, Hong Kong Laws, Sinorama, Xinhua News, and the English translation of Chinese Treebank. They are all available from the Linguistic Data...
Scientific report: "Lexicon acquisition with a large-coverage unification-based grammar" pot

Uploaded: 17/03/2014, 22:20
... with a grammar of a similar coverage. We have described a way how the information concerning unknown words can be restricted in a grammatically sound way, by the definition of lexical types and ... Germany fouvry@coli.uni-sb.de Abstract We describe how unknown lexical entries are processed in a unification-based framework with large-coverage grammars and how from their usage lexical entries are extracted. ... return a number of alternatives each associated with a probability, so that the parser can decide what will be used in the analysis. Even when the range of alternatives is left wide open...
Document: Scientific report: "A Broad-Coverage Grammar Checker Using Pattern Grammar" doc

Uploaded: 20/02/2014, 05:20
... ESL Assistant are shown for comparison, and grammatical suggestions are underscored. As suggested, lexical and PoS information in learner texts is useful for a grammar checker, pattern grammar ... systems trained on learner data tend to offer high precision but low recall. Broad coverage grammar checkers may be developed using readily available large-scale corpora. To detect and correct ... significant collocations determining the pair of head words in adjacent base phrases, calculating their pair-wise mutual information values, and filtering out candidates with low MI values. Stage...
Document: Scientific report: "RELATING COMPLEXITY TO PRACTICAL PERFORMANCE IN PARSING WITH WIDE-COVERAGE UNIFICATION GRAMMARS" pptx

Uploaded: 20/02/2014, 21:20
... ANLT grammar (Grover, Carroll & Briscoe, 1993), a wide-coverage grammar of English. The grammar is defined in metagrammatical formalism which is compiled into a unification-based 'ob- ... Speech and Natural Language Workshop. 200-203. Pereira, F. & D. Warren (1980) "Definite clause grammars for language analysis - a survey of the formalism and a comparison with aug- ... software and grammars that has occurred nevertheless appears to have caused this to happen automatically. It therefore seems likely that implementational decisions and optimisations based...
Applications of Calorimetry in a Wide Context - Differential Scanning Calorimetry, Isothermal Titration Calorimetry and Microcalorimetry docx

Uploaded: 06/03/2014, 22:20
... P.V. Dhanaraj, N.P. Rajesh, Jose C. Martinez, Javier Murciano-Calles, Eva S. Cobos, Manuel Iglesias-Bexiga, Irene Luque, Javier Ruiz-Sanz, Diana Romanini, Mauricio Javier Braia, María Cecilia Porfiri, ... Noemi E. Zaritzky, Pratima Parashar, Luis Alberto Alcazar-Vara, Eduardo Buenrostro-Gonzalez, W. Steinmann, S. Walter, M. Beckers, G. Seide, T. Gries, Eliane Lopes Rosado, Vanessa Chaia Kaippert, ... Stefka G. Taneva, Sonia Bañuelos, María A. Urbaneja, Amal A. Elkordy, Robert T. Forbes, Brian W. Barry, Laura T. Rodriguez Furlán, Javier Lecot, Antonio Pérez Padilla, Mercedes E. Campderrós,...
Scientific report: "A Broad-Coverage Normalization System for Social Media Language" pot

Uploaded: 07/03/2014, 18:20
... over 90% word-coverage across all data sets (a 10% absolute increase compared to state-of-the-art); the broad word-coverage can also successfully translate into message-level performance gain, yielding ... of candidates and the broad word-coverage can be successfully translated into message-level performance gain. In addition, our system requires no human annotations, therefore can be easily adapted ... machine transla- ... References Kevin Atkinson. 2006. Gnu aspell. http://aspell.net/. AiTi Aw, Min Zhang, Juan Xiao, and Jian Su. 2006. A phrase-based statistical model for sms text normalization....
Scientific report: "An Open-License Broad Coverage Lexicon" doc

Uploaded: 07/03/2014, 22:20
... Menlo Park, CA. AAAI Press. Kipper, Karin, Hoa Trang Dang, and Martha Palmer. 2000. Class-Based Construction of a Verb Lexicon. In AAAI-2000 Seventeenth National Conference on Artificial Intelligence, ... Catherine, Ralph Grishman, and Adam Meyers. 1994. Creating a Common Syntactic Dictionary of English. Presented at SNLR: International Workshop on Sharable Natural Language Resources, Nara, Japan. ... is handmade and contains 38,000 lemmas. It represents words in feature value lists that contain lexical data such as part of speech, agreement information, and syntactic frame participation...
Scientific report: "Long-Distance Dependency Resolution in Automatically Acquired Wide-Coverage PCFG-Based LFG Approximations" pptx

Uploaded: 08/03/2014, 04:22
... 79% coverage (full parse) and 21% fragment/skimmed parses. By the same measure, full parse coverage is around 99% for our automatically acquired PCFG-based LFG approximations. Against the PARC ... GF, GF:CFG category pair- as well as CFG category-based subcategorisation frames and associates conditional probabilities with frames. Given a lemma l and an argument list s, the probability of ... constructed hand-crafted XTAG resources. In contrast, we acquire our resources from treebanks and achieve substantially wider coverage. Our approach provides wide-coverage, robust, and – with the addition...
Scientific report: "Inducing a Semantically Annotated Lexicon via EM-Based Clustering" doc

Uploaded: 08/03/2014, 06:20
... consisting of a class label, a selecting head, a grammatical relation, and a filler head. The class label is treated as hidden data in the EM-framework for statistical estimation. 2 EM-Based Clustering ... potential application in many areas. 5. The method is applicable to any natural language where text samples of sufficient size, computational morphology, and a robust parser capable of extracting ... increase.as:s increase.aso:o fall.as:s pay.aso:o reduce.aso:o rise.as:s exceed.aso:o exceed.aso:s affect.aso:o grow.as:s include.aso:s reach.aso:s decline.as:s lose.aso:o act.aso:s...
Scientific report: "Development and Evaluation of a Broad-Coverage Probabilistic Grammar of English-Language Computer Manuals" pdf

Uploaded: 08/03/2014, 07:20
... feature-based grammar to the nonterminal labels of the treebank grammar. For example, our grammar maintains a fairly large number of semantic classes of singular nouns, and it is natural ... the feature bundle all of whose features are variable, and with a decreasing number of variable features occurring as a branch is traced from root to leaf. To find the mnemonic .A4 (A) assigned ... grammar are shown first, and the Lancaster categories to which they map are shown second: The first case above is straightforward: our prepositional-phrase category maps to Lancaster's....
Scientific report: "A TOOLKIT FOR LEXICON BUILDING" pdf

Uploaded: 08/03/2014, 18:20
... lexical item "part", by the relational arcs Broca's area T part brain M part which say in effect that Broca's area is a kind of part, specifically a "brain-part." ... [aphasia may be associated with x] apraxia _CAUSE [x is a cause of aphasia] injury lesion NNABLE [aphasia is the inability to do x] speech language Figure 3. Lexical entry for "aphasia" ... arise in a very elaborate network such as that generated from a large dictionary, we have replaced the separate modification and taxonomy arcs with a single, ternary relational arc that keeps...
Scientific report: "Towards a Self-Extending Lexicon" doc

Uploaded: 08/03/2014, 18:20
... ?x:person take:verb ?y:person ?z:location Pattern and goal mismatch ?x:person take:verb David's physically transferring Goliath to a location fails since (1) a location is not found and (2) ... (2) the action does not match David's goals. If these two failures are encountered, then a new phrase is created. In absence of a better alternative, RINA initially generates David ... containing information about the case and pertaining to: (a) its syntactic appearance (b) its semantic concept and (c) its phrase role: agent, patient. Variable identifiers (e.g., ?x, ?y) are used...
Mariposa: a wide-area distributed database system doc

Uploaded: 23/03/2014, 12:20
... the actual metadata to see if it has changed. The quality of service is then a measurement of the metadata’s rate of update, as well as the name server’s rate of update. 6 Mariposa status and ... execution plans that can adapt to the environment (such as unbalanced workload and poor data placement) in a flexible manner. We are implementing more sophisticated features and plan a general release ... price of scanning the smaller table, R2, remotely from Santa Barbara is less than that of scanning R1 remotely from Berkeley; as a result, Santa Barbara offers a lower bid. Similarly, San Diego...