0

teaching a weaker classifier

Báo cáo khoa học:

Báo cáo khoa học: " Teaching a Weaker Classifier: Named Entity Recognition on Upper Case Text" docx

Báo cáo khoa học

... easily applicable.This way of teaching a weaker classifier can alsobe used in other domains, where the task is to in-fer, and an abundance of unlabeled datais available. If one possesses a ... sets areconditionally independent of each other. Each set offeatures can be used to build a classifier, resulting intwo independent classifiers, A and B. Classificationsby A on unlabeled data can ... other tasks such as part-of-speech tag-ging, where case information is helpful. With theabundance of unlabeled text available, such an ap-proach requires no additional annotation effort, andhence...
  • 8
  • 285
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Scalable Probabilistic Classifier for Language Modeling" pdf

Báo cáo khoa học

... Ducharme, P. Vincent, and C. Jauvin. 2003. A Neural Probabilistic Language Model. Journal ofMachine Learning Research, 3:1137–1155. A. Berger, V. Della Pietra, and S. Della Pietra. 1996. A Maximum ... CategorizationResearch. Journal of Machine Learning Research,5:361–397. A. Mnih and G. Hinton. 2008. A Scalable HierarchicalDistributed Language Model. In Advances in NeuralInformation Processing ... Discrimi-native n-gram Language Modeling. Computer, Speechand Language, 21:373–392.R. Rosenfeld. 1994. Adaptive Statistical Language Mod-elling: A Maximum Entropy Approach. Ph.D. thesis,Carnegie...
  • 6
  • 350
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Chain-starting Classifier of Definite NPs in Spanish" docx

Báo cáo khoa học

... a cate-gory that, although it did originally have a seman-tic meaning of “identifiability”, has increased itsrange of contexts so that it is often a grammati-cal rather than a semantic category ... AnCora– Annotated Corpora for Spanish and Catalan(Taule et al., 2008), developed at the Universityof Barcelona and freely available from http://clic.ub.edu/ancora. AnCora-Es is a half-million-word ... mentions.Given that chain starting is the majority classand following (Ng and Cardie, 2002), we took the“one class” classification as a naive baseline: allinstances were classified as chain starting,...
  • 8
  • 322
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Taxonomy, Dataset, and Classifier for Automatic Noun Compound Interpretation" potx

Báo cáo khoa học

... it by farthe largest hand-annotated compound noun datasetin existence that we are aware of. Proper nounswere not included.The next largest available datasets have a vari-ety of drawbacks for ... interpreta-tion in general text. Kim and Baldwin’s (2005)dataset is the second largest available dataset, butinter-annotator agreement was only 52.3%, andthe annotations had an usually lopsided ... Linguistics.Berger, A. , S. A. Della Pietra, and V. J. Della Pietra.1996. A Maximum Entropy Approach to NaturalLanguage Processing. Computational Linguistics22:39-71.Brants, T. and A. Franz. 2006....
  • 10
  • 475
  • 0
UNIVERSITY TEACHER’S CONCEPTUALIZATION OF TASK-BASED TEACHING: A CASE study IN taybac university

UNIVERSITY TEACHER’S CONCEPTUALIZATION OF TASK-BASED TEACHING: A CASE study IN taybac university

Thạc sĩ - Cao học

... Communicative Approach and the Natural Approach are based on this view. The interactional view sees language primarily as a means for establishing and maintaining interpersonal relations and for ... are available only in target language, and the necessary materials can only be obtained if they ask in target language, such activities stimulate a natural need to understand and use it. Many ... towards, task-based language teaching? 2. To what extent do their conceptualizations match the composite view of task-based language teaching? 3. How do they implement task-based language teaching...
  • 105
  • 568
  • 1
Tài liệu Báo cáo khoa học: Structural insights into the substrate specificity and activity of ervatamins, the papain-like cysteine proteases from a tropical plant, Ervatamia coronaria ppt

Tài liệu Báo cáo khoa học: Structural insights into the substrate specificity and activity of ervatamins, the papain-like cysteine proteases from a tropical plant, Ervatamia coronaria ppt

Báo cáo khoa học

... coronariaRaka Ghosh, Sibani Chakraborty, Chandana Chakrabarti, Jiban Kanti Dattagupta andSampa BiswasCrystallography and Molecular Biology Division, Saha Institute of Nuclear Physics, Kolkata, ... acidat a particular position for this family of plant cysteine pro-teases. The primers used were 5¢-TTGCCTGAGCA TGTTGATTGGAGAGCGA AAG-3 ¢ (forward) and 5¢-GGGATAATAAGGTAATCTAGTGATTCCAC-3¢ ... S, Sundd M, Jagan-nadham MV & Dattagupta JK (1999) Crystallizationand preliminary X-ray analysis of ervatamin B and C,two thiol proteases from Ervatamia coronaria. ActaCrystallogr D 55,...
  • 14
  • 634
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Creating Robust Supervised Classifiers via Web-Scale N-gram Data" pdf

Báo cáo khoa học

... Workshopon Natural Language Generation.Natalia N. Modjeska, Katja Markert, and Malvina Nis-sim. 2003. Using the Web in machine learning forother-anaphora resolution. In EMNLP.Preslav Nakov and Marti ... bedrag-gled 56-year-old [professor]. Also, in a particu-lar domain, words may have a non-standard usage.Systems trained on labeled data can learn the do-main usage and leverage other regularities, ... Linking Biological Lit-erature, Ontologies and Databases.Mirella Lapata and Frank Keller. 2005. Web-basedmodels for natural language processing. ACMTransactions on Speech and Language Processing,2(1):1–31.Mark...
  • 10
  • 359
  • 0
Báo cáo khoa học:

Báo cáo khoa học: " A Tool for Error Analysis of Machine Translation Output" doc

Báo cáo khoa học

... machine translationevaluation. Machine Translation, 17(1):43–75.Masaki Murata, Kiyotaka Uchimoto, Qing Ma, ToshiyukiKanamaru, and Hitoshi Isahara. 2005. Analysis ofmachine translation systems’ ... graphical toolfor performing human error analysis, from any MTsystem and for any language pair. BLAST has a graphical user interface, and is designed to be easy1The BiLingual Annotation/Annotator/Analysis ... annotations.BLAST can handle two types of annotations: er-ror annotations and support annotations. Error an-notations are based on a hierarchical error typology,and are used to annotate errors...
  • 6
  • 479
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Towards a Semantic Classification of Spanish Verbs Based on Subcategorisation Information" doc

Báo cáo khoa học

... in Data - An Introduction toCluster Analysis. Probability and MathematicalStatistics. Jonh Wiley and Sons, Inc., New York.Anna Korhonen. 200 2a. Semantically motivatedsubcategorization acquisition. ... noisy fea-tures. In Proceedings of the Seventh Conferenceon Natural Language Learning (CoNLL-2003),page , Edmonton/Canada.Gloria V´azquez, Ana Fern´andez, Irene Castell´on,and M. Antonia Mart´ı. ... prepositions are also taken into accountas part of the subcategorisation frame types.Adapting a methodology that has been thoughtfor English presents a few problems, because En-glish is a language...
  • 6
  • 418
  • 0
SCOP: A Structural Classification of Proteins Database for the Investigation of Sequences and Structures ppt

SCOP: A Structural Classification of Proteins Database for the Investigation of Sequences and Structures ppt

Cơ sở dữ liệu

... spectroscopymeans that there is now a large and rapidly growingcorpus of information available. At present (January,1995) the Brookhaven Protein Databank (PDB,(Abola et al., 1987)) contains 3091 ... treated as a whole. The domains in large proteins are usuallyclassified individually.The classification is on hierarchical levels thatembody the evolutionary and structural relation-ships.FAMILY. ... to use local copies of PDB files if they are available. Equivalent WWW browsers,image-display programs and molecular viewers are also available free for Windows-PC and Macintosh platforms.JMB—MS...
  • 5
  • 546
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Discriminative Classifiers for Deterministic Dependency Parsing" docx

Báo cáo khoa học

... varying com-plexity, with a separate optimization of learningalgorithm parameters for each combination of lan-guage and feature model. The central importanceof feature selection and parameter ... space of possible parses(Taskar et al., 2004; McDonald et al., 2005). A radically different approach is to performdisambiguation deterministically, using a greedyparsing algorithm that approximates ... Chapter of the As-sociation for Computational Linguistics (NAACL),pages 132–139.Yuchang Cheng, Masayuki Asahara, and Yuji Mat-sumoto. 200 5a. Chinese deterministic dependencyanalyzer: Examining...
  • 8
  • 238
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Practical Classification of Multiword Expressions" pdf

Báo cáo khoa học

... simul-taneously with syntactic analysis.3 RationaleThe above classification was formulated during anexamination of the available formalisms for encod-ing multiword expressions, which was a part ... use a powerful formalism (cf. the example in (9)).Our analysis revealed that IDAREX, which is a simple formalism based on regular grammars, isnot appropriate for handling expressions that have ... a subclass that allows passivization, another one thatallows nominalization and subject-verb inversion,etc.The problem with this approach is that it leadsto a proliferation of classes. At least in...
  • 6
  • 431
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Bootstrapped Training of Event Extraction Classifiers" ppt

Báo cáo khoa học

... story(i.e., an article that primarily discusses the detailsof a domain-relevant event). Documents that areclassified as event narratives warrant additionalscrutiny because they most likely contain a ... previous example,tsunami will not be extracted as a weapon becauseit has an incompatible semantic class (EVENT),but bomb will be extracted because it has a com-patible semantic class (WEAPON).We ... lookup). Each pattern is thenmatched against the unannotated texts, and if theextracted noun phrase satisfies its semantic con-straints, then the noun phrase is automatically la-beled as a role...
  • 10
  • 283
  • 0
Bí mật của một trí nhớ   siêu phàm   Secrets of a Super Memory Eran Katz

Bí mật của một trí nhớ siêu phàm Secrets of a Super Memory Eran Katz

Kỹ năng tư duy

... chúng ta đặt ch a kh a xuống trong khi đang nghĩ đến chuyện khác. Chúng ta đang v a tự hỏi xem phải mang theo thứ gì v a triền miên suy nghĩ về điều đang băn khoăn.Chúng ta đã để ch a kh a trên ... thoại reo vang. Michelle đi nghe điện thoại. Cô bắt đầu độc thoại về anh trai c a mình một cách say s a. Anh ấy v a đi công tác về và đi quên mua cho cô ấy chiếc máy fax như đã h a. Không để ... nhóm có cùng đặc điểm như sau:Sản phẩm làm từ s a: s a, s a chua, bơ (ba sản phẩm). Rau quả: cà chua, ớt, cà rốt (ba sản phẩm).Thịt: bánh hamburger, lườn gà (hai sản phẩm).Một đồng nghiệp...
  • 180
  • 2,080
  • 4

Xem thêm