0

a reliably annotated corpus

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Liars and Saviors in a Sentiment Annotated Corpus of Comments to Political debates" pdf

Báo cáo khoa học

... 564–568,Portland, Oregon, June 19-24, 2011.c2011 Association for Computational LinguisticsLiars and Saviors in a Sentiment Annotated Corpus of Comments to Political debates Paula Carvalho ... Natu-ral Language Processing and Computational Natural Language Learning, Prague. Krippendorff, Klaus. 2004. Content Analysis: An Intro-duction to Its Methodology, 2nd Edition. Sage Publi-cations, ... sentiments for a variety of topics and corresponding targets are potentially involved (Riloff and Wiebe., 2003; Sarmento et al., 2009). Alternative approaches to automatic and manual construction...
  • 5
  • 499
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "WebCAGe – A Web-Harvested Corpus Annotated with GermaNet Senses" docx

Báo cáo khoa học

... hand-crafted sense -annotated corpora have been available (Agirre et al., 2007;Erk and Strapparava, 2012; Mihalcea et al., 2004),while WSD research for languages that lack thesecorpora has lagged behind ... representative examples in Yarowsky’s ap-proach is performed completely manually and istherefore limited to the amount of data that canreasonably be annotated by hand.Leacock et al. (1998), Agirre ... the 3rd In-ternational Language Resources and Evaluation(LREC’02), Las Palmas, Canary Islands, pp. 609–612Santamar´ a, C., Gonzalo, J., Verdejo, F. 2003. Au-tomatic Association of Web Directories...
  • 10
  • 419
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Using an Annotated Corpus as a Stochastic Grammar" ppt

Báo cáo khoa học

... ~ A may be 40 M. Marcus, 1991. "Very Large Annotated Database of America~ English". DARPA Speech and Naawal Language Workshop, ~ Grove, Morgan Kaufmarm. F. Pereira and Y. Schabes, ... VB Amsterdam rens@alf.leLuva.nl Abstract In Data Oriented Parsing (DOP), an annotated corpus is used as a stochastic grammar. An input string is parsed by combining subtrees from the corpus. ... expect a higher accuracy if the corpus is further enlarged. 6 Conclusions and Future Research We have presented a language model that uses an annotated corpus as a stochastic grammar. We...
  • 8
  • 393
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Test Collection Selection and Gold Standard Generation for a Multiply-Annotated Opinion Corpus" potx

Báo cáo khoa học

... also annotated. The details of this corpus are shown in Table 1. Topics Documents SentencesQuantity 32 843 11,907 Table 1. Corpus size 3 Analysis of Annotated Corpus As mentioned, each ... strict and lenient met-rics are also applied in annotations of relevance. 4.2 High agreement To see how the generated gold standards agree with the annotations of all annotators, we analyze ... gold standard; for the lenient metric, sentences with annotations agreed by at least two annotators are selected as the testing collection and the major-ity of annotations are treated as the...
  • 4
  • 418
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Collecting a Why-question corpus for development and evaluation of an automatic QA-system" pdf

Báo cáo khoa học

... each paid reward.• Qualifications To improve the data quality, a HIT can also be attached to certain tests,“qualifications” that are either system-providedor created by the requester. An example ... the assign-ments have been completed.• Rewards At upload time, each HIT has to beassigned a fixed reward, that cannot be changedlater. Minimum reward is $0.01. Amazon.comcollects a 10% (or a ... excess of information. FAQ-pages tend to alsoanswer questions which are not asked, and also con-tain practical examples. Human-powered answersoften contain unrelated information and discourse-like...
  • 9
  • 610
  • 1
The Proposition Bank: An Annotated Corpus of Semantic Roles pdf

The Proposition Bank: An Annotated Corpus of Semantic Roles pdf

Ngân hàng - Tín dụng

... created and annotation disagreements were adju-dicated by a small team of highly trained linguists: Paul Kingsbury created the framesfiles and managed the annotators, and Olga Babko-Malaya checked ... 2004.Palmer, Martha, Olga Babko-Malaya, andHoa Trang Dang. 2004. Different sensegranularities for different applications. InSecond Workshop on Scalable NaturalLanguage Understanding Systems at ... Douglas Appelt, John Bear,David Israel, Megumi Kameyama,Mark E. Stickel, and Mabry Tyson. 1997.FASTUS: A cascaded finite-statetransducer for extracting informationfrom natural-language text....
  • 36
  • 269
  • 0
Báo cáo khoa học:

Báo cáo khoa học: " a Movie Dialogue Corpus for Research and Development" potx

Báo cáo khoa học

... Seve-ral factors, such as the availability of more power-ful computers, an almost unlimited storage ca-pacity, the availability of large volumes of data in digital format, as well as the ... dialogue management and natural language generation. Springer. Stallard D (2000) Talk’n’travel: a conversational system for air travel planning. In Proceedings of the 6th Conference on Applied ... hand, contain all additional information/texts appearing in the scripts, which are typically of narrative nature and explain what is happening in the scene. Figure 1 depicts a browser snapshot...
  • 5
  • 424
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Creating a Corpus of Parse-Annotated Questions" docx

Báo cáo khoa học

... datarepeatParse a new section of raw dataManually correct errors in the parser outputAdd the corrected data to the training setExtract a new grammar for the parseruntil All the data has been processedAlgorithm ... ofPennsylvania, Philadelphia, PA.Daniel Gildea. 2001. Corpus variation and parser perfor-mance. In Lillian Lee and Donna Harman, editors, Pro-ceedings of EMNLP, pages 167–202, Pittsburgh, PA.Charles ... can be rapidly induced from appropri-ate treebank material. However, treebank- andmachine learning-based grammatical resources re-flect the characteristics of the training data. Theygenerally...
  • 8
  • 405
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Creating a manually error-tagged and shallow-parsed learner corpus" pptx

Báo cáo khoa học

... Computational LinguisticsCreating a manually error-tagged and shallow-parsed learner corpus Ryo NagataKonan University8-9-1 Okamoto,Kobe 658-0072 Japanrnagata @ konan-u.ac.jp.Edward Whittaker ... 44th Annual Meeting of ACL, pages 241–248.Katsuaki Okihara. 1985. English writing (in Japanese).Taishukan, Tokyo.Alla Rozovskaya and Dan Roth. 201 0a. Annotating ESLerrors: Challenges and rewords. ... Vera SheinmanThe Japan Institute forEducational Measurement Inc.3-2-4 Kita-Aoyama, Tokyo, 107-0061 Japanwhittaker,sheinman @jiem.co.jpAbstractThe availability of learner corpora, especiallythose...
  • 10
  • 467
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "ModelTalker Voice Recorder – An Interface System for Recording a Corpus of Speech for Synthesis" ppt

Báo cáo khoa học

... pitch, amplitude and pronuncia-tion and users are given immediate feedback on the acceptability of each recording. Users can then rerecord an unacceptable utterance. Recordings are automatically ... utterance. This alignment is retained so that each utterance is automatically labeled. Once the entire corpus has been recorded, alignments are automatically refined based on specific individual ... naturalness and individuality one associates with one’s own voice. Individuals with difficulty speak-ing can be any age, gender, and from any part of the country, with regional dialects and...
  • 4
  • 419
  • 0

Xem thêm