0

quantitative and qualitative evaluation

Báo cáo khoa học:

Báo cáo khoa học: "Quantitative and Qualitative Evaluation of Darpa Communicator Spoken Dialogue Systems" pdf

Báo cáo khoa học

... questions about the user’s travel plans bothat the beginning of the dialogue and also after Quantitative and Qualitative Evaluation of Darpa CommunicatorSpoken Dialogue SystemsMarilyn A. WalkerAT&T ... and that therewere four groups of performers with sites 3,2,1,4in the top group (listed by average user satisfac-tion), sites 4,5,9,6 in a second group, and sites 8 and 7 defining a third and ... standard and tools were used by allsites to collect a set of core metrics for makingcross system comparisons. The core metrics weredeveloped during a workshop of the Evaluation Committee and...
  • 8
  • 319
  • 0
Báo cáo

Báo cáo " Language program evaluation: Quantitative or qualitative approach? " pptx

Báo cáo khoa học

... reality is objective and detached from the observers, and that this reality can be Tạp chí Khoa học ĐHQGHN, Ngoại ngữ 24 (2008) 1-6 1 Language program evaluation: Quantitative or qualitative approach? ... approaches: positivistic /quantitative and naturalistic /qualitative. This article will attempt to review these two major paradigms by (i) giving the definition of each paradigm and presenting its logic ... evaluators want to achieve in the evaluation process. However, evaluators have to rely on either quantitative or qualitative approach which has its own strengths and weaknesses. The researchers...
  • 6
  • 169
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "You Can’t Beat Frequency (Unless You Use Linguistic Knowledge) – A Qualitative Evaluation of Association Measures for Collocation and Term Extraction" pot

Báo cáo khoa học

... for CE (Wermter and Hahn, 2004) and for ATR (Wermter and Hahn, 2005), which havebeen shown to outperform several of the statistics-only metrics.3 Methods and Experiments3.1 Qualitative CriteriaBecause ... best-performing statistics-only measure for CE (cf. Evert and Krenn (2001) and Krenn and Evert (2001)) and also for ATR (seeWermter and Hahn (2005)).Concerning more recent linguistically groundedAMs, ... conditions.Several studies (e.g., Evert and Krenn (2001),Krenn and Evert (2001), Frantzi et al. (2000),Wermter and Hahn (2004)), however, have al-ready observed that ranking the candidates merelyby their...
  • 8
  • 435
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Methods for the Qualitative Evaluation of Lexical Association Measures" doc

Báo cáo khoa học

... part-of-speech tags and minimal PPs were identified.5The PNV triples were selected automatically suchthat the preposition and the noun are constituentsof the same PP, and the PP and the verb co-occurwithin ... moresusceptible to random variation, which illustratesthat evaluation based on a small number of -bestcandidate pairs cannot be reliable.With respect to the recall curves (Figures 3 and 4), we find: ... log-likelihood, and even precision gainedby frequency is better than or at least comparableto log-likelihood. These pairings – log-likelihood and t-test for AdjN, and t-test and frequency forPNV...
  • 8
  • 516
  • 0

"The Potential of Cellulosic Ethanol Production from Municipal Solid Waste: A Technical and Economic Evaluation" doc

Tự động hóa

... of xylan and glucan for ADC final reached 79.1% and 88.2%, respectively. The overall yield of xylan and glucan for ADC green was 83.3% and 89.1%, respectively, through pretreatment and enzymatic ... Municipal solid waste: A Technical and Economic Evaluation Jian Shi, Mirvat Ebrik, Bin Yang*, and Charles E. Wyman Center for Environmental Research and Technology Bourns College of Engineering ... transportation fuels and chemicals because of its abundance, the need to find uses for this problematic waste, and its low and perhaps negative cost. However, significant heterogeneity and possible...
  • 41
  • 554
  • 0
Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation Lecture Notes for Chapter 4 Introduction to Data Mining pptx

Data Mining Classification: Basic Concepts, Decision Trees, and Model Evaluation Lecture Notes for Chapter 4 Introduction to Data Mining pptx

Cơ sở dữ liệu

... it–Class counts in each of the partitions, A < v and A ≥ vSimple method to choose best v–For each v, scan the database to gather count matrix and compute its Gini index–Computationally Inefficient! ... or random coilCategorizing news stories as finance, weather, entertainment, sports, etc© Tan,Steinbach, Kumar Introduction to Data Mining 47 Stopping Criteria for Tree InductionStop expanding ... attribute,–Sort the attribute on values–Linearly scan these values, each time updating the count matrix and computing gini index–Choose the split position that has the least gini indexCheat No No...
  • 101
  • 4,291
  • 1
Báo cáo khoa học:

Báo cáo khoa học: "Correlation between ROUGE and Human Evaluation of Extractive Meeting Summaries" pptx

Báo cáo khoa học

... ICASSP.X. Zhu and G. Penn. 2005. Evaluation of sentence selection forspeech summarization. In ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/ or Summariza-tion.X. Zhu and G. ... Infor-mative Coverage (IC): S2 and S9; Informative Relevance(IRV): S3 and S8; and Informative Redundancy (IRD):S4 and S7.4 Results4.1 Correlation between Human Evaluation and Original ROUGE ScoreSimilar ... R-SU4 and human evaluation. 5 Conclusion and Future WorkIn this paper, we have made a first attempt to system-atically investigate the correlation of automatic ROUGEscores with human evaluation...
  • 4
  • 293
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "From Single to Multi-document Summarization: A Prototype System and its Evaluation" pptx

Báo cáo khoa học

... Japan. DUC and TSC both aim to compile standard training and test collections that can be shared among researchers and to provide common and large scale evaluations in single and multiple ... 2000 and 2001. However, the area is still being fleshed out: most past efforts have focused only on single-document summarization (Mani 2000), and no standard test sets and large scale evaluations ... between most and all, cohesion, some and most, and coherence, some and most. This indicates the strategies employed by NeATS (stigma word filtering, adding lead sentence, and time annotation)...
  • 8
  • 288
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Unsupervised Discovery of Generic Relationships Using Pattern Clusters and its Evaluation by Automatically Generated SAT Analogy Questions" pot

Báo cáo khoa học

... wedemonstrate in our bilingual evaluation. 2.3 Evaluation Method Evaluation for hypernymy and synonymy usuallyuses WordNet (Lin and Pantel, 2002; Widdows and Dorow, 2002; Davidov and Rappoport, 2006). ... meronymy (Berland and Charniak, 1999; Girju et al., 2006), synonymy(Widdows and Dorow, 2002; Davidov and Rap-poport, 2006), and verb strength + verb happens-before (Chklovski and Pantel, 2004). ... (Davidov and Rappoport,2006; Widdows and Dorow, 2002) and meronymy(Berland and Charniak, 1999; Girju et al., 2006).Since named entities are very important in NLP,many studies define and discover...
  • 9
  • 390
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Correlating Human and Automatic Evaluation of a German Surface Realiser" doc

Báo cáo khoa học

... these are. Belz and Reiter(2006) and Reiter and Belz (2009) describe com-parison experiments between the automatic eval-uation of system output and human (expert and non-expert) evaluation of ... 0.03686Table 4: Correlation between dependency-based evaluation and human judgementsthe parses of the original strings. We calculateboth a weighted and unweighted dependency f-score, as given in ... Short Papers, pages 97–100,Suntec, Singapore, 4 August 2009.c2009 ACL and AFNLPCorrelating Human and Automatic Evaluation of a German SurfaceRealiserAoife CahillInstitut f¨ur Maschinelle...
  • 4
  • 285
  • 0
STOCK ASSESSMENT AND FISHERY EVALUATION REPORT FOR THE GROUNDFISH FISHERIES OF THE GULF OF ALASKA AND BERING SEA/ALEUTIAN ISLANDS AREA: ECONOMIC STATUS OF THE GROUNDFISH FISHERIES OFF ALASKA, 2008 potx

STOCK ASSESSMENT AND FISHERY EVALUATION REPORT FOR THE GROUNDFISH FISHERIES OF THE GULF OF ALASKA AND BERING SEA/ALEUTIAN ISLANDS AREA: ECONOMIC STATUS OF THE GROUNDFISH FISHERIES OFF ALASKA, 2008 potx

Cao đẳng - Đại học

... mortality are not available. Tables 7 and 8, and 9 and 10, respectively, provide estimates of discarded catch and discard rates by species, area, gear, and target fishery. Within each area or ... 1.7 and 2.2 million t (Fig. 1 and Table 1). The rapid displacement of the foreign and joint-venture fisheries by the domestic fishery between 1984 and 1991 can be seen by comparing Figures 1 and ... StatusNPFMC Economic SAFESTOCK ASSESSMENT AND FISHERY EVALUATION REPORT FOR THE GROUNDFISH FISHERIES OF THE GULF OF ALASKA AND BERING SEA/ALEUTIAN ISLANDS AREA: ECONOMIC STATUS OF THE GROUNDFISH...
  • 276
  • 668
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Comparing Automatic and Human Evaluation of NLG Systems" potx

Báo cáo khoa học

... Riezler and J. T. Maxwell III. 2005. On som e pit-falls in automatic evaluation and significance testingfor MT. In Proc. ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for MT and/ or ... system, and then comparedthe generated texts to the original corpus texts.Similar evaluations have been used e.g. by Banga-lore et al. (2000) and Marciniak and Strube (2004).Such corpus-based evaluations ... (non-repeating column and row entries)experimental design where each combination ofdate and system is assigned one evaluation. 4 ResultsTable 2 shows evaluation scores for the five NLGsystems and the corpus...
  • 8
  • 376
  • 0
DELIVERING HEALTH EDUCATION VIA THE WEB: DESIGN AND FORMATIVE EVALUATION OF A DISCOURSE-BASED LEARNING ENVIRONMENT pot

DELIVERING HEALTH EDUCATION VIA THE WEB: DESIGN AND FORMATIVE EVALUATION OF A DISCOURSE-BASED LEARNING ENVIRONMENT pot

Sức khỏe giới tính

... protocols and tools were developed to collect quantitative and qualitative data. Pre and post tests related to HIV/AIDS and nutrition will allow for quantitative comparison of knowledge, attitude and ... scheduledlecture and tutorial hours; (3) opportunities for a variety of learning activities including small groupdiscussion and collaborative projects; and (4) exposure to and a forum for expressing and ... Web-based and classroom learning environments; the design and development of a prototype Webenvironment to facilitate these learning activities; and, the formative evaluation of learningactivities and...
  • 12
  • 411
  • 0
self-similar network traffic and performance evaluation.

self-similar network traffic and performance evaluation.

Đại cương

... networkxi20 Toward an Improved Understanding of Network Traf®c Dynamics 507R. H. Riedi and Walter Willinger21 Future Directions and Open Problems in Performance Evaluation and Control of Self-Similar ... Heavy Tails and Heavy Traf®c 143O. J. Boxma and J. W. Cohen7 Fluid Queues, OnaOff Processes, and Teletraf®c Modelingwith Highly Variable and Correlated Inputs 171Sidney Resnick and Gennady ... make ashort detour and discuss self-similar processes in slightly more generality. Furtherextensions and detailed treatments can be found in Beran [9] and Samorodnitsky and Taggu [60].Consider...
  • 574
  • 1,530
  • 0
báo cáo hóa học:

báo cáo hóa học: " Development and preliminary evaluation of a quality of life measure targeted at dementia caregivers" doc

Hóa học - Dầu khí

... emotional and social concerns, and spirituality and benefits were identified.Conclusion: These preliminary results support subsequent evaluation of test-retest reliability,construct validity, and ... familyinvolvement, caregiving demands, worry, spirituality and faith, benefits of caregiving, caregiver feelings, and rolelimitations due to caregiving [19]. Spirituality and faith, and benefits of caregiving ... familyinvolvement, demands of caregiving, caregiver worry, and caregiver feelings (See Table 5). Less patient education and non-white caregiver ethnicity were associated withhigher spirituality and faith and...
  • 12
  • 741
  • 0

Xem thêm