0

predicting the fluency of text

Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Predicting the fluency of text with shallow structural features: case studies of machine translation and human-written text" doc

Báo cáo khoa học

... of the 12th Conference of the European Chapter of the ACL, pages 139–147,Athens, Greece, 30 March – 3 April 2009.c2009 Association for Computational Linguistics Predicting the fluency of text ... between text quality assess-ment of the articles and the percentage of fluentsentences according to different models. text, and levels of fluency in the automatically pro-duced text. The distinctions ... the analysis is performed in or-der to better understand which factors are predic-tive of good fluency. The distribution of fluency scores in the datasetis rather skewed, with the majority of...
  • 9
  • 438
  • 0
Tapping into the Power of Text Mining

Tapping into the Power of Text Mining

Cơ sở dữ liệu

... element of the vector usually represents a word (or a group of words) of the document collection, i.e. the size of the vector is defined by the number of words (orgroups of words) of the complete ... from the number of clusters is the silhouette coefficient SC(P) (cf. [KR90]). The main idea of the coef-ficient is to find out the location of a document in the space with respect to the cluster of ... StemmingIn order to reduce the size of the dictionary and thus the dimensionality of the descrip-tion of documents within the collection, the set of words describing the documents canbe reduced...
  • 37
  • 1,334
  • 3
The application of games in teaching grammar with reference to tieng anh 10 textbook at ha trung high school, thanh hoa province

The application of games in teaching grammar with reference to tieng anh 10 textbook at ha trung high school, thanh hoa province

Thạc sĩ - Cao học

... win. 3. 81.1% of the students find that the games guided by their teacher are easy to understand, 18.4% of the students sometimes don’t understand the rule of the games, and 0.5% of the students ... understanding of the lesson.PART III CONCLUSION 1. SummariesIn summary, the study deals with the theories of the role of grammar, students’ motivation, and the application of games in teaching ... tense or the form of verb in each clause of that sentence, guess the meaning and the usage of this condition. The group which gives the clear and correct answer will be the winner. The teacher...
  • 39
  • 1,577
  • 8
A STUDY ON THE TRANSLATION OF ENGLISH COMPUTER TEXTS IN VIETNAMESE EQUIVALENTS

A STUDY ON THE TRANSLATION OF ENGLISH COMPUTER TEXTS IN VIETNAMESE EQUIVALENTS

Khoa học xã hội

... the object of the verb in the active form in an active structure, becomes the subject of the verb in the passive form; while the performer of an action (the agent) – the subject of the verb in ... ve huu – The General Retires” in The Other Side of Heaven”, 1995), which reflects the use of TL adjective in place of SL verb. The fourth type of transposition is the replacement of a virtual ... structure form of a source text. Meanwhile, dynamic equivalence is the principle of equivalence of effect on reader of TT or the same effect on the TL receivers as the source text has on the SL receivers....
  • 94
  • 1,125
  • 3
difficulties in teaching reading comprehension with the new english textbook “tieng anh 10” (the set of standard textbooks) to the 10th form students at ke sat high school

difficulties in teaching reading comprehension with the new english textbook “tieng anh 10” (the set of standard textbooks) to the 10th form students at ke sat high school

Khoa học xã hội

... can help them to be acquainted with the theme and the language relating to the theme. From then, the students can speak, listen and write about the things relating to the theme in the next ... partly helps the students understand the topic of the reading passage. For example, in Unit 1, the title of the unit is The day in the life of. ”. Glancing at the title and looking at the enclosed ... the title of the unit (frequently, the title of the unit is the title of the paasage), or to look at and discuss the picture enclosed the passage. In addition, the teacher can use the available...
  • 44
  • 1,705
  • 8
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "The impact of language models and loss functions on repair disfluency detection" pptx

Báo cáo khoa học

... feature is the log of the “fig-ure of merit” used to guide search in the noisychannel model when it is producing the 25-bestlist for the reranker. The log figure of merit is the sum of the log ... when it is part of the reparandum or Iwhen it is part of the interregnum.1. CopyFlagsX Y: When there is an exact copyin the input text of length X (1 ≤ X ≤ 3) and the gap between the copies is ... list of possible speech disfluency analyses. The choice of this model is driven by the observation that the re-pairs frequently seem to be a “rough copy” of the reparandum, often incorporating the...
  • 9
  • 609
  • 0
A Text-Book of the History of Architecture Seventh Edition, revised pdf

A Text-Book of the History of Architecture Seventh Edition, revised pdf

Kiến trúc - Xây dựng

... architecture. In the North and West, meanwhile, under the growing institutions of the papacy and of the monastic orders and the emergence of a feudal civilization out of the chaos of the Dark Ages, the ... are gathered some of the results of recent investigations and of the architectural progress of the last few years which could not readily be introduced into the text of this edition. The General ... to harmonize in a building the requirements of utility and of beauty. It is the most useful of the fine arts and the noblest of the useful arts. It touches the life of man at every point. It...
  • 25
  • 499
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Case markers and Morphology: Addressing the crux of the fluency problem in English-Hindi SMT" pot

Báo cáo khoa học

... correct casemarking is a crucial part of making translationsconvey the right meaning.801Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP of the AFNLP, pages 800–808,Suntec, ... likely. The separation of the lemma and suffix helps in tiding over the data sparsity problem by allowing the systemto reason about the suffix-case marker com-bination rather than the combination of ... represented ina scope. The Stanford dependency parser on the other hand represents these dependencies with the help of the clausal complement relation, whichlinks said with hit, and uses the complementizerrelation...
  • 9
  • 465
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Practical Solution to the Problem of Automatic Part-of-Speech Induction from Text" pdf

Báo cáo khoa học

... vector of each word from the centroid of its closest cluster, and to assign the differential vector to the most appropriate other cluster. This process can be repeated until the length of the ... a strong negative effect on the results of the vector comparisons. Fortunately, the problem of data sparseness can be minimized by reducing the dimensionality of the matrix. An appropriate ... vectors, and by assigning these to the most similar other cluster. Hereby for the cosine similarity we set a threshold of 0.8. That is, only if the similarity between the differential vector...
  • 4
  • 433
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "The Role of Lexico-Semantic Feedback in Open-Domain Textual Question-Answering" ppt

Báo cáo khoa học

... Suchalternations improve the recall of the answerparagraphs. For example, in the case of questionQ221: “Who killed Martin Luther King ?”,by considering the synonym of killer, the nounassassin, the Q&A ... sledin the Iditarod ?”, since the definition of Word-Net sense 2 of noun harness contains the bigram“pull cart” and both sled and cart are forms of vehicles, the alternation of the pair of keywordspull, ... question the performance wascomputed by the reciprocal value of the rank(RAR) of the highest-ranked correct answer givenby the system. Given that only the first five an-swers were considered in the...
  • 8
  • 508
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "PARSING VS. TEXT PROCESSING IN THE ANALYSIS OF DICTIONARY DEFINITIONS" pot

Báo cáo khoa học

... sample definition and the triples the parser found in it. ABDOMEN 0 1 N THE PART OF THE BODY BETWEEN THE THORAX AND THE PELVIS (THE) pmod (PART) (ABDOMEN 0 1 N) lm (THE) (ABDOMEN 0 1 N) ... inflected forms analyzed, and other modifications of the kind often brought under the rubric of "transformations." The LSP can do this sort of thing very welL The defining words also need ... We extracted the set of intransitive verb definitions, suspecting that these would be the easiest to work with. This is the smallest of the four major 219 Semantic Analysis of Definitions...
  • 8
  • 461
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Predicting Strong Associations on the Basis of Corpus Data" pdf

Báo cáo khoa học

... rank of 101. Thus, the lower the rank, the better the performance of the system.While there are obviously many more ways of as-sembling a test set and scoring the several systems,we found these ... form the building blocks of a compound, and it is possi-ble that one part of a compound calls up the other. The examples show that the process of compound-ing can go in either direction: the ... comparison of the document-based model and the log-likelihood ratio on the basis of the cue–target pairs with the largest difference in log ranks between the two approaches.tween the models in the...
  • 9
  • 434
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Combining a Statistical Language Model with Logistic Regression to Predict the Lexical and Syntactic Difficulty of Texts for FFL" potx

Báo cáo khoa học

... wereexcluded, because there is no guarantee the language employed there is the same as the rest of the textbook material (metalinguisticterms and so on can be found there).Up to now, using these criteria, ... specificissue of the readability of texts for FFL learn-ers. So, any comparisons with previous studies aresomewhat flawed by the fact that neither the targetpopulation nor the scale of difficulty is the ... attention to the factthat the MLR model multiplies the number of pa-rameters by J − 1 compared to the PO model.Because of this, they recommend using the POmodel.6 Implementation of the modelsHaving...
  • 9
  • 514
  • 0
Báo cáo khoa học:

Báo cáo khoa học: " The Development of Lexical Resources for Information Extraction from Text Combining Word Net and Dewey Decimal Classification" potx

Báo cáo khoa học

... comments to the paper. tion requirement. Unfortunately one of the cur- rent trends in IE is the progressive reduction of the size of training corpora: e.g., from the 1,000 texts of the MUC-5 ... entries in the lexicon. The BL could be seen as the complementary set of the FL with respect to the generic language, i.e. it contains all the words of the language that do not belong to the FL. ... mentioned, there are two problems related to the use of generic dictionaries with respect to the IE needs. First there is no clear way of extracting from them the mapping between the FL and the...
  • 4
  • 436
  • 0

Xem thêm