27294 conjunctions for blankfilling texts

Tài liệu Báo cáo khoa học: "Using Confidence Bands for Parallel Texts Alignment" pptx

Tài liệu Báo cáo khoa học:
... have been two mainstreams for parallel text alignment One assumes that translated texts have proportional sizes; the other tries to use lexical information in parallel texts to generate candidate ... equal frequencies per pair of parallel texts (average percentage of homographs inside brackets) For average size texts (e.g the Written Questions), these words account for about 5% of the total (about ... want to find reliable correspondence points They provide the basic means for extracting reliable information from parallel texts However, as far as we learned from the above papers, current methods...
  • 8
  • 154
  • 0

Báo cáo khoa học: "Planning Reference Choices for Argumentative Texts" ppt

Báo cáo khoa học:
... s Three reference forms have been identified by the author for reasons in naturally occurring proofs (Huang, 1990): The omit form: where a reason is not mentioned at all The explicit form: where ... expressions for methods of inference in PCAs as well Below are the three reference forms identified by the author, which are analogous to the corresponding cases for reasons: the explicit form: this ... usually standard verbalizations the omit form: in this case a word such as "thus" or "therefore" will be used The implicit form: Similar to the implicit form for the expression of reasons, an implicit...
  • 8
  • 110
  • 0

Tài liệu Báo cáo khoa học: "A New Dataset and Method for Automatically Grading ESOL Texts" pdf

Tài liệu Báo cáo khoa học:
... shared dataset for training and testing such systems and comparing their performance As it is likely that the deployment of such systems will increase, standardised and independent evaluation methods ... We make such a dataset of ESOL examination scripts available1 (see Section for more details), describe our novel approach to the task, and provide results for our system on this dataset We address ... consistent, comparable and replicable set of results based entirely on the new dataset and on public-domain tools and data, whilst also experimentally motivating some novel feature types for the AA task,...
  • 10
  • 149
  • 0

Tài liệu Báo cáo khoa học: "An Algorithm for Simultaneously Bracketing Parallel Texts by Aligning Words" ppt

Tài liệu Báo cáo khoa học:
... Singleton-Rebalancing Algorithm We now introduce an algorithm for further improving the bracketing accuracy in cases of singletons Consider the following bracketing produced by the algorithm of the previous section: ... approach for aligning parallel texts In Proceedings of the Fifteenth International Conference on Computational Linguistics, 1096-1102, Kyoto FUNG, PASCALE& KATI~J~ McKEoWN 1994 Aligning noisy parallel ... Grammar-based bracketing methods cannot directly produce results of a comparable nature 7 Conclusion We have proposed a new tool for the corpus linguist's arsenal: a method for simultaneously bracketing...
  • 8
  • 165
  • 0

Tài liệu Báo cáo khoa học: "REPRESENTATION OF TEXTS FOR INFORMATION RETRIEVAL" pdf

Tài liệu Báo cáo khoa học:
... relations; and foregrounding devices (Figure 4) 148 Belkin, N.J., Brooks, H.M., and Oddy, R.N 1979 Representation and classification of knowledge and information for use in interactive information ... constraints typical of DR systems The modi~,cations are designed to recognize such aspects of discourse structure as establishment of topic; "setting of context; summarizing; concept foregrounding; ... correct for the distortion caused by the distribution of function words in the recognition of multi-word concepts N=14 Our current modifications to the analysis consist primarily of methods for translating...
  • 2
  • 142
  • 0

Tài liệu Báo cáo khoa học: "Collaborative Machine Translation Service for Scientific texts" pdf

Tài liệu Báo cáo khoa học:
... k 5.5 k Translation of Scientific Texts The translation system used is a Hybrid Machine Translation (HMT) system from French to English and from English to French, adapted to translate scientific ... training corpus The actual translation of the paper is performed using adapted translation as described in Section • The translation process generates a bilingual TEI format preserving the source ... archived in TEI format and available for display in HTML using dedicated XSLT style sheets The Grobid System Based on state-of-the-art machine learning techniques, Grobid (Lopez, 2009) performs reliable...
  • 5
  • 110
  • 0

Báo cáo khoa học: "Searching for Topics in a Large Collection of Texts" doc

Báo cáo khoa học:
... Document ranking against a query is based on statistical correlation between query words and words in a document Since a document is a small sample of text, the statistics in a document are often ... Proceedings of the International Symposium on Information and Communication Technologies, pages 311–316 Trinity College Dublin, Ireland JAMA 2004 JAMA: A Java Matrix Package Publicdomain, http://math.nist.gov/javanumerics/jama/ ... decompositions for large sparse text data using clustering Machine Learning, 42(1/2):143–175 Jan Hajiˇ 2000 Morphological tagging: Data vs dicc tionaries In Proceedings of the 6th ANLP Conference, 1st NAACL...
  • 6
  • 189
  • 0

Báo cáo khoa học: "Exploiting Parallel Texts for Word Sense Disambiguation: An Empirical Study" potx

Báo cáo khoa học:
... multiple and distinct Chi- nese translations appear in the aligned Chinese sentence For example, for an English occurrence channel, both “频道” (sense translation) and “途 径” (sense translation) ... lumped into one sense (i.e., they are all translated into one Chinese word) , we not perform WSD on these words The aver- age number of senses before and after sense lumping is 5.07 and 3.52 respectively ... which lists seven senses for the noun channel Two senses are lumped together if they are translated in the same way in Chinese For example, sense and of channel are both translated as “频道” in...
  • 8
  • 103
  • 0

Báo cáo khoa học: "Thematic segmentation of texts: two methods for two kinds of texts" pdf

Báo cáo khoa học:
... texts both for building the collocation network and for their thematic segmentation /max = log2 N2(Sw - 1) with N: corpus size and Sw: window size Thematic segmentation lexical network without ... vectors Thus, the segmentation process produces a text representation with thematic blocks including paragraphs about the same topic The two methods have been tested on different kinds of texts We ... the number of occurrences of a descriptor Tj in a paragraph i; dfi is the number of paragraphs in which Tj occurs and 393 descriptor is added in the paragraph if absent In case of reinforcement,...
  • 5
  • 141
  • 0

Báo cáo khoa học: "An IR Approach for Translating New Words from Nonparallel, Comparable Texts" pot

Báo cáo khoa học:
... other words from this online nonparallel, comparable corpus of newspaper materials We choose to use issues of the English newspaper Hong Kong Standard and the Chinese newspaper Mingpao, from Dec.12,97 ... almost all unambiguous Chinese new words find their translations in the first 100 of the ranked list Six of the Chinese words have correct translation as their first candidate Related work Using ... experiment for finding a translation for ~,~, Results In order to apply the above algorithm to find the translation for ~ / l i o u g a n from the HKStand a r d / M i n g p a o corpus, we first use...
  • 7
  • 72
  • 0

Báo cáo khoa học: "A LANGUAGE-INDEPENDENT AN APHORARE SOLUTION SYSTEM FOR UNDERSTANDING MULTILINGUAL TEXTS" pptx

Báo cáo khoa học:
... unique advantages First, it is the only working language-independent discourse system we are aware of By "language-independent, " we mean that the discourse module can be used for different languages ... Webber A Formal Approach to Discourse Anaphora Technical report, Bolt, Beranek, and Newman, 1978 [7] Ido Dagan and Alon Itai Automatic Acquisition of Constraints for the Resolution of Anaphora ... Sandy Shinn The Murasaki Project: Multilingual Natural Language Understanding In Proceedings of the ARPA Human Language Technology Workshop, 1993 [2] Chinatsu Aone, Doug McKee, Sandy Shinn, and...
  • 8
  • 90
  • 0

Báo cáo khoa học: "Combining a Statistical Language Model with Logistic Regression to Predict the Lexical and Syntactic Difficulty of Texts for FFL" potx

Báo cáo khoa học:
... significantly to the adjacent accuracy of classifying the C1 and C2 texts Table 1: Mean Pearson’s r coefficient, exact and adjacent accuracies for both models with the tenfold cross-validation evaluation ... etc The goal is to have as wide a coverage as possible, to achieve maximum generalisability of the formula, and also to check what sort of texts it does not fit (e.g statistical descriptive analyses ... the same as the rest of the textbook material (metalinguistic terms and so on can be found there) 4.1 The language model The lexical difficulty of a text is quite an elaborate phenomenon to parameterise...
  • 9
  • 214
  • 0

Báo cáo khoa học: "A Tool for Deep Semantic Encoding of Narrative Texts" docx

Báo cáo khoa học:
... encodings of narrative texts for machine learning purposes 5 Other features of the software package, such as the setting of causal links and the ability to undo/redo A review of the results of our formative ... Description of Tool The collection process is amenable to community and non-expert annotation by means of a graphical encoding tool We believe this resource can serve a range of experiments in semantics ... discourse models for narrative structure In Proceedings of the ACL Workshop on Discourse Annotation, Barcelona, Spain An outline of the goals of the project and the innovative aspects of our formal representation...
  • 4
  • 91
  • 0

Essential Texts for MBA Students 2012 pptx

Essential Texts for MBA Students 2012 pptx
... Essential texts for MBA Students Welcome Welcome to the 2012 MBA catalogue from SAGE This catalogue has been created specifically with the MBA student in mind It includes ... catalogues online at w w w.sagepub.co.uk or phone us on +44 (0)20 7324 8500 SAGE • Essential texts for MBA Students 2012 mailing code: DM 1K34 Cover Image: © iStockphoto ... introductory case for each chapter and a concluding case for the majority of chapters to demonstrate for students why and how social marketing works • chapter summaries of key points and questions for discussion...
  • 36
  • 117
  • 0

Xem thêm

Nạp tiền Tải lên
Đăng ký
Đăng nhập