automatic training of lemmatization rules

Statistical training of researchers in total quality management: The japanese experience

Statistical training of researchers in total quality management: The japanese experience

Ngày tải lên : 26/10/2012, 09:41
... been composed of one professor, one associate professor and two research associates. To give an example, at the University of Tokyo about 15 professors and associate professors of statistics ... researcher to be aware of the stage he or she is in the stream of R & D. This implies the necessity of an in-company training course at least in the final stage of education of applied statistics, ... manager to supervise the whole process of R & D. Now under the circumstances of Japan and the characteristics of applied statistics, the need of some extensive training system for people to perform...
  • 12
  • 714
  • 0
Tài liệu Automatic Management of Network Security Policy pptx

Tài liệu Automatic Management of Network Security Policy pptx

Ngày tải lên : 14/02/2014, 16:20
... actual enforcement of these policies. Many of our design decisions were influenced by the lack of verifiable enforcement mechanisms for certain security phenomena. An example of this is Denial of Service ... provide required by policies. Often it suffices to reason about connectivity to analyze availability of services to groups of users. For instance, instead of modeling all the details of a file server, we ... settings of firewall rules. Firewall-based layered approaches [2][10] try to map security devices to the layers in the architectural design of IP networks. One of the most comprehensive treatments of...
  • 15
  • 467
  • 0
Tài liệu Báo cáo khoa học: "Applications of GPC Rules and Character Structures in Games for Learning Chinese Characters" doc

Tài liệu Báo cáo khoa học: "Applications of GPC Rules and Character Structures in Games for Learning Chinese Characters" doc

Ngày tải lên : 19/02/2014, 19:20
... pro- nunciations of “匋”, “淘”, “陶”, and “啕 ” are /tao2/. Pronunciations of specific substrings in words of alphabetic languages are governed by grapheme- phoneme conversion (GPC) rules, though ... strict GPC rules either, but they re- main to be good agents for learning to read. Despite the differences among phoneme systems and among the degrees of strictness of the GPC rules in different ... goals. Evaluation of the tool with college and graduate students showed that our system offered an efficient and effective environment for this authoring task. Currently, players of our games...
  • 6
  • 590
  • 0
Tài liệu Báo cáo khoa học: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf

Tài liệu Báo cáo khoa học: "Automatic Extraction of Lexico-Syntactic Patterns for Detection of Negation and Speculation Scopes" pdf

Ngày tải lên : 20/02/2014, 04:20
... large number of very specific rule patterns - 1,681 nega- tion scope rules and 3,043 speculation scope rules were extracted from the training dataset. To identify a more general set of rules (and ... (2010) used a combination of manually compiled rules, a CRF classifier, and a sequence of post-processing steps on the same task; Velldal et al (2010) manu- ally compiled a set of heuristics based on ... rule set (1,681 negation scope rules and 3,043 speculation scope rules) on the test data. As expected, this rule set consisting of very specific scope matching rules resulted in very high precision and...
  • 5
  • 543
  • 1
Tài liệu Báo cáo khoa học: "Automatic learning of textual entailments with cross-pair similarities" ppt

Tài liệu Báo cáo khoa học: "Automatic learning of textual entailments with cross-pair similarities" ppt

Ngày tải lên : 20/02/2014, 12:20
... ex- amples of the previous section. From the point of view of bag -of- word methods, the pairs (T 1 , H 1 ) and (T 1 , H 2 ) have both the same intra-pair simi- larity since the sentences of T 1 and ... co-chairman of Miramax.” 407 where 0 ≤ λ ≤ 1 and l(f i ) is the number of lev- els of the subtree f i . Thus λ l(f i ) assigns a lower weight to larger fragments. When λ = 1, ∆ is equal to the number of ... head of constituents. The example of Fig. 1 shows that the placeholder 0 climbs up to the node governing all the NPs. 5.3 Pruning irrelevant information in large text trees Often only a portion of...
  • 8
  • 413
  • 0
Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx

Tài liệu Báo cáo khoa học: "Automatic Construction of Polarity-tagged Corpus from HTML Documents" docx

Ngày tải lên : 20/02/2014, 12:20
... the polarity of words There are some works that discuss learning the po- larity of words instead of sentences. Hatzivassiloglou and McKeown proposed a method of learning the polarity of adjectives ... are not in the resources. 458 (1) kono software-no riten-ha hayaku ugoku koto this software-POST advantage-POS T quickly run to The advantage of this software is to run quickly. (2) ketten-ha jikan-ga ... combination of table and other tags can represent various kinds of ta- bles, it is difficult to craft precise rules that can deal with any table. Therefore, we consider only two types of tables in...
  • 8
  • 409
  • 0
Tài liệu Báo cáo khoa học: "Automatic Identification of Pro and Con Reasons in Online Reviews" ppt

Tài liệu Báo cáo khoa học: "Automatic Identification of Pro and Con Reasons in Online Reviews" ppt

Ngày tải lên : 20/02/2014, 12:20
... examples of sen- tences that our system identified as reasons of complaints. (1) Unfortunately, I find that I am no longer comfortable in your establishment because of the unprofessional, ... Orientation of Adjectives. Proceedings of 35th Annual Meet- ing of the Assoc. for Computational Linguistics (ACL-97): 174-181 Hatzivassiloglou, Vasileios and Janyce Wiebe. 2000. Effects of Adjective ... Determin- ing the Sentiment of Opinions. Proceedings of COLING-04. pp. 1367-1373. Geneva, Switzer- land. Kim, Soo-Min and Eduard Hovy. 2005. Automatic Detection of Opinion Bearing Words and...
  • 8
  • 461
  • 1
Tài liệu Báo cáo khoa học: "Automatic Evaluation of Sentence-Level Fluency Andrew Mutton∗" pdf

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Sentence-Level Fluency Andrew Mutton∗" pdf

Ngày tải lên : 20/02/2014, 12:20
... grammar, rhythm and flow, appropriateness of tone, and several other specific characteristics of good text. In terms of automatic evaluation, we are not aware of any technique that measures only fluency ... Methods PoStag In the first of these, we constructed a rough approximation of typical sentence grammar structure by taking bigrams over part -of- speech tags. 6 Then, given a string of PoS tags of length n, t 1 . ... can be fooled by the method of sentence generation; GLEU, how- ever, gives a consistent estimate of fluency regard- less of generation type; and, across all types of gen- erated sentences examined...
  • 8
  • 507
  • 0
Tài liệu Báo cáo khoa học: "Online Large-Margin Training of Dependency Parsers" docx

Tài liệu Báo cáo khoa học: "Online Large-Margin Training of Dependency Parsers" docx

Ngày tải lên : 20/02/2014, 15:20
... of parent node in dependency tree. c-word: word of child node. p-pos: POS of parent node. c-pos: POS of child node. p-pos+1: POS to the right of parent in sentence. p-pos-1: POS to the left of ... parent of x j . T = {(x t , y t )} T t=1 denotes the training data. We follow the edge based factorization method of Eisner (1996) and define the score of a dependency tree as the sum of the score of ... second type of feature provides the local con- text of the attachment, that is, the words before and after the parent-child pair. This feature took the form of a POS 4-gram: The POS of the parent,...
  • 8
  • 443
  • 0
Tài liệu Báo cáo khoa học: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc

Tài liệu Báo cáo khoa học: "Automatic Evaluation of Machine Translation Quality Using Longest Common Subsequence and Skip-Bigram Statistics" doc

Ngày tải lên : 20/02/2014, 16:20
... construction of N-best translation lexicons from parallel text. Melamed (1995) used the ratio (LCSR) between the length of the LCS of two words and the length of the longer word of the two ... present the evaluations of ROUGE-L, ROUGE-S, and compare their per- formance with other automatic evaluation meas- ures. 5 Evaluations One of the goals of developing automatic evalua- tion ... Proceedings of COLING-92, Nantes, France. Thompson, H. S. 1991. Automatic Evaluation of Translation Quality: Outline of Methodology and Report on Pilot Experiment. In Proceedings of the Evaluator’s...
  • 8
  • 442
  • 0
Tài liệu Báo cáo khoa học: "Automatic clustering of collocation for detecting practical sense boundary" ppt

Tài liệu Báo cáo khoa học: "Automatic clustering of collocation for detecting practical sense boundary" ppt

Ngày tải lên : 20/02/2014, 16:20
... 1 shows the average number of clusters with each clustering method shown chapter 3 by the part of speech. WC and WF are the average number of senses by the part of speech. In Table 1 and ... the word senses numbered i of the word x. I x is the word sense indexing function of x that gives an index to each sense of the word x. All contextual words x i ±j of a central word x have ... V N C Æ 2P C/V . In this formula, V means a set of vocabulary, N is the size of the contextual window that is an integer, and C means a set of corpus. In this paper, vocabulary refers to all...
  • 4
  • 425
  • 0
Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

Tài liệu Báo cáo khoa học: "Automatic Collection of Related Terms from the Web" pptx

Ngày tải lên : 20/02/2014, 16:20
... (20%) out of 210 terms were col- lected by the system. This low recall primarily comes from the failure of automatic term recogni- tion (case A in the above classification). Improve- ment of this ... following rules are true with a few exception: (1) A type-1 term is a narrower term of the seed term s; (2) A type-2 term is a broader term of the seed term s. We assume that these rules are ... Bin Umino. 1996. Methods of au- tomatic term recognition: A review. Terminology, 3(2):259–289. Hiroshi Nakagawa. 2000. Automatic term recognition based on statistics of compound nouns. Terminology, 6(2):195–210. Satoshi...
  • 4
  • 437
  • 0
Tài liệu Báo cáo khoa học: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc

Tài liệu Báo cáo khoa học: "AUTOMATIC ACQUISITION OF SUBCATEGORIZATION FRAMES FROM UNTAGGED TEXT" doc

Ngày tải lên : 20/02/2014, 21:20
... The completeness of the output list increases monotonically with the total number of occurrences of each verb in the corpus. False positive rates are one to three percent of observa- tions. ... architecture of the system, and that of this pa- per, directly reflects the three challenges described above. The system consists of three modules: 1. Verb detection: Finds some occurrences of verbs ... is evaluated in terms of efficiency and accuracy. The most useful estimate of effi- ciency is simply the density of observations in the corpus, shown in the first column of Table 3. The SF...
  • 6
  • 416
  • 0
Tài liệu Báo cáo khoa học: "ON THE AUTOMATIC TRANSFORMATION OF CLASS MEMBERSHIP CRITERIA" docx

Tài liệu Báo cáo khoa học: "ON THE AUTOMATIC TRANSFORMATION OF CLASS MEMBERSHIP CRITERIA" docx

Ngày tải lên : 21/02/2014, 20:20
... by means of a process of ~ inmtRntlat~nn OF the deflnition the translation of the de/initlon f~'om a set of criteria for satisfying the definition into an exemplary instance of the concept ... components of the definition of the class are also present in the description of the instance. This also permits easy representation of modifications to the definition, whenever the capability of ... application discussed here (the assignment of an instance of a knowledge structure to one of a set of classes), inexact matching and close relatives thereof are also found in several other domains...
  • 6
  • 366
  • 0
Tài liệu Báo cáo khoa học: "Automatic Detection of Nonreferential It in Spoken Multi-Party Dialog" doc

Tài liệu Báo cáo khoa học: "Automatic Detection of Nonreferential It in Spoken Multi-Party Dialog" doc

Ngày tải lên : 22/02/2014, 02:20
... re- call of 55.1%, a precision of 71.9% and a resulting F-measure of 62.4% for the detection of the class nonreferential. The overall classification accuracy was 75.1%. The advantage of using ... yielded a recall of 57.7%, a precision of only 70.1% and an F-measure of 63.3% for the detec- tion of this class. The overall accuracy was 74.9%. The system produced a mere five rules (compared to ... a mi- nority of all instances of it. Evans (2001) reports that his corpus of approx. 370.000 words from the SUSANNE corpus and the BNC contains 3.171 examples of it, approx. 29% of which are...
  • 8
  • 436
  • 0