Interchange of one part of speech for another

Document: Scientific report: "ModelTalker Voice Recorder – An Interface System for Recording a Corpus of Speech for Synthesis" ppt

... recording tool for speech research. For example, the MT Voice Recorder offers useful features for language documentation. An immediate warning about a poor-quality recording will alert a researcher to ... features of ModelTalker Voice Recorder. These features include automatic microphone calibration, pitch, amplitude, and pronunciation detection and feedback, and automatic phoneme labeling of speech recordings ... wants to be synthesized clearly and will automatically be included in their entirety in the speech database. These utterances are also automatically labeled before being stored. In addition, for...

Document: Scientific report: "Part-of-Speech Tagging for Twitter: Annotation, Features, and Experiments" pdf

... Creating Speech and Language Data with Amazon’s Mechanical Turk. John Lafferty, Andrew McCallum, and Fernando Pereira. 2001. Conditional random fields: Probabilistic models for segmenting and labeling ... (The rates for M and Y are both < 0.0005.) 43 Tagset. We set out to develop a POS inventory for Twitter that would be intuitive and informative, while at the same time simple to learn and apply, so ... publish a message with attribution. For example, than for Standard English text. For example, apostrophes are often omitted, and there are frequently words like ima (short for I’m gonna) that cut across...

Document: Scientific report: "Arabic Tokenization, Part-of-Speech Tagging and Morphological Disambiguation in One Fell Swoop" pdf

... scheme. Finally, we can determine the POS tag, for any morphologically motivated POS tagset. Thus, we have performed tokenization, traditional POS tagging, and full morphological disambiguation in one ... tokenization, part-of-speech tagging, and morphological disambiguation in Arabic. We have shown that the use of a morphological analyzer is beneficial in POS tagging, and we believe our results are ... performing machine learning experiments ... development, training, and test corpora with roughly 12,000 word tokens in each of the development and test corpora, and 120,000 words in each...

Scientific report: "Efficient Optimization of an MDL-Inspired Objective Function for Unsupervised Part-of-Speech Tagging" docx

... perform this optimization for each instance of (15). These optimizations could easily be performed in parallel for greater scalability. ... (13) subject to the constraints Σ_w P(w | t) = 1 and ... non-convex optimization problem for which we invoke a publicly available constrained optimization tool, ALGENCAN (Andreani et al., 2007). To carry out its optimization, ALGENCAN requires computation of ... In Proceedings of the ACL. S. Goldwater and T. L. Griffiths. 2007. A fully Bayesian approach to unsupervised part-of-speech tagging. In Proceedings of the ACL. M. Hyder and K. Mahata. 2009. An approximate...

Scientific report: "Semisupervised condensed nearest neighbor for part-of-speech tagging" pot

... algorithm to condensed nearest neighbor (Hart, 1968; Alpaydin, 1997) and showed that the algorithm leads to more condensed models, and that it performs significantly better than condensed nearest neighbor ... nearest neighbor. For part-of-speech tagging, the error reduction over condensed nearest neighbor is more than 40%, and our model is 40% smaller than the one induced by condensed nearest neighbor. While ... data by nearest neighbors will enable us to better training set condensation. This is exactly what semi-supervised condensed nearest neighbor (SCNN) does. We first run a WCNN C and obtain a condensed...

Scientific report: "A Cascaded Linear Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" pdf

... tagging (Collins, 2002), Chinese word segmentation (Ng and Low, 2004; Zhang and Clark, 2007) and so on. We trained a character-based perceptron for Chinese Joint S&T, and found that the perceptron ... the POS information and reported the F-measure on segmentation only, while the second performed Joint S&T using POS information and reported the F-measure both on segmentation and on Joint S&T ... higher-order word LM on a larger-scale corpus. Finally, the word count penalty gives improvement to the cascaded model, 0.13 points on segmentation and 0.16 points on Joint S&T. In summary, the cascaded model...

Scientific report: "Examining the Content Load of Part of Speech Blocks for Information Retrieval" pptx

... behind both of these hypotheses is that, just as individual words can be content-rich or content-poor, the same can hold for blocks of parts of speech. According to our first hypothesis, POS blocks ... hypothesis, POS blocks can be categorized as content-rich or content-poor, on the basis of the part of speech class membership of their individual components. Specifically, we hypothesise that the ... on the causes of the lack of ... Content Load is strictly less than zero, we consider the POS block content-poor. We assume an underlying equivalence of content in all open class parts of speech, ...

Scientific report: "Machine Aided Error-Correction Environment for Korean Morphological Analysis and Part-of-Speech Tagging" pptx

... suggest candidate tags to the user and then to find words which are likely to be wrongly tagged. Correction rules and a manual correction log are necessary for automatic error detection and candidate ... Ph.D. Thesis, Dept. of Computer and Information Science, University of Pennsylvania. K. Choi, Y. Han, Y. Han, and O. Kwon. 1994. "KAIST Tree Bank Project for Korean: Present and Future Development". SNLP, ... and J. Lee. 1996. "Rule-based error correction for statistical part-of-speech tagging". Korea-China Joint Symposium on Oriental Language Computing, pages 125-131. H. Lim, J. Kim, and H. Rim. 1996. "A Korean...

Scientific report: "Categorial Fluidity in Chinese and its Implications for Part-of-speech Tagging" pptx

... verbalisation in Chinese was observed in Tai (1997). In other words, verbs are more freely deverbalised than nouns denominalised. This fluidity between verbal and nominal status of verbs can in theory ... nouns, and the implications this phenomenon might have for POS tagging. References: Chinese Knowledge Information Processing Group (CKIP). 1993. iirt,=,m,3 }K (EN). Technical Report no. 9305, Academia Sinica, ... LIVAC, A Chinese Synchronous Corpus, and Some Applications. In Proceedings of the ICCLC International Conference on Chinese Language Computing, Chicago, pages 233-238. Xia, F. 2000. The Part-Of-Speech...

Scientific report: "Feature-Rich Part-of-speech Tagging for Morphologically Complex Languages: Application to Bulgarian" docx

... the wordform, OldEnd is the string that has to be removed from the end of the wordform, and NewEnd is the string that has to be concatenated to the beginning of the wordform in order to produce ... (2005) decomposed the complex tags into factors, where models for predicting part-of-speech, gender, number, case, and lemma are estimated separately, and then composed into a single CRF model; ... training dataset for each class of tags that can be assigned to some word type, according to the lexicon. For example, the most frequent tag for politika is Ncfsi, and the most frequent tag for the tag-class...

Scientific report: "A Hierarchical Pitman-Yor Process HMM for Unsupervised Part of Speech Induction" doc

... translation systems. The HMM ignores orthographic information, which is often highly indicative of a word’s part-of-speech, particularly so in morphologically rich languages. For this reason Clark ... Association for Computational Linguistics. Sujith Ravi and Kevin Knight. 2009. Minimized models for unsupervised part-of-speech tagging. In Proceedings of the Joint Conference of the 47th Annual Meeting of ... effectiveness of the PYP prior we include results using a Dirichlet Process prior (DP). We see that for all models the use of the PYP provides some gain for the HMM, but diminishes for the 1HMM. This...
