0

a syllable based word recognition model

Báo cáo khoa học:

Báo cáo khoa học: "A Syllable Based Word Recognition Model for Korean Noun Extraction" potx

Báo cáo khoa học

... is larger thanthat of an alphabet in English. In addition, there areparticular characteristics in Korean syllables. Thefact that words do not start with certain syllablesis one of such examples. ... itrequires a training data. Because the existing KoreanPOS tagged corpora are annotated by a morphemelevel, we cannot use them as a training data withoutconverting the data suitable for the word recognition model. ... such as “(nim)”, “ (deul)”, or“(jeog)” is also regarded as a word because it isan uninflected morpheme.3 Syllable based word recognition model A Korean syllable consists of an obligatory...
  • 8
  • 368
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Discriminative Syntactic Word Order Model for Machine Translation" pdf

Báo cáo khoa học

... translation, we develop a discriminative or-der model. An advantage of such a model is that wecan easily combine different kinds of features (suchas syntax -based and surface -based) , and that ... Distortion models forstatistical machine translation. In ACL.D. Chiang. 2005. A hierarchical phrase -based model for statis-tical machine translation. In ACL.M. Collins. 2000. Discriminative reranking ... inference and train-ing of context-rich syntactic translation models. In ACL.P. Koehn. 2004. Pharaoh: A beam search decoder for phrase- based statistical machine translation models. In AMTA.R....
  • 8
  • 404
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A TAG-based noisy channel model of speech repairs" pdf

Báo cáo khoa học

... togetherwith a bigram language model. Then each ofthese analysis is rescored using the TAG chan-nel model and a syntactic parser based language model. The TAG channel model s analysis do not ... reparandum and interregnum words thanthe classifier proposed in Charniak and Johnson(2001). Replacing the bigram language model with a trigram model helps slightly, and parser- based language model ... sentence, and we do this byreranking the initial analysis, replacing the bi-gram language model with a syntactic parser based model. We do not need to intersect thisparser based language model...
  • 8
  • 444
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Constituent-Based Morphological Parsing: A New Approach to the Problem of Word-Recognition" pdf

Báo cáo khoa học

... Ngarrka-ngku.ka marlu marna-kurra luwa.rnu ngarni.nja-kurra (man-ergative-aux kangaroo grass-obj shoot-past eat-infmitive-obj) 'The man is shooting the kangaroo while it is eating grass.' ... Vowel-Harmony The first rule indicates that a word consists of an optional prefix followed by a Vowel- Harmony-Domain; the second claims that a Vowel-Harmony-Domain is a string analyzable as a ... Sciences and Humanities Research Council of Canada. [1] Reduplication is a word formation process involving the repetition of a word or a part of a word. As an example, in Warlpiri there is a process...
  • 8
  • 522
  • 0
Measuring Word Recognition Using a Picture

Measuring Word Recognition Using a Picture

Tư liệu khác

... "Look at the picture (Pause). Look at the words around the picture (Pause.) Find the biggest word. What is that word? (Students will say, "Table.") A line has been drawn from that word ... from a systematic pattern of lining. • Have one example word that is familiar to all students and is bigger than the rest of the words. Difficulty of ItemsThe task will contain words of varying ... task type as one of a multiple of task types in a reading test. Teachers may also apply this task type to classroom exercises and homework assignments. Acknowledgments• The author thanks Hyesug...
  • 3
  • 322
  • 1
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Ranking-based Approach to Word Reordering for Statistical Machine Translation" doc

Báo cáo khoa học

... Translation: Syntactically InformedPhrasal SMT. In Proc. ACL, pages 271-279. A. Ramanathan, Pushpak Bhattacharyya, JayprasadHegde, Ritesh M. Shah and Sasikumar M. 2008.Simple syntactic and ... source lan-guage based on both lexical and syntacticalfeatures. We evaluated our approach on large-scale Japanese-English and English-Japanesemachine translation tasks, and show that it cansignificantly ... MachineTranslation. Ph.D. Thesis.Karthik Visweswariah, Jiri Navratil, Jeffrey Sorensen,Vijil Chenthamarakshan and Nandakishore Kamb-hatla. 2010. Syntax Based Reordering with Automat-ically Derived...
  • 9
  • 615
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "Smoothing a Tera-word Language Model" doc

Báo cáo khoa học

... and Linda C. Bauman Peto. 1995. A hierarchical Dirichlet language model. Natural Lan-guage Engineering, 1(3):1–19.Y.W. Teh. 2006. A hierarchical Bayesian language model based on Pitman-Yor processes. ... n-grams:C(ab) − C(ab∗). A( ab) = max(1, K(C(ab) − C(ab∗))) A different K constant is chosen for each n-gramorder. Using this formulation as an interpolated 5-gram language model gives a cross ... interpolated form is:α(c|ab) =max(0, C(abc) − D)C(ab∗)(4)γ(ab) =N(ab∗)DC(ab∗)The ∗ represents a wildcard matching any word andC(ab∗) is the total count of n-grams that start withthe n − 1 words...
  • 4
  • 425
  • 1
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "A Phrase-based Statistical Model for SMS Text Normalization" ppt

Báo cáo khoa học

... SMS normalization. 2.3 SMS Normalization versus Text Para-phrasing Problem Others may regard SMS normalization as a para-phrasing problem. Broadly speaking, paraphrases capture core aspects ... a consensus translation technique to bootstrap parallel data using off-the-shelf translation sys-tems for training a hierarchical statistical transla-tion model for general domain instant ... minimal adaptation. One advantage of this pre-translation normalization is that the di-versity in different user groups and domains can be modeled separately without accessing and adapting...
  • 8
  • 399
  • 0
Tài liệu Báo cáo khoa học:

Tài liệu Báo cáo khoa học: "An Alignment Algorithm using Belief Propagation and a Structure-Based Distortion Model" pdf

Báo cáo khoa học

... sentencepairs extracted from pre-aligned data(Utiyama andIsahara, 2003) as a gold standard. We segmentedall the Japanese data with the automatic segmenterJuman (Kurohashi and Nagao, 1994). ... Related WorkAutomatic word alignment of parallel corpora isan important step for data-oriented Machine trans-lation (whether Statistical or Example -Based) aswell as for automatic lexicon acquisition. ... Rada Mihalcea andTed Pedersen, editors, HLT-NAACL 2003 Workshop:Building and Using Parallel Texts: Data Driven Ma-chine Translation and Beyond, pages 1–10, Edmon-ton, Alberta, Canada, May...
  • 9
  • 455
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Class-Based Agreement Model for Generating Accurately Inflected Translations" pptx

Báo cáo khoa học

... exponential translation model for targetlanguage morphology. In ACL-HLT.C. Tillmann. 2004. A unigram orientation model for statisticalmachine translation. In NAACL.K. Toutanova, H. Suzuki, and A. ... similar agreement phenom-ena as probabilistic sequences.Factored Translation ModelsFactored transla-tion models (Koehn and Hoang, 2007) facilitate a more data-oriented approach to agreement modeling.Words ... modeling.Words are represented as a vector of features such aslemma and POS. The bitext is annotated with separatemodels, and the annotations are saved during phraseextraction. Hassan et al. (2007)...
  • 10
  • 414
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Stacked Sub-Word Model for Joint Chinese Word Segmentation and Part-of-Speech Tagging" potx

Báo cáo khoa học

... systems (Ngand Low, 2004; Jiang et al., 200 8a; Zhang and Clark,2008).2.2 Character -Based and Word -Based MethodsTwo kinds of approaches are popular for joint word segmentation and POS tagging. ... information for each character.Each character can be assigned one of two possi-ble boundary tags: “B” for a character that begins a word and “I” for a character that occurs in the mid-dle of a word. ... applicable.Zhang et al. (2006) described a sub -word based tagging model to resolve word segmentation. Toget the pieces which are larger than characters butsmaller than words, they combine a character -based segmenter...
  • 10
  • 412
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Applying a Grammar-based Language Model to a Simplified Broadcast-News Transcription Task" ppt

Báo cáo khoa học

... linguistically motivated grammar (a hand-crafted Head-driven Phrase StructureGrammar) and a statistical model estimatingthe probability of a parse tree. The language model is applied by means of an N-best ... create an artificial recognition taskwith manageable complexity. Our primary aim wasto design a task which allows us to investigate theproperties of our grammar -based approach and tocompare ... grammar accept arbitrarysequences of words and phrases. To keep the gram-mar restrictive, such sequences are penalized by thestatistical model. Accurate hand-crafted grammars have been ap-plied...
  • 8
  • 385
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "A Plan Recognition Model for Clarification Subdialogues" ppt

Báo cáo khoa học

... the plan. The parameters of a plan are the parameters in the header. Associated with each plan is a set of constraints, which are assertions about the plan and its terms and parameters. ... wharf") or a pop. The pop allows a metaplan to the stacked SEEK-ID- PARAMETER of PLAN2 ("What's a gate?") or a pop, which allows a metaplan to the original domain plan ... constructed an entire plan stack based on the original domain-specific expectations to BOARD or MEET a train. Recall that in parallel with the above, communicative analysis is also taking place....
  • 10
  • 275
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "The Best of Both Worlds – A Graph-based Completion Model for Transition-based Parsers" pot

Báo cáo khoa học

... a suitable amount of training data, the model can thus learn to make the correct deci-sion. The dynamic-programming based graph- based parser is designed in such a way that anyscore calculation ... inthe beam are recalculated based on a scoring model inspired by the graph -based parsing ap-proach, i.e., taking complete factors into accountas they become incrementally available. As a con-sequence ... compareaspects of transition -based and graph -based pars-ing, and end up using a transition -based parserwith a combined transition -based/ second-ordergraph -based scoring model (Zhang and Clark,2008, 567),...
  • 11
  • 353
  • 0
Báo cáo khoa học:

Báo cáo khoa học: "Multiple Interpreters in a Principle-Based Model of Sentence Processing" potx

Báo cáo khoa học

... (such as the syntactic and semantic pro- cessors), apply maximally to any input, thereby con- structing a maximal, partial interpretation for a given partial input signal. This entails that each ... on locally identifiable links of a Chain: (5) In an argument (NP) Chain, i) <C-Node- A- co C-NodeA> case-mark(C-Nodea) or, ii) C-NodeA - head(Chain) * ease-mark(C-Nodea) In an argument ... to it as a 'unit' Chain, representing an unmoved element. We noted above that each representation's schema provides a natural locality constraint. That is, we should be able...
  • 6
  • 364
  • 0

Xem thêm