... Ourtranslation model is implemented as an N-gram model of operations using SRILM-Toolkit (Stolcke,2002) with Kneser-Ney smoothing. We use a 9-gram model (m = 8).Integrating the language model the ... maxEpLM(E)p(F, E)where pLM(E) is the monolingual language model and p(F, E) is the translation model. But our trans-lation model is a joint probability model, because ofwhich E is generated twice ... weight associated with the featurehj(F, E). Other than the 3 features discussed above(log probabilities of the operation model, monolin-gual language model and prior probability model) ,we train...