... the AFNLP, pages 297–305, Suntec, Singapore, 2-7 August 2009. ©2009 ACL and AFNLP

Robust Machine Translation Evaluation with Entailment Features∗

Sebastian Padó
Stuttgart University
pado@ims.uni-stuttgart.de

Michel ... University
{mgalley,jurafsky,manning}@stanford.edu

Abstract

Existing evaluation metrics for machine translation lack crucial robustness: their correlations with human quality judgments vary ... main reason is their inability to properly capture meaning: A good translation candidate means the same thing as the reference translation, regardless of formulation. We propose a metric that evaluates...