... correlations with human judgments than when higher-ordered n-grams are included.

SysID   Human-assessment score
MT1     0.661
MT2     0.626
MT3     0.586
MT4     0.578
MT5     0.537
MT6     0.530
MT7     0.530
MT8     0.375
MT9     0.332
MT1 0 ...

... the pseudo references, the quality of MT systems being evaluated, and the diversity over the distribution of training examples).

Specifically, we reserved four systems (MT2, MT5, MT6, and MT9) for ...

296–303, Prague, Czech Republic, June 2007. ©2007 Association for Computational Linguistics

Regression for Sentence-Level MT Evaluation with Pseudo References
Joshua S. Albrecht and Rebecca Hwa
Department...