Take a sneak peek at the new NIST.gov and let us know what you think!
(Please note: some content may not be complete on the beta site.).
NIST Authors in Bold
|Author(s):||Chris Callison-Burch; Philipp Koehn; Christof Monz; Kay Peterson; Mark A. Przybocki; Omar F. Zaidan;|
|Title:||Findings of the 2010 Joint Workshop on Statistical Machine Translation and Metrics for Machine Translation|
|Published:||July 20, 2010|
|Abstract:||This paper presents the results of the WMT10 and MetricsMATR10 shared tasks, which included a translation task, a system combination task, and an evaluation task. We conducted a large-scale manual evaluation of 104 machine translation systems and 41 system combination entries. We used the ranking of these systems to measure how strongly auto- matic metrics correlate with human judgments of translation quality for 26 metrics. This year we also investigated increasing the number of human judgments by hiring non-expert annotators through Amazon‰s Mechanical Turk.|
|Proceedings:||ACL 2010 Joint Fifth Workshop on Statistical Machine Translation and MetricsMaTr|
|Dates:||July 15-16, 2010|
|Keywords:||machine translation, mt, evaluation, metrology|
|Research Areas:||Information Technology, Software Testing Metrics|
|PDF version:||Click here to retrieve PDF version of paper (610KB)|