measure(ment),25-1-H05-1117,bq |
Following recent developments in the
<term>
automatic evaluation
</term>
of
<term>
machine translation
</term>
and
<term>
document summarization
</term>
, we present a similar approach , implemented in a measure called
<term>
POURPRE
</term>
, for
<term>
automatically evaluating answers to definition questions
</term>
.
|
#7530
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, for automatically evaluating answers to definition questions. |
measure(ment),25-4-H05-1117,bq |
Experiments with the
<term>
TREC 2003 and TREC 2004 QA tracks
</term>
indicate that
<term>
rankings
</term>
produced by our
<term>
metric
</term>
correlate highly with
<term>
official rankings
</term>
, and that
<term>
POURPRE
</term>
outperforms direct application of existing
<term>
metrics
</term>
.
|
#7620
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |