tech,5-1-H05-1117,bq |
Following recent developments in the
<term>
|
automatic
evaluation
|
</term>
of
<term>
machine translation
</term>
|
#7510
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, for automatically evaluating answers to definition questions. |
tech,8-1-H05-1117,bq |
<term>
automatic evaluation
</term>
of
<term>
|
machine
translation
|
</term>
and
<term>
document summarization
</term>
|
#7513
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, for automatically evaluating answers to definition questions. |
tech,11-1-H05-1117,bq |
<term>
machine translation
</term>
and
<term>
|
document
summarization
|
</term>
, we present a similar approach ,
|
#7516
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, for automatically evaluating answers to definition questions. |
measure(ment),25-1-H05-1117,bq |
, implemented in a measure called
<term>
|
POURPRE
|
</term>
, for
<term>
automatically evaluating
|
#7530
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE , for automatically evaluating answers to definition questions. |
measure(ment),28-1-H05-1117,bq |
measure called
<term>
POURPRE
</term>
, for
<term>
|
automatically
evaluating answers to definition questions
|
</term>
. Until now , the only way to assess
|
#7533
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, for automatically evaluating answers to definition questions. |
tech,4-3-H05-1117,bq |
response . The lack of automatic
<term>
|
methods
|
</term>
for
<term>
scoring system output
</term>
|
#7574
The lack of automatic methods for scoring system output is an impediment to progress in the field, which we address with this work. |
measure(ment),6-3-H05-1117,bq |
automatic
<term>
methods
</term>
for
<term>
|
scoring
system output
|
</term>
is an impediment to progress in the
|
#7576
The lack of automatic methods for scoring system output is an impediment to progress in the field, which we address with this work. |
other,3-4-H05-1117,bq |
this work . Experiments with the
<term>
|
TREC
2003 and TREC 2004 QA tracks
|
</term>
indicate that
<term>
rankings
</term>
|
#7598
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |
other,12-4-H05-1117,bq |
2004 QA tracks
</term>
indicate that
<term>
|
rankings
|
</term>
produced by our
<term>
metric
</term>
|
#7607
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |
measure(ment),16-4-H05-1117,bq |
<term>
rankings
</term>
produced by our
<term>
|
metric
|
</term>
correlate highly with
<term>
official
|
#7611
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |
measure(ment),20-4-H05-1117,bq |
metric
</term>
correlate highly with
<term>
|
official
rankings
|
</term>
, and that
<term>
POURPRE
</term>
outperforms
|
#7615
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |
measure(ment),25-4-H05-1117,bq |
official rankings
</term>
, and that
<term>
|
POURPRE
|
</term>
outperforms direct application of
|
#7620
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |
measure(ment),31-4-H05-1117,bq |
outperforms direct application of existing
<term>
|
metrics
|
</term>
. We describe a
<term>
method
</term>
|
#7626
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics . |