measure(ment),20-4-H05-1117,bq |
Experiments with the
<term>
TREC 2003 and TREC 2004 QA tracks
</term>
indicate that
<term>
rankings
</term>
produced by our
<term>
metric
</term>
correlate highly with
<term>
official rankings
</term>
, and that
<term>
POURPRE
</term>
outperforms direct application of existing
<term>
metrics
</term>
.
|
#7615
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |
measure(ment),25-1-H05-1117,bq |
Following recent developments in the
<term>
automatic evaluation
</term>
of
<term>
machine translation
</term>
and
<term>
document summarization
</term>
, we present a similar approach , implemented in a measure called
<term>
POURPRE
</term>
, for
<term>
automatically evaluating answers to definition questions
</term>
.
|
#7530
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, for automatically evaluating answers to definition questions. |
tech,11-1-H05-1117,bq |
Following recent developments in the
<term>
automatic evaluation
</term>
of
<term>
machine translation
</term>
and
<term>
document summarization
</term>
, we present a similar approach , implemented in a measure called
<term>
POURPRE
</term>
, for
<term>
automatically evaluating answers to definition questions
</term>
.
|
#7516
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, for automatically evaluating answers to definition questions. |
measure(ment),28-1-H05-1117,bq |
Following recent developments in the
<term>
automatic evaluation
</term>
of
<term>
machine translation
</term>
and
<term>
document summarization
</term>
, we present a similar approach , implemented in a measure called
<term>
POURPRE
</term>
, for
<term>
automatically evaluating answers to definition questions
</term>
.
|
#7533
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, for automatically evaluating answers to definition questions. |
measure(ment),31-4-H05-1117,bq |
Experiments with the
<term>
TREC 2003 and TREC 2004 QA tracks
</term>
indicate that
<term>
rankings
</term>
produced by our
<term>
metric
</term>
correlate highly with
<term>
official rankings
</term>
, and that
<term>
POURPRE
</term>
outperforms direct application of existing
<term>
metrics
</term>
.
|
#7626
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |
tech,8-1-H05-1117,bq |
Following recent developments in the
<term>
automatic evaluation
</term>
of
<term>
machine translation
</term>
and
<term>
document summarization
</term>
, we present a similar approach , implemented in a measure called
<term>
POURPRE
</term>
, for
<term>
automatically evaluating answers to definition questions
</term>
.
|
#7513
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, for automatically evaluating answers to definition questions. |
measure(ment),16-4-H05-1117,bq |
Experiments with the
<term>
TREC 2003 and TREC 2004 QA tracks
</term>
indicate that
<term>
rankings
</term>
produced by our
<term>
metric
</term>
correlate highly with
<term>
official rankings
</term>
, and that
<term>
POURPRE
</term>
outperforms direct application of existing
<term>
metrics
</term>
.
|
#7611
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |
tech,4-3-H05-1117,bq |
The lack of automatic
<term>
methods
</term>
for
<term>
scoring system output
</term>
is an impediment to progress in the field , which we address with this work .
|
#7574
The lack of automatic methods for scoring system output is an impediment to progress in the field, which we address with this work. |
other,3-4-H05-1117,bq |
Experiments with the
<term>
TREC 2003 and TREC 2004 QA tracks
</term>
indicate that
<term>
rankings
</term>
produced by our
<term>
metric
</term>
correlate highly with
<term>
official rankings
</term>
, and that
<term>
POURPRE
</term>
outperforms direct application of existing
<term>
metrics
</term>
.
|
#7598
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |
measure(ment),6-3-H05-1117,bq |
The lack of automatic
<term>
methods
</term>
for
<term>
scoring system output
</term>
is an impediment to progress in the field , which we address with this work .
|
#7576
The lack of automatic methods for scoring system output is an impediment to progress in the field, which we address with this work. |
tech,5-1-H05-1117,bq |
Following recent developments in the
<term>
automatic evaluation
</term>
of
<term>
machine translation
</term>
and
<term>
document summarization
</term>
, we present a similar approach , implemented in a measure called
<term>
POURPRE
</term>
, for
<term>
automatically evaluating answers to definition questions
</term>
.
|
#7510
Following recent developments in the automatic evaluation of machine translation and document summarization, we present a similar approach, implemented in a measure called POURPRE, for automatically evaluating answers to definition questions. |
measure(ment),25-4-H05-1117,bq |
Experiments with the
<term>
TREC 2003 and TREC 2004 QA tracks
</term>
indicate that
<term>
rankings
</term>
produced by our
<term>
metric
</term>
correlate highly with
<term>
official rankings
</term>
, and that
<term>
POURPRE
</term>
outperforms direct application of existing
<term>
metrics
</term>
.
|
#7620
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |
other,12-4-H05-1117,bq |
Experiments with the
<term>
TREC 2003 and TREC 2004 QA tracks
</term>
indicate that
<term>
rankings
</term>
produced by our
<term>
metric
</term>
correlate highly with
<term>
official rankings
</term>
, and that
<term>
POURPRE
</term>
outperforms direct application of existing
<term>
metrics
</term>
.
|
#7607
Experiments with the TREC 2003 and TREC 2004 QA tracks indicate that rankings produced by our metric correlate highly with official rankings, and that POURPRE outperforms direct application of existing metrics. |