#551The purpose of this research is to test the efficacy of applying automated evaluation techniques, originally devised for the evaluation of human language learners, to the output of machine translation (MT) systems.
assessors
</term>
made their decisions . We
tested
this to see if similar criteria could be
#663We tested this to see if similar criteria could be elicited from duplicating the experiment using machine translation output.
retrieval accuracy
</term>
superior to any of the
tested
<term>
word N-gram models
</term>
. Further
#1554Over two distinct datasets, we find that indexing according to simple character bigrams produces a retrieval accuracy superior to any of the tested word N-gram models.
lr,15-6-P03-1051,ak
<term>
exact match accuracy
</term>
on a
<term>
test
corpus
</term>
containing 28,449
<term>
word
#4760The resulting Arabic word segmentation system achieves around 97% exact match accuracy on a test corpus containing 28,449 word tokens.
measure(ment),15-4-H05-1012,ak
improvement on several
<term>
machine translation
tests
</term>
. Performance of the
<term>
algorithm
#5350Significant improvement over traditional word alignment techniques is shown as well as improvement on several machine translation tests.
other,9-2-H05-1032,ak
Bayesian summarizers
</term>
, using
<term>
test
data
</term>
from
<term>
Japanese news texts
#5398Comparison is made against non Bayesian summarizers, using test data from Japanese news texts.
outputs . We present the first known empirical
test
of an increasingly common speculative claim
#6335We present the first known empirical test of an increasingly common speculative claim, by evaluating a representative Chinese-to-English SMT model directly on word sense disambiguation performance, using standard WSD evaluation methodology and datasets from the Senseval-3 Chinese lexical sample task.
</term>
for 30
<term>
SCF types
</term>
which
tests
for the presence of
<term>
grammatical relations
#10301The system incorporates a decision-tree classifier for 30 SCF types which tests for the presence of grammatical relations (GRs) in the output of a robust statistical parser.
other,22-5-P05-1076,ak
the process of obtaining
<term>
training and
test
data
</term>
for
<term>
subcategorization acquisition
#10386A new tool for linguistic annotation of SCFs in corpus data is also introduced which can considerably alleviate the process of obtaining training and test data for subcategorization acquisition.
other,31-2-P05-2008,ak
good match between the
<term>
training and
test
data
</term>
with respect to
<term>
topic
</term>
#10451Traditional machine learning techniques have been applied to this problem with reasonable success, but they have been shown to work well only when there is a good match between the training and test data with respect to topic.
vocabulary
</term>
for an English certification
test
as the
<term>
target vocabulary
</term>
and
#10804We used a specialized vocabulary for an English certification test as the target vocabulary and used English Wikipedia, a free-content encyclopedia, as the target corpus.
algorithms
</term>
have been built and empirically
tested
whereas little is known about the
<term>
#10908Over the last decade, a variety of SMT algorithms have been built and empirically tested whereas little is known about the computational complexity of some of the fundamental problems of SMT.
representational format
</term>
for
<term>
meaning
</term>
is
tested
as broadly as possible . In this format
#12776A research program is described in which a particular representational format for meaning is tested as broadly as possible.
represented as a
<term>
procedure
</term>
which
tests
, scores and aggregates the
<term>
elastic
#15363In this approach to semantics, the meaning of a proposition, p, is represented as a procedure which tests, scores and aggregates the elastic constraints which are induced by p.
voice interactive system
</term>
. A series of
tests
are described that show the power of the
#16246A series of tests are described that show the power of the error correction methodology when stereotypic dialogue occurs.
lr,30-3-H89-2019,ak
to be included in and excluded from
<term>
test
corpora
</term>
, and the procedures used
#19942The Common Answer Specification determines the syntax of answer expressions, the minimal content that must be included in them, the data to be included in and excluded from test corpora, and the procedures used by the Comparator.
other,22-4-H90-1060,ak
a standard
<term>
grammar
</term>
and
<term>
test
set
</term>
from the
<term>
DARPA Resource
#21151With only 12 training speakers for SI recognition, we achieved a 7.5% word error rate on a standard grammar and test set from the DARPA Resource Management corpus.
other,10-5-H90-1060,ak
comparable to our best condition for this
<term>
test
suite
</term>
, using 109
<term>
training speakers
#21170This performance is comparable to our best condition for this test suite, using 109 training speakers.
<term>
semiphone modeling
</term>
have been
tested
. A very simple improved
<term>
duration
#21738Some new variations in semiphone modeling have been tested.
new
<term>
training strategy
</term>
has been
tested
which , by itself , did not provide useful
#21768A new training strategy has been tested which, by itself, did not provide useful improvements but suggests that improvements can be obtained by a related rapid adaptation technique.