|
available on the
<term>
surface
</term>
and could be
|
used
|
directly . Despite the small size of the
|
#183
Emotions and other indices such as the dominance distribution of speakers might be available on the surface and could be used directly. |
|
small size of the
<term>
databases
</term>
|
used
|
some results about the effectiveness of
|
#193
Despite the small size of the databases used some results about the effectiveness of these indices can be obtained. |
|
and
<term>
scenario templates
</term>
- can be
|
used
|
to enhance access to
<term>
text collections
|
#300
In this paper we show how two standard outputs from information extraction (IE) systems - named entity annotations and scenario templates - can be used to enhance access to text collections via a standard text browser. |
|
</term>
. We describe how this information is
|
used
|
in a
<term>
prototype system
</term>
designed
|
#319
We describe how this information is used in a prototype system designed to support information workers' access to a pharmaceutical news archive as part of their industry watch function. |
|
many attractive properties which may be
|
used
|
in
<term>
NLP
</term>
. In particular ,
<term>
|
#1617
The theoretical study of the range concatenation grammar [RCG] formalism has revealed many attractive properties which may be used in NLP. |
|
<term>
unsupervised training
</term>
is first
|
used
|
to train a
<term>
phone n-gram model
</term>
|
#2263
In our method, unsupervised training is first used to train a phone n-gram model for a particular domain; the output of recognition with this model is then passed to a phone-string classifier. |
|
</term>
. First ,
<term>
decision list
</term>
is
|
used
|
to learn the
<term>
parsing-based NE rules
|
#3350
First, decision list is used to learn the parsing-based NE rules. |
|
Monolingual , unannotated text
</term>
can be
|
used
|
to further improve the
<term>
stemmer
</term>
|
#4492
Monolingual, unannotated text can be used to further improve the stemmer by allowing it to adapt to a desired domain or genre. |
|
performance and the
<term>
algorithm
</term>
can be
|
used
|
for many
<term>
highly inflected languages
|
#4777
We believe this is a state-of-the-art performance and the algorithm can be used for many highly inflected languages provided that one can create a small manually segmented corpus of the language of interest. |
|
parallel corpora
</term>
, which are then
|
used
|
for disambiguating the
<term>
nouns
</term>
|
#4843
In this paper, we evaluate an approach to automatically acquire sense-tagged training data from English-Chinese parallel corpora, which are then used for disambiguating the nouns in the SENSEVAL-2 English lexical sample task. |
|
structure of
<term>
email-threads
</term>
can be
|
used
|
to improve upon
<term>
lexical similarity
|
#6295
We show that various features based on the structure of email-threads can be used to improve upon lexical similarity of discourse segments for question-answer pairing. |
|
very encouraging results . The same system
|
used
|
in a
<term>
validation mode
</term>
, can be
|
#6519
The same system used in a validation mode, can be used to check and spot alignment errors in multilingually aligned wordnets as BalkaNet and EuroWordNet. |
|
in a
<term>
validation mode
</term>
, can be
|
used
|
to check and spot
<term>
alignment errors
|
#6527
The same system used in a validation mode, can be used to check and spot alignment errors in multilingually aligned wordnets as BalkaNet and EuroWordNet. |
|
show that
<term>
MBR decoding
</term>
can be
|
used
|
to tune
<term>
statistical MT
</term>
performance
|
#6634
Our results show that MBR decoding can be used to tune statistical MT performance for specific loss functions. |
|
</term>
. The
<term>
probabilistic model
</term>
|
used
|
in the
<term>
alignment
</term>
directly models
|
#7297
The probabilistic model used in the alignment directly models the link decisions. |
|
<term>
algorithms
</term>
that are currently
|
used
|
in the literature . Following recent developments
|
#7500
Two hardness results for the class NP are reported, along with an exponential time lower-bound for certain classes of algorithms that are currently used in the literature. |
|
and demonstrate how our application can be
|
used
|
by
<term>
developers
</term>
to explore
<term>
|
#7667
We incorporate this analysis into a diagnostic tool intended for developers of machine translation systems, and demonstrate how our application can be used by developers to explore patterns in machine translation output. |
|
well established . Yet , they are scarcely
|
used
|
for the assessment of
<term>
language pairs
|
#7705
Yet, they are scarcely used for the assessment of language pairs like English-Chinese or English-Japanese, because of the word segmentation problem. |
|
tool kit
</term>
, will be introduced and
|
used
|
to build a working
<term>
translation system
|
#8133
STTK, a statistical machine translation tool kit, will be introduced and used to build a working translation system. |
|
over a number of years and is currently
|
used
|
as the basis of
<term>
CMU 's SMT system
</term>
|
#8158
STTK has been developed by the presenter and co-workers over a number of years and is currently used as the basis of CMU's SMT system. |