|
<term>
Emotions
</term>
and other
<term>
indices
</term>
such as the
<term>
dominance distribution of speakers
</term>
might be available on the
<term>
surface
</term>
and could be
used
directly .
|
#183
Emotions and other indices such as the dominance distribution of speakers might be available on the surface and could be used directly. |
|
Despite the small size of the
<term>
databases
</term>
used
some results about the effectiveness of these
<term>
indices
</term>
can be obtained .
|
#193
Despite the small size of the databasesused some results about the effectiveness of these indices can be obtained. |
|
In this paper we show how two standard outputs from
<term>
information extraction ( IE ) systems
</term>
-
<term>
named entity annotations
</term>
and
<term>
scenario templates
</term>
- can be
used
to enhance access to
<term>
text collections
</term>
via a standard
<term>
text browser
</term>
.
|
#300
In this paper we show how two standard outputs from information extraction (IE) systems - named entity annotations and scenario templates - can be used to enhance access to text collections via a standard text browser. |
|
We describe how this information is
used
in a
<term>
prototype system
</term>
designed to support
<term>
information workers
</term>
' access to a
<term>
pharmaceutical news archive
</term>
as part of their
<term>
industry watch
</term>
function .
|
#319
We describe how this information is used in a prototype system designed to support information workers' access to a pharmaceutical news archive as part of their industry watch function. |
|
The theoretical study of the
<term>
range concatenation grammar [ RCG ] formalism
</term>
has revealed many attractive properties which may be
used
in
<term>
NLP
</term>
.
|
#1617
The theoretical study of the range concatenation grammar [RCG] formalism has revealed many attractive properties which may be used in NLP. |
|
In our method ,
<term>
unsupervised training
</term>
is first
used
to train a
<term>
phone n-gram model
</term>
for a particular
<term>
domain
</term>
; the
<term>
output
</term>
of
<term>
recognition
</term>
with this
<term>
model
</term>
is then passed to a
<term>
phone-string classifier
</term>
.
|
#2263
In our method, unsupervised training is first used to train a phone n-gram model for a particular domain; the output of recognition with this model is then passed to a phone-string classifier. |
|
First ,
<term>
decision list
</term>
is
used
to learn the
<term>
parsing-based NE rules
</term>
.
|
#3350
First, decision list is used to learn the parsing-based NE rules. |
|
<term>
Monolingual , unannotated text
</term>
can be
used
to further improve the
<term>
stemmer
</term>
by allowing it to adapt to a desired
<term>
domain
</term>
or
<term>
genre
</term>
.
|
#4492
Monolingual, unannotated text can be used to further improve the stemmer by allowing it to adapt to a desired domain or genre. |
|
We believe this is a state-of-the-art performance and the
<term>
algorithm
</term>
can be
used
for many
<term>
highly inflected languages
</term>
provided that one can create a small
<term>
manually segmented corpus
</term>
of the
<term>
language
</term>
of interest .
|
#4777
We believe this is a state-of-the-art performance and the algorithm can be used for many highly inflected languages provided that one can create a small manually segmented corpus of the language of interest. |
|
In this paper , we evaluate an approach to automatically acquire
<term>
sense-tagged training data
</term>
from
<term>
English-Chinese parallel corpora
</term>
, which are then
used
for disambiguating the
<term>
nouns
</term>
in the
<term>
SENSEVAL-2 English lexical sample task
</term>
.
|
#4843
In this paper, we evaluate an approach to automatically acquire sense-tagged training data from English-Chinese parallel corpora, which are then used for disambiguating the nouns in the SENSEVAL-2 English lexical sample task. |
|
We show that various
<term>
features
</term>
based on the structure of
<term>
email-threads
</term>
can be
used
to improve upon
<term>
lexical similarity
</term>
of
<term>
discourse segments
</term>
for
<term>
question-answer pairing
</term>
.
|
#6295
We show that various features based on the structure of email-threads can be used to improve upon lexical similarity of discourse segments for question-answer pairing. |
|
The same system
used
in a
<term>
validation mode
</term>
, can be used to check and spot
<term>
alignment errors
</term>
in
<term>
multilingually aligned wordnets
</term>
as
<term>
BalkaNet
</term>
and
<term>
EuroWordNet
</term>
.
|
#6519
The same system used in a validation mode, can be used to check and spot alignment errors in multilingually aligned wordnets as BalkaNet and EuroWordNet. |
|
The same system used in a
<term>
validation mode
</term>
, can be
used
to check and spot
<term>
alignment errors
</term>
in
<term>
multilingually aligned wordnets
</term>
as
<term>
BalkaNet
</term>
and
<term>
EuroWordNet
</term>
.
|
#6527
The same system used in a validation mode, can be used to check and spot alignment errors in multilingually aligned wordnets as BalkaNet and EuroWordNet. |
|
Our results show that
<term>
MBR decoding
</term>
can be
used
to tune
<term>
statistical MT
</term>
performance for specific
<term>
loss functions
</term>
.
|
#6634
Our results show that MBR decoding can be used to tune statistical MT performance for specific loss functions. |
|
The
<term>
probabilistic model
</term>
used
in the
<term>
alignment
</term>
directly models the
<term>
link decisions
</term>
.
|
#7297
The probabilistic modelused in the alignment directly models the link decisions. |
|
Two
<term>
hardness
</term>
results for the class
<term>
NP
</term>
are reported , along with an
<term>
exponential time lower-bound
</term>
for certain classes of
<term>
algorithms
</term>
that are currently
used
in the literature .
|
#7500
Two hardness results for the class NP are reported, along with an exponential time lower-bound for certain classes of algorithms that are currently used in the literature. |
|
We incorporate this analysis into a
<term>
diagnostic tool
</term>
intended for
<term>
developers
</term>
of
<term>
machine translation systems
</term>
, and demonstrate how our application can be
used
by
<term>
developers
</term>
to explore
<term>
patterns
</term>
in
<term>
machine translation output
</term>
.
|
#7667
We incorporate this analysis into a diagnostic tool intended for developers of machine translation systems, and demonstrate how our application can be used by developers to explore patterns in machine translation output. |
|
Yet , they are scarcely
used
for the assessment of
<term>
language pairs
</term>
like
<term>
English-Chinese
</term>
or
<term>
English-Japanese
</term>
, because of the
<term>
word segmentation problem
</term>
.
|
#7705
Yet, they are scarcely used for the assessment of language pairs like English-Chinese or English-Japanese, because of the word segmentation problem. |
|
<term>
STTK
</term>
, a
<term>
statistical machine translation tool kit
</term>
, will be introduced and
used
to build a working
<term>
translation system
</term>
.
|
#8133
STTK, a statistical machine translation tool kit, will be introduced and used to build a working translation system. |
|
<term>
STTK
</term>
has been developed by the presenter and co-workers over a number of years and is currently
used
as the basis of
<term>
CMU 's SMT system
</term>
.
|
#8158
STTK has been developed by the presenter and co-workers over a number of years and is currently used as the basis of CMU's SMT system. |