An alternative <term> index </term> could be the activity such as discussing , planning , informing , story-telling , etc . This paper addresses the problem of the <term> automatic detection </term> of those activities in meeting situation and everyday rejoinders .
In this presentation , we describe the features of and <term> requirements </term> for a genuinely useful <term> software infrastructure </term> for this purpose .
In this paper we show how two standard outputs from <term> information extraction ( IE ) systems </term> - <term> named entity annotations </term> and <term> scenario templates </term> - can be used to enhance access to <term> text collections </term> via a standard <term> text browser </term> .
The purpose of this research is to test the efficacy of applying <term> automated evaluation techniques </term> , originally devised for the <term> evaluation </term> of <term> human language learners </term> , to the <term> output </term> of <term> machine translation ( MT ) systems </term> .
We have built and will demonstrate an application of this approach called <term> LCS-Marine </term> .
In this paper , we address the problem of combining several <term> language models ( LMs ) </term> .
We describe our use of this approach in numerous fielded <term> user studies </term> conducted with the U.S. military .
We describe our use of this approach in numerous fielded <term> user studies </term> conducted with the U.S. military . This paper proposes a practical approach employing <term> n-gram models </term> and <term> error-correction rules </term> for <term> Thai key prediction </term> and <term> Thai-English language identification </term> .
In this paper , we present <term> SPoT </term> , a <term> sentence planner </term> , and a new methodology for automatically training <term> SPoT </term> on the basis of <term> feedback </term> provided by <term> human judges </term> .
In this paper , we compare the relative effects of <term> segment order </term> , <term> segmentation </term> and <term> segment contiguity </term> on the <term> retrieval performance </term> of a <term> translation memory system </term> .
In this paper , we study a <term> parsing technique </term> whose purpose is to improve the practical efficiency of <term> RCL parsers </term> .
Our approach yields <term> phrasal and single word lexical paraphrases </term> as well as <term> syntactic paraphrases </term> . This paper presents a <term> formal analysis </term> for a large class of <term> words </term> called <term> alternative markers </term> , which includes <term> other ( than ) </term> , <term> such ( as ) </term> , and <term> besides </term> .
In this paper We experimentally evaluate a <term> trainable sentence planner </term> for a <term> spoken dialogue system </term> by eliciting <term> subjective human judgments </term> .
We report on different aspects of the <term> predictive performance </term> of our <term> models </term> , including the influence of various <term> training and testing factors </term> on <term> predictive performance </term> , and examine the relationships among the target variables . This paper describes a method for <term> utterance classification </term> that does not require <term> manual transcription </term> of <term> training data </term> .
In this paper we present <term> ONTOSCORE </term> , a system for scoring sets of <term> concepts </term> on the basis of an <term> ontology </term> .
In this paper , we introduce a <term> generative probabilistic optical character recognition ( OCR ) model </term> that describes an end-to-end process in the <term> noisy channel framework </term> , progressing from generation of <term> true text </term> through its transformation into the <term> noisy output </term> of an <term> OCR system </term> .
In this paper , we show how <term> training data </term> can be supplemented with <term> text </term> from the <term> web </term> filtered to match the <term> style </term> and/or <term> topic </term> of the target <term> recognition task </term> , but also that it is possible to get bigger performance gains from the <term> data </term> by using <term> class-dependent interpolation </term> of <term> N-grams </term> .
We evaluate the utility of this <term> constraint </term> in two different <term> algorithms </term> .
A novel <term> bootstrapping approach </term> to <term> Named Entity ( NE ) tagging </term> using <term> concept-based seeds </term> and <term> successive learners </term> is presented . This approach only requires a few <term> common noun </term> or <term> pronoun </term><term> seeds </term> that correspond to the <term> concept </term> for the targeted <term> NE </term> , e.g. he/she/man / woman for <term> PERSON NE </term> .
In this paper , we describe a <term> phrase-based unigram model </term> for <term> statistical machine translation </term> that uses a much simpler set of <term> model parameters </term> than similar <term> phrase-based models </term> .
hide detail