ACL RD-TEC 1.0 Summarization of H92-1003
Paper Title:
MULTI-SITE DATA COLLECTION FOR A SPOKEN LANGUAGE CORPUS
Primarily assigned technology terms:
- anonymous ftp
- atis
- atis system
- automatic speech recognition
- categorization
- cd-rom
- cd-rom production
- classification
- computing
- data collection
- data exchange
- data validation
- database
- database access
- error correction
- hardware
- human-computer interface
- information system
- language understanding
- natural language understanding
- objective evaluation
- problem solving
- processing
- recognition
- recognition system
- scoring
- speech recognition
- speech recognition component
- speech recognition system
- spoken language system
- spoken language systems
- spoken language understanding
- transcription
- truncation
- validation
Other assigned terms:
- abbreviations
- anaphoric expression
- annotation
- annotator
- annotators
- atis benchmark
- atis corpora
- atis data collection
- benchmark
- break
- canned corpus
- clarification dialogue
- community
- corpora
- data flow
- database query
- distribution
- document
- electronic mail
- evaluation methodology
- evaluation paradigm
- events
- interpretation
- language corpus
- language use
- lexicon
- maximal answer
- meaning
- measure
- message
- method
- methodology
- natural language
- nist
- paraphrase
- procedure
- process
- quality control
- queries
- query
- questionnaire
- recognition component
- recognition errors
- reference answer
- sentence
- sentences
- speech data
- speech input
- spoken language
- spoken language corpus
- sql query
- synthesized speech
- system response
- system-initiated clarification
- technology
- terms
- test corpus
- test data
- test material
- training
- training and test data
- training data
- training material
- transcribed input
- transcriptions
- understanding
- understanding component
- user
- utterance
- words