ACL RD-TEC 1.0 Summarization of H92-1075
Paper Title:
COLLECTION AND ANALYSES OF WSJ-CSR DATA AT MIT
Authors: Michael Phillips, James Glass, Joseph Polifroni, and Victor Zue
Primarily assigned technology terms:
- atis
- cd-rom
- computer science
- computing
- continuous speech recognition
- data capture
- data collection
- hardware
- interface environment
- large vocabulary continuous speech recognition
- large vocabulary speech recognition
- monitoring
- preprocessing
- reading
- recognition
- recognition systems
- recognition technology
- resource management
- sampling
- speaker adaptation
- speech recognition
- speech recognition technology
- spoken language system
- spoken language systems
- system development and evaluation
- text preprocessing
- transcription
- truncation
- user interface
- word deletion
Other assigned terms:
- abbreviation
- abbreviations
- ambiguity
- american english
- atis corpora
- break
- case
- community
- continuous speech
- corpora
- data collection initiative
- data set
- denominations
- disk
- distribution
- document
- duration
- error rate
- fact
- french
- histogram
- hypothesis
- language model
- large speech corpora
- large vocabulary speech
- measures
- nist
- noise
- orthographic transcription
- paragraph
- performance evaluation
- perplexity
- procedure
- process
- punctuation
- research and development
- sentence
- sentence punctuation
- sentences
- server
- set size
- signal
- signal-to-noise ratio
- speaking rate
- speech corpora
- speech corpus
- speech data
- spoken language
- standard deviation
- statistics
- system development
- technology
- term
- text
- timit corpus
- tokens
- training
- training set
- transcriptions
- understanding
- user
- utterance
- vocabulary
- word
- word strings
- word-pair language model
- words