ACL RD-TEC 1.0 Summarization of H92-1075

Paper Title:
COLLECTION AND ANALYSES OF WSJ-CSR DATA AT MIT

Authors: Michael Phillips and James Glass and Joseph Polifroni and Victor Zue

Other assigned terms:

  • abbreviation
  • abbreviations
  • ambiguity
  • american english
  • atis corpora
  • break
  • case
  • community
  • continuous speech
  • corpora
  • data collection initiative
  • data set
  • denominations
  • disk
  • distribution
  • document
  • duration
  • error rate
  • fact
  • french
  • histogram
  • hypothesis
  • language model
  • large speech corpora
  • large vocabulary speech
  • measures
  • nist
  • noise
  • orthographic transcription
  • paragraph
  • performance evaluation
  • perplexity
  • procedure
  • process
  • punctuation
  • research and development
  • sentence
  • sentence punctuation
  • sentences
  • server
  • set size
  • signal
  • signal-to-noise ratio
  • speaking rate
  • speech corpora
  • speech corpus
  • speech data
  • spoken language
  • standard deviation
  • statistics
  • system development
  • technology
  • term
  • text
  • timit corpus
  • tokens
  • training
  • training set
  • transcriptions
  • understanding
  • user
  • utterance
  • vocabulary
  • word
  • word strings
  • word-pair language model
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***