ACL RD-TEC 1.0 Summarization of W04-2902

Paper Title:
ANALYSIS AND PROCESSING OF LECTURE AUDIO DATA: PRELIMINARY INVESTIGATIONS

Authors: James Glass, Timothy J. Hazen, Lee Hetherington, and Chao Wang

Other assigned terms:

  • annotated corpus
  • annotation
  • broadcast news
  • case
  • community
  • content words
  • continuous speech
  • conversation
  • conversational material
  • corpora
  • discourse
  • document
  • error rate
  • general vocabulary
  • generation
  • knowledge
  • language model
  • language models
  • language processing research
  • language usage
  • large corpus
  • linear algebra
  • measures
  • method
  • natural language
  • pauses
  • perplexity
  • process
  • punctuation
  • qualitative analysis
  • retrieval task
  • sentence
  • sentence boundaries
  • sentence level
  • signal
  • speech corpora
  • speech data
  • speech signal
  • spontaneous speech corpora
  • standard deviation
  • style
  • technology
  • terms
  • test data
  • test set
  • text
  • textbook
  • toolkit
  • topics
  • training
  • training data
  • training material
  • transcript
  • transcriptions
  • transcripts
  • trigram
  • trigram language model
  • vocabulary
  • vocabulary size
  • wide-band speech
  • word
  • word error rate
  • word error rates
  • word usage
  • words

Extracted Section Types:


This page last edited on 10 May 2017.
