ACL RD-TEC 1.0 Summarization of P00-1073

Paper Title:
DISTRIBUTION-BASED PRUNING OF BACKOFF LANGUAGE MODELS

Authors: Jianfeng Gao and Kai-Fu Lee

Other assigned terms:

  • approach
  • backoff
  • backoff model
  • bias
  • bigram
  • bigram model
  • case
  • characters
  • cluster
  • clusters
  • conditional probabilities
  • content words
  • data sparseness
  • distribution
  • document
  • document frequency
  • entropy
  • estimation
  • experimental results
  • fact
  • finite set
  • geometric mean
  • implementation
  • inverse document frequency
  • language model
  • language models
  • large corpus
  • large training
  • likelihood
  • measure
  • method
  • mood
  • n-gram
  • n-gram language model
  • n-gram models
  • n-grams
  • normalization factor
  • perplexity
  • perplexity reduction
  • pinyin
  • poisson distribution
  • probabilistic model
  • probabilities
  • probability
  • probability estimate
  • spoken language
  • style
  • term
  • term distribution
  • testing data
  • text
  • training
  • training corpus
  • training data
  • training set
  • training text
  • trigram
  • understanding
  • unigram
  • word
  • word pair
  • word perplexity
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***