ACL RD-TEC 1.0 Summarization of W04-3242

Paper Title:
RANDOM FORESTS IN LANGUAGE MODELING

Authors: Peng Xu and Frederick Jelinek

Other assigned terms:

  • approach
  • backoff
  • case
  • cluster
  • clusters
  • cross entropy
  • data sparseness
  • data sparseness problem
  • data structure
  • dimensionality
  • distribution
  • entropy
  • error rate
  • estimation
  • events
  • experimental results
  • fact
  • forest
  • hypothesis
  • hypothesis space
  • interpolation
  • knowledge
  • language model
  • language model probability
  • language models
  • large vocabulary speech
  • lattices
  • leaf
  • likelihood
  • log-likelihood
  • measure
  • measures
  • method
  • methodology
  • model probability
  • named entity
  • natural language
  • natural speech
  • nist
  • noun phrase
  • perplexity
  • phrase
  • probabilities
  • probability
  • probability distribution
  • procedure
  • random sample
  • sentence
  • sentences
  • sparseness problem
  • statistics
  • sub-tree
  • syntactic information
  • technique
  • test data
  • test set
  • text
  • toolkit
  • training
  • training corpus
  • training data
  • tree
  • treebank
  • trees
  • trigram
  • trigram language model
  • trigram model
  • upenn treebank
  • utterance
  • vocabulary
  • vocabulary size
  • word
  • word error rate
  • word sequence
  • word string
  • words
  • wsj corpus

This page last edited on 10 May 2017.
