ACL RD-TEC 1.0 Summarization of W97-0127

Paper Title:
PROBABILISTIC WORD CLASSIFICATION BASED ON CONTEXT-SENSITIVE BINARY TREE METHOD

Authors: Jun Gao and XiXian Chen

Other assigned terms:

  • bigram
  • binary tree
  • boundary marker
  • case
  • chinese word
  • chinese words
  • cluster
  • clusters
  • co-occurrence
  • concept
  • concepts
  • content words
  • corpora
  • data sparseness
  • distribution
  • entropy
  • events
  • information source
  • information theory
  • kullback-leibler distance
  • language model
  • language models
  • linguistic
  • linguistics
  • measure
  • measures
  • method
  • modeling language
  • mutual information
  • n-gram
  • n-gram language model
  • natural language
  • news corpus
  • nouns
  • part-of-speech
  • perplexity
  • perplexity reduction
  • phrase
  • probabilities
  • probability
  • probability distribution
  • procedure
  • process
  • relation
  • sentence
  • sentence boundary
  • sentences
  • similarity measures
  • similarity metric
  • similarity metrics
  • statistical language model
  • subcorpus
  • symbols
  • technique
  • test set
  • text
  • theory
  • training
  • training corpus
  • training data
  • transitivity
  • tree
  • trees
  • vocabulary
  • vocabulary size
  • word
  • word association
  • word classes
  • word-based language model
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***