ACL RD-TEC 1.0 Summarization of W97-0127
Paper Title:
PROBABILISTIC WORD CLASSIFICATION BASED ON CONTEXT-SENSITIVE BINARY TREE METHOD
PROBABILISTIC WORD CLASSIFICATION BASED ON CONTEXT-SENSITIVE BINARY TREE METHOD
Authors: Jun Gao and XiXian Chen
Primarily assigned technology terms:
- agglomerative clustering
- algorithm
- binary splitting
- bottom-up merging
- c + +
- class-based language modeling
- class-based modeling
- classification
- classification method
- classification system
- clustering
- computational linguistics
- decomposition
- distributional classification
- grouping
- language modeling
- listing
- measuring
- metropolis algorithm
- modeling
- part-of-speech tagger
- probabilistic classification
- processing
- programming language
- recognition
- search
- searching
- simulated annealing
- singular value decomposition
- speech recognition
- splitting
- splitting method
- tagger
- tagging
- top-down splitting
- tree growing
- word classification
Other assigned terms:
- bigram
- binary tree
- boundary marker
- case
- chinese word
- chinese words
- cluster
- clusters
- co-occurrence
- concept
- concepts
- content words
- corpora
- data sparseness
- distribution
- entropy
- events
- information source
- information theory
- kullback-leibler distance
- language model
- language models
- linguistic
- linguistics
- measure
- measures
- method
- modeling language
- mutual information
- n-gram
- n-gram language model
- natural language
- news corpus
- nouns
- part-of-speech
- perplexity
- perplexity reduction
- phrase
- probabilities
- probability
- probability distribution
- procedure
- process
- relation
- sentence
- sentence boundary
- sentences
- similarity measures
- similarity metric
- similarity metrics
- statistical language model
- subcorpus
- symbols
- technique
- test set
- text
- theory
- training
- training corpus
- training data
- transitivity
- tree
- trees
- vocabulary
- vocabulary size
- word
- word association
- word classes
- word-based language model
- words