ACL RD-TEC 1.0 Summarization of P98-2239
Paper Title:
WORD ASSOCIATION AND MI TRIGGER-BASED LANGUAGE MODELING
WORD ASSOCIATION AND MI TRIGGER-BASED LANGUAGE MODELING
Authors: GuoDong Zhou and KimTeng Lua
Primarily assigned technology terms:
- algorithm
- automatic sentence disambiguation
- computing
- disambiguation
- language disambiguation
- language modeling
- language processing
- learning
- learning methods
- linear interpolation
- measuring
- mi-trigger-based modeling
- modeling
- natural language processing
- pinyin-to-character conversion
- processing
- recognition
- rule-based approach
- sentence disambiguation
- statistical method
- word bigram
Other assigned terms:
- adjective
- ambiguity
- ambiguity problem
- approach
- bigram
- bigram model
- characters
- chinese characters
- chinese language
- chinese words
- co-occurrence
- co-occurrence statistics
- collocation
- concept
- concepts
- conditional distribution
- conditional probability
- correlation
- dictionaries
- distribution
- document
- entropy
- events
- fact
- index
- information content
- interpolation
- joint probability
- knowledge
- language model
- large corpus
- lattice
- lexical ambiguity
- lexicon
- measure
- method
- modeling power
- mutual information
- n-gram
- n-gram models
- natural language
- perplexity
- pinyin
- probabilities
- probability
- probability estimate
- recognition rate
- semantic
- semantic preference
- sentence
- sentences
- statistical approach
- statistical data
- statistics
- term
- test data
- text
- training
- training corpus
- tree
- unigram
- unigram model
- window size
- word
- word association
- word bigram model
- word pair
- word sequences
- words
- xinhua corpus