ACL RD-TEC 1.0 Summarization of W06-0138
Paper Title:
USING PART-OF-SPEECH RERANKING TO IMPROVE CHINESE WORD SEGMENTATION
USING PART-OF-SPEECH RERANKING TO IMPROVE CHINESE WORD SEGMENTATION
Authors: Mengqiu Wang and Yanxin Shi
Primarily assigned technology terms:
- algorithm
- chinese language processing
- chinese segmentation
- chinese word segmentation
- computational linguistics
- conditional random fields
- crfs
- cross validation
- cross-validation
- decoding
- disambiguation
- joint decoding
- language processing
- learning
- list reranking
- markov random fields
- n-best list reranking
- normalization
- np-chunking
- pos tagging
- processing
- reranking
- search
- searching
- segmentation
- segmentation and pos tagging
- segmenter
- tagger
- tagging
- validation
- viterbi
- viterbi algorithm
- word segmentation
- word segmentation and pos tagging
Other assigned terms:
- 10-fold cross-validation
- association for computational linguistics
- baseline model
- binary features
- case
- character sequence
- characters
- chinese language
- chinese treebank
- chinese word
- conditional probability
- crf model
- ctb corpus
- document
- experimental results
- feature
- hownet
- index
- joint probability
- lexicon
- linguistics
- mapping
- maps
- maximum similarity
- meanings
- measure
- method
- n-best list
- part-of-speech
- penn chinese treebank
- penn treebank
- posterior
- posterior probability
- precision
- probability
- punctuation
- search space
- semantic
- semantic class
- semantic classes
- semantic features
- similarity measure
- similarity score
- similarity scores
- statistics
- tagger model
- tagging model
- tagging task
- term
- terms
- training
- training data
- training set
- treebank
- weight vector
- word
- word boundaries
- words