ACL RD-TEC 1.0 Summarization of I05-3033
Paper Title:
TOWARDS A HYBRID MODEL FOR CHINESE WORD SEGMENTATION
TOWARDS A HYBRID MODEL FOR CHINESE WORD SEGMENTATION
Primarily assigned technology terms:
- algorithm
- character-based tagging
- chinese unknown word identification
- chinese word segmentation
- error analysis
- error-driven learning
- hmm tagger
- identification
- instantiation
- learning
- learning algorithm
- learning algorithms
- learning process
- machine learning
- machine learning algorithms
- pos tagging
- recognition
- segmentation
- segmenter
- tagger
- tagging
- tagging algorithm
- training process
- transformation-based error-driven learning
- transformation-based learning
- unknown word identification
- unknown word recognition
- viterbi
- viterbi algorithm
- word identification
- word recognition
- word segmentation
- word segmentation bakeoff
- word segmenter
Other assigned terms:
- approach
- character sequence
- characters
- chinese word
- compounds
- development set
- f-score
- gold standard
- heuristics
- hypothesis
- knowledge
- linguistic
- linguistic knowledge
- names
- part of speech
- probabilities
- probability
- process
- rule template
- segmentation accuracy
- segmentation bakeoff
- sentence
- sentences
- system architecture
- system description
- tag sequence
- tagging accuracy
- tagging task
- tags
- tagset
- terms
- test data
- test set
- text
- time expressions
- training
- training corpus
- training data
- training set
- transition probabilities
- word
- word resolution
- words