ACL RD-TEC 1.0 Summarization of P97-1041
Paper Title:
A TRAINABLE RULE-BASED ALGORITHM FOR WORD SEGMENTATION
A TRAINABLE RULE-BASED ALGORITHM FOR WORD SEGMENTATION
Primarily assigned technology terms:
- algorithm
- approximation
- bracketing
- chinese information retrieval
- chinese segmentation
- chunking
- computing
- corpus-based language processing
- decision-tree
- decision-tree induction
- disambiguation
- error analysis
- error reduction
- error-driven learning
- greedy algorithm
- induction
- information retrieval
- knowledge engineering
- language processing
- language processing method
- learning
- learning algorithm
- machine learning
- matching
- matching algorithm
- maximum matching
- morphological analyzers
- morphological disambiguation
- morphological segmentation
- nlp
- parser
- parsing
- part-of-speech tagger
- phrase parsing
- processing
- reporting
- scoring
- search
- segmentation
- segmentation algorithm
- segmenter
- tagger
- text chunking
- trainable segmentation
- transformation learning
- transformation-based error-driven learning
- transformation-based learning
- word segmentation
- word separation
Other assigned terms:
- approach
- bigram
- case
- character sequence
- characters
- chinese word
- chinese words
- corpora
- debugging
- domain knowledge
- english corpus
- english language
- english sentence
- error rate
- f-measure
- fact
- gold standard
- idiomatic expressions
- knowledge
- language resources
- large corpus
- latex
- lexica
- lexical resources
- lexicon
- measures
- method
- names
- nlp tasks
- part-of-speech
- person names
- phrase
- phrase attachment
- precision
- prefixes and suffixes
- prepositional phrase
- prepositional phrase attachment
- procedure
- process
- proper names
- punctuation
- rule sequence
- segmentation accuracy
- segments
- sentence
- sentences
- suffixes
- syntax
- technique
- test data
- test set
- text
- thai language
- thai word
- training
- training data
- training set
- transformation
- word
- word boundaries
- word lists
- word model
- words
- writing system