ACL RD-TEC 1.0 Summarization of W06-0124
Paper Title:
BOOSTING FOR CHINESE NAMED ENTITY RECOGNITION
Authors: Xiaofeng Yu and Marine Carpuat and Dekai Wu
Primarily assigned technology terms:
- adaboost
- algorithm
- binary classification
- boosting
- boosting algorithm
- capitalization
- categorization
- chinese language processing
- chinese named entity recognition
- classification
- classifier
- classifiers
- computational linguistics
- decision tree
- disambiguation
- entity identification
- entity recognition
- feature selection
- forward match
- hidden markov
- hidden markov model
- identification
- information extraction
- information retrieval
- language processing
- learner
- learning
- learning methods
- lexical analysis
- machine learning
- machine learning methods
- machine translation
- markov model
- modeling
- named entity identification
- named entity recognition
- natural language processing
- normalization
- pos tagging
- preprocessing
- processing
- question answering
- recognition
- segmentation
- sense disambiguation
- supervised learning
- tagging
- text categorization
- unknown word recognition
- weak classifier
- word recognition
- word segmentation
- word sense disambiguation
Other assigned terms:
- ambiguous segmentation
- approach
- association for computational linguistics
- capitalization information
- characters
- chinese language
- chinese lexical
- chinese words
- chunk
- chunk tag
- chunks
- classification accuracy
- context window
- corpora
- dictionaries
- distribution
- dutch
- entity recognition task
- evaluations
- fact
- feature
- feature set
- f-measure
- gazetteer
- geopolitical entity
- gold standard
- hypotheses
- hypothesis
- knowledge
- language processing applications
- linguistics
- method
- named entities
- named entity
- names
- natural language
- natural language processing applications
- ner model
- normalization factor
- nouns
- open test
- organization names
- part-of-speech
- person names
- pos tag
- precision
- proper names
- recognition task
- tags
- test corpora
- test set
- text
- training
- training corpora
- training corpus
- training data
- training examples
- training set
- tree
- vocabulary
- word
- word sense
- words