ACL RD-TEC 1.0 Summarization of C04-1058
Paper Title:
WHY NITPICKING WORKS: EVIDENCE FOR OCCAM'S RAZOR IN ERROR CORRECTORS
WHY NITPICKING WORKS: EVIDENCE FOR OCCAM'S RAZOR IN ERROR CORRECTORS
Authors: Dekai Wu and Grace Ngai and Marine Carpuat
Primarily assigned technology terms:
- adaboost
- algorithm
- boosting
- boosting algorithm
- bracketing
- categorization
- classification
- classifier
- classifier combination
- classifiers
- computational linguistics
- computing
- cross-validation
- data analysis
- decision list learning
- decision trees
- disambiguation
- discriminative training
- entity recognition
- error correcting
- error correction
- error-driven learning
- error-minimization
- feature engineering
- identification
- illustration
- language processing
- learner
- learning
- learning algorithm
- learning algorithms
- learning methods
- machine learning
- machine learning algorithms
- maximum entropy
- maximum entropy classifiers
- message understanding
- modeling
- named entity recognition
- named-entity identification
- named-entity recognition
- natural language processing
- nlp
- nlp systems
- parsing
- part-ofspeech tagging
- partitioning
- post-processing
- processing
- ranking
- recognition
- rule learning
- rule-based machine
- rulelearning
- rulelearning mechanism
- sampling
- scoring
- scoring function
- searching
- segmentation
- sense disambiguation
- tagging
- text categorization
- transformation-based learning
- tuning
- validation
- voting
- word sense disambiguation
Other assigned terms:
- annotated training set
- approach
- case
- classification task
- context features
- corpora
- data sets
- distribution
- empirical results
- entropy
- error rate
- evaluation set
- evaluations
- experimental results
- f-measure
- fact
- feature
- feature space
- hypotheses
- hypothesis
- hypothesis space
- language processing tasks
- learning model
- linguistics
- mechanisms
- message
- method
- model parameters
- named entity
- named-entity
- natural language
- nlp application
- nlp tasks
- ordered list
- part-ofspeech
- process
- processing tasks
- sentence
- sparse data
- sparse data problem
- statistics
- style
- svms
- technique
- test corpora
- test set
- text
- theory
- time complexity
- training
- training corpus
- training data
- training examples
- training phase
- training set
- trees
- understanding
- word
- word sense
- words