ACL RD-TEC 1.0 Summarization of W99-0613
Paper Title:
UNSUPERVISED MODELS FOR NAMED ENTITY CLASSIFICATION
Authors: Michael Collins and Yoram Singer
Primarily assigned technology terms:
- adaboost
- algorithm
- binary classification
- boosting
- boosting algorithm
- bootstrap
- bootstrapping
- bootstrapping approach
- classification
- classifier
- classifiers
- cross-validation
- decision list algorithm
- decision list learning
- disambiguation
- em algorithm
- entity classification
- entity extractor
- expectation maximization
- extractor
- feature extraction
- greedy approach
- hill-climb
- learner
- learning
- learning algorithm
- learning task
- list algorithm
- machine learning
- maximum likelihood
- maximum-entropy
- mutual bootstrapping
- naive bayes
- named entity classification
- named-entity classification
- nlp
- normalization
- optimization
- parser
- predictor
- querying
- search
- smoothing
- smoothing method
- spelling
- supervised learning
- supervised machine learning
- tile
- unsupervised algorithm
- unsupervised training
- weak learner
- weak learning
- weak learning algorithm
- word-sense disambiguation
- world wide web
Other assigned terms:
- approach
- binary classification problem
- case
- characters
- classification error
- classification problem
- conditional probability
- contextual features
- convergence
- corpora
- derivation
- distribution
- error rate
- estimation
- events
- experimental results
- extraction patterns
- fact
- feature
- feature sets
- feature type
- feature vector
- formalism
- generative model
- gold standard
- heuristic
- hyponyms
- hypotheses
- hypothesis
- implementation
- joint probability
- knowledge
- labeling
- large corpora
- large corpus
- learning problem
- lexicon
- likelihood
- likelihood function
- local maximum
- measure
- measures
- method
- modifier
- named entity
- named entity task
- named-entity
- names
- noise
- normalization factor
- noun phrase
- nouns
- pairs of words
- parameter settings
- parameter values
- phrase
- precision
- preposition
- prior probability
- probabilities
- probability
- procedure
- process
- proper names
- seed
- sentences
- singular noun
- smoothing parameter
- tags
- temporal expressions
- term
- terms
- test data
- test set
- text
- training
- training data
- training examples
- training set
- unlabeled examples
- vertex
- web pages
- word
- word sequence
- word sequences
- words