ACL RD-TEC 1.0 Summarization of W02-2007
Paper Title:
LANGUAGE INDEPENDENT NER USING A UNIFIED MODEL OF INTERNAL AND CONTEXTUAL EVIDENCE
LANGUAGE INDEPENDENT NER USING A UNIFIED MODEL OF INTERNAL AND CONTEXTUAL EVIDENCE
Authors: Silviu Cucerzan and David Yarowsky
Primarily assigned technology terms:
- algorithm
- bootstrapping
- bootstrapping process
- capitalization
- classification
- co-training
- entity recognition
- hierarchical smoothing
- hierarchical smoothing procedure
- incremental learning
- iterative learning
- learning
- model bootstrapping
- named entity recognition
- normalization
- parameter estimation
- re-estimation
- recognition
- segmentation
- segmentation process
- segmentation system
- smoothing
- tagger
- topic segmentation
- tuning
Other assigned terms:
- annotated corpus
- annotation
- approach
- chunks
- class distribution
- class probability
- co-occurrence
- conditional distribution
- conditional probability
- contextual information
- data structure
- determiners
- discourse
- distribution
- distributional class
- document
- dutch
- entity type
- estimation
- f-measure
- heuristics
- implementation
- large corpus
- method
- model parameters
- named entity
- names
- normalization factor
- paragraph
- part-of-speech
- part-of-speech information
- person names
- pos information
- precision
- prefixes and suffixes
- prepositions
- probabilities
- probability
- probability distribution
- probability distributions
- procedure
- process
- pronoun
- pronouns
- representations
- seed
- sentence
- statistics
- suffix
- suffixes
- system development
- system performance
- tags
- terms
- test data
- text
- training
- training corpus
- training data
- word
- word structure
- words