ACL RD-TEC 1.0 Summarization of W99-0612
Paper Title:
LANGUAGE INDEPENDENT NAMED ENTITY RECOGNITION COMBINING MORPHOLOGICAL AND CONTEXTUAL EVIDENCE
Authors: Silviu Cucerzan and David Yarowsky
Primarily assigned technology terms:
- active learning
- algorithm
- bootstrapping
- bootstrapping algorithm
- bootstrapping procedure
- bootstrapping process
- capitalization
- classification
- classifiers
- disambiguation
- entity classification
- entity identification
- entity recognition
- entity recognizer
- entity recognizers
- expectation-maximization
- greedy search
- hierarchical smoothing
- hierarchical smoothing procedure
- identification
- incremental learning
- interactive system
- iterative bootstrapping
- iterative learning
- language-independent bootstrapping
- learning
- learning algorithm
- learning procedure
- learning system
- machine learning
- matching
- maximum entropy
- measuring
- modeling
- morphology
- named entity identification
- named entity recognition
- named-entity classification
- normalization
- partial matching
- prefix trie
- re-estimation
- reading
- recognition
- recognizer
- reestimation
- search
- segmentation
- segmentation system
- smoothing
- suffix trie
- supervised learning
- system training
- tagging
- text acquisition
- tokenization
- word segmentation
Other assigned terms:
- affixes
- ambiguity
- annotated corpora
- annotation
- annotators
- approach
- backoff
- baseline model
- baseline performance
- bias
- case
- characters
- class distribution
- class probability
- classification accuracy
- concept
- conditional distribution
- contextual information
- convergence
- corpora
- cross-language analysis
- data sets
- data structure
- data structures
- discourse
- distribution
- distributional class
- document
- document collection
- document length
- entity class
- entropy
- estimation
- evaluations
- exact match
- f-measure
- fact
- hindi
- human annotation
- human performance
- implementation
- information content
- information sources
- inheritance
- interpolation
- knowledge
- labeling
- linear combination
- maximum entropy principle
- meaning
- measure
- measures
- method
- morphological information
- name class
- named entities
- named entity
- named-entity
- names
- nouns
- precision
- prior probability
- priori
- probabilities
- probability
- probability distribution
- probability distributions
- procedure
- process
- pronoun
- punctuation
- representations
- seed
- seed words
- sources of information
- statistics
- subtree
- suffix
- symbol
- system performance
- tags
- terms
- text
- tokens
- training
- training data
- training examples
- training set
- unannotated text
- user
- word
- word boundaries
- word level
- word types
- words