ACL RD-TEC 1.0 Summarization of W04-0701
Paper Title:
MULTI-DOCUMENT PERSON NAME RESOLUTION
MULTI-DOCUMENT PERSON NAME RESOLUTION
Authors: Michael Fleischman and Eduard Hovy
Primarily assigned technology terms:
- agglomerative clustering
- agglomerative method
- algorithm
- binary classifier
- classification
- classifier
- clustering
- clustering algorithm
- clustering technique
- coreference resolution
- cross validation
- cross-validation
- crossvalidation
- disambiguation
- feature generation
- feature selection
- greedy clustering
- greedy feature selection
- language processing
- learning
- maximum entropy
- maximum entropy model
- name disambiguation
- name resolution
- natural language processing
- normalization
- person name disambiguation
- person name resolution
- processing
- querying
- referent disambiguation
- search
- selection technique
- sense disambiguation
- single clustering
- splitting
- validation
- vector space model
- web search
- word sense disambiguation
Other assigned terms:
- ambiguity
- annotator
- approach
- artificial identity
- bag of words
- bias
- case
- cluster
- clusters
- complex noun
- concept
- concepts
- contextual information
- coreference chains
- corpora
- correlation
- development set
- distribution
- document
- entropy
- evaluation metric
- evaluations
- experimental results
- f-measure
- fact
- feature
- feature set
- feature types
- feature vector
- feature weights
- frequency counts
- generation
- gold standard
- human annotator
- implementation
- knowledge
- large corpora
- large corpus
- lexical items
- likelihood
- measure
- method
- methodology
- model parameters
- named entities
- names
- natural language
- noise
- noun phrase
- ontology
- orthography
- person names
- phrase
- precision
- probabilities
- probability
- query
- search results
- semantic
- semantic distance
- semantic features
- semantic relatedness
- sense ambiguity
- sentences
- similarity measure
- similarity metric
- similarity metrics
- similarity score
- standard deviation
- statistic
- statistics
- technique
- term
- term frequency
- test data
- test set
- text
- training
- training set
- vector space
- vertex
- word
- word sense
- word sense ambiguity
- wordnet
- words