ACL RD-TEC 1.0 Summarization of N04-1002
Paper Title:
CROSS-DOCUMENT COREFERENCE ON A LARGE SCALE CORPUS
CROSS-DOCUMENT COREFERENCE ON A LARGE SCALE CORPUS
Authors: Chung Heong Gooi and James Allan
Primarily assigned technology terms:
- agglomerative clustering
- agglomerative vector space
- algorithm
- analyzer
- approximation
- automatic content extraction
- clustering
- clustering algorithm
- computing
- coreference resolution
- cross-document coreference analysis
- cross-document coreference resolution
- detection and tracking
- disambiguation
- entity detection
- entity detection and tracking
- entity disambiguation
- entity disambiguation and clustering
- entity extraction
- entity extraction system
- extraction system
- incremental algorithm
- information extraction
- information fusion
- information retrieval
- intelligent information retrieval
- mention detection
- message understanding
- named entity extraction
- scoring
- search
- search engine
- sense disambiguation
- smoothing
- statistical methods
- threshold selection
- vector space model
- word sense disambiguation
Other assigned terms:
- ambiguity
- approach
- bias
- case
- cluster
- clusters
- community
- coreference chains
- coreference information
- corpora
- cosine similarity
- cross document coreference
- cross-document coreference
- disambiguation model
- distribution
- document
- document coreference
- evaluation measures
- evaluation set
- evaluations
- f measure
- f-measure
- genre
- healthcare
- hypothesis
- implementation
- incremental approach
- kl divergence
- kullback-leibler divergence
- language model
- large corpus
- large scale corpus
- mappings
- measure
- measures
- message
- message understanding conference
- methodology
- named entities
- named entity
- names
- nist
- opinions
- precision
- probabilities
- probability
- probability distribution
- probability distributions
- process
- queries
- query
- research and development
- running time
- runtime
- sentences
- skew divergence
- tagged corpora
- tags
- technique
- term
- terms
- test corpora
- test set
- text
- training
- training data
- understanding
- vector space
- vocabulary
- window size
- word
- word sense
- words