ACL RD-TEC 1.0 Summarization of W06-0701
Paper Title:
DIMENSIONALITY REDUCTION AIDS TERM CO-OCCURRENCE BASED MULTI-DOCUMENT SUMMARIZATION
Authors: Ben Hachey and Gabriel Murray and David Reitter
Primarily assigned technology terms:
- algorithm
- annotated sentence extraction
- approximation
- automatic system
- boundary disambiguation
- boundary disambiguation module
- computational linguistics
- computing
- correlation analysis
- decomposition
- dimensionality reduction
- disambiguation
- entity tagger
- estimator
- extraction system
- extractive summarisation
- identification
- information extraction
- information retrieval
- latent semantic analysis
- lemmatisation
- lemmatiser
- measuring
- mining
- modelling
- multi-document summarisation
- multi-document summarization
- name discrimination
- named entity tagger
- optimisation
- preprocessing
- question answering
- question-answering
- redundancy removal
- relation discovery
- rouge evaluation
- selection algorithm
- semantic analysis
- sentence boundary disambiguation
- sentence extraction
- sentence identification
- sentence selection
- singular value decomposition
- smoothing
- summarisation
- summarization
- svd dimensionality reduction
- tagger
- term weighting
- text mining
- tokenisation
- transducer
- vector representation
- vector space model
- vector space representation
- weighting
- xml markup
Other assigned terms:
- annotators
- approach
- association for computational linguistics
- bigram
- case
- co-occurrence
- co-occurrence information
- co-occurrence matrix
- coherence
- concepts
- context information
- context vectors
- corpora
- corpus size
- correlation
- correlations
- data sets
- dimensionality
- distribution
- document
- document frequency
- document set
- document vector
- evaluation metric
- evaluation metrics
- evaluations
- experimental results
- f score
- f-score
- frame
- gold standard
- grammar
- human annotators
- human performance
- hypothesis
- hypothesis test
- implementation
- inverse document frequency
- knowledge
- large corpora
- large corpus
- latent semantic
- lexical semantics
- linear combination
- linguistic
- linguistics
- markup
- meaning
- meanings
- measure
- measures
- message
- method
- named entity
- ngram
- normal distribution
- pair similarity
- part-of-speech
- precision
- procedure
- process
- projection
- query
- query vector
- relation
- relevance measurement
- representations
- semantic
- semantic model
- semantic similarity
- semantic space
- sentence
- sentence boundary
- sentence meaning
- sentence representation
- sentence similarity
- sentences
- size of the corpus
- statistical significance
- stems
- system performance
- technique
- term
- term co-occurrence
- term frequency
- terms
- text
- text collection
- textual coherence
- topics
- vector space
- vocabulary
- word
- word pair
- word pair similarity
- word similarity
- word vector
- words