ACL RD-TEC 1.0 Summarization of W03-1203

Paper Title:
COMBINING OPTIMAL CLUSTERING AND HIDDEN MARKOV MODELS FOR EXTRACTIVE SUMMARIZATION

Authors: Pascale Fung and Grace Ngai and Chi-Shun Cheung

Other assigned terms:

  • annotation
  • approach
  • bigram
  • case
  • cluster
  • cluster number
  • clustering model
  • clusters
  • coefficient
  • cohesion
  • compression ratio
  • concept
  • concepts
  • convergence
  • cosine measure
  • cosine similarity
  • cosine similarity measure
  • data sets
  • density function
  • dice
  • dice coefficient
  • discourse
  • discourse structures
  • distribution
  • document
  • document vectors
  • estimation
  • euclidean distance
  • evaluation method
  • experimental results
  • feature
  • feature vector
  • feature vectors
  • feature weights
  • frame
  • frequency counts
  • generation
  • generation process
  • heuristics
  • index
  • index terms
  • knowledge
  • labeling
  • lexical items
  • likelihood
  • linguistic
  • linguistic information
  • linguistic knowledge
  • manual annotation
  • mapping
  • markov models
  • measure
  • measures
  • method
  • model parameter
  • model parameters
  • multi-document summarization task
  • negative binomial
  • noisy channel
  • paragraph
  • paragraphs
  • part-of-speech
  • part-of-speech tags
  • poisson distribution
  • probabilistic approach
  • probabilistic framework
  • probabilistic model
  • probabilistic models
  • probabilities
  • probability
  • probability density
  • probability density function
  • probability distribution
  • probability distributions
  • process
  • query
  • relative frequency
  • relative frequency count
  • schema
  • segments
  • sentence
  • sentence boundaries
  • sentences
  • similarity measure
  • similarity measures
  • similarity score
  • similarity scores
  • summarization task
  • synonyms
  • system performance
  • tags
  • term
  • term distribution
  • terms
  • testing data
  • text
  • text cohesion
  • text segment
  • topics
  • training
  • training corpus
  • training data
  • training documents
  • training set
  • transition probabilities
  • unigram
  • user
  • user query
  • vector space
  • vocabulary
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***