ACL RD-TEC 1.0 Summarization of W06-2914

Paper Title:
WORD DISTRIBUTIONS FOR THEMATIC SEGMENTATION IN A SUPPORT VECTOR MACHINE APPROACH

Authors: Maria Georgescul and Alexander Clark and Susan Armstrong

Primarily assigned technology terms:

Other assigned terms:

  • anaphora
  • annotators
  • approach
  • association for computational linguistics
  • bag of words
  • bias
  • binary classification problem
  • broadcast news
  • brown corpus
  • classification problem
  • coherence
  • cohesion
  • computational complexity
  • conditional independence
  • conll-x
  • corpora
  • correlation
  • cosine distance
  • data set
  • data sets
  • dialogues
  • dimensionality
  • discourse
  • discourse topic
  • distribution
  • document
  • document collections
  • document structure
  • empirical evaluation
  • entropy
  • error metric
  • error rate
  • estimation
  • evaluation measures
  • evaluation metrics
  • experimental results
  • fact
  • feature
  • feature space
  • frequency counts
  • genre
  • gold standard
  • grid
  • human annotators
  • hypothesis
  • hypothesis space
  • intention
  • inter-annotator agreement
  • kernel function
  • knowledge
  • labeling
  • latent semantic
  • learning machine
  • lemma
  • lexical level
  • likelihood
  • linear algebra
  • linguistics
  • log-likelihood
  • mapping
  • measure
  • measures
  • method
  • methodology
  • multimodal information
  • natural language
  • optimisation problem
  • paragraphs
  • parameter settings
  • parameter values
  • parametric model
  • precision
  • procedure
  • pronouns
  • prosodic information
  • regularization parameter
  • relation
  • representations
  • risk minimization principle
  • segment boundaries
  • segments
  • semantic
  • sentence
  • sentences
  • similarity measure
  • statistical significance
  • support vector
  • svms
  • system performance
  • technique
  • term
  • term weighting scheme
  • test set
  • text
  • text collection
  • text segment
  • textual structure
  • thematic segment
  • theory
  • topic shift
  • topics
  • training
  • training data
  • training example
  • training examples
  • training set
  • training time
  • transcriptions
  • transformation
  • tree
  • understanding
  • utterance
  • vector space
  • vocabulary
  • weighting scheme
  • word
  • word distribution
  • word frequencies
  • word frequency
  • word level
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***