ACL RD-TEC 1.0 Summarization of P04-2003

Paper Title:
SEARCHING FOR TOPICS IN A LARGE COLLECTION OF TEXTS

Authors: Martin Holub, Jiri Semecky, and Jiri Divis

Other assigned terms:

  • ambiguity
  • annotated test collection
  • annotator
  • approach
  • cluster
  • clusters
  • computational complexity
  • concept
  • concepts
  • correlation
  • cosine similarity
  • dimensionality
  • distance metric
  • document
  • document frequency
  • document similarity
  • document vectors
  • experimental results
  • feature
  • heuristic
  • human annotator
  • implementation
  • intention
  • kullback-leibler divergence
  • latent semantic
  • linguistics
  • method
  • nist
  • nouns
  • priori
  • probability
  • probability distributions
  • procedure
  • process
  • queries
  • query
  • representations
  • search procedure
  • seed
  • semantic
  • similarity threshold
  • statistics
  • technique
  • term
  • terms
  • test collection
  • text
  • text collection
  • text documents
  • time complexity
  • topics
  • training
  • training samples
  • transformation
  • words

This page last edited on 10 May 2017.