ACL RD-TEC 1.0 Summarization of J00-3001

Paper Title:
EXTRACTING THE LOWEST-FREQUENCY WORDS: PITFALLS AND POSSIBILITIES

Authors: Marc Weeber and R. Harald Baayen and Rein Vos

Other assigned terms:

  • analogy
  • approach
  • case
  • collocation
  • compounds
  • contingency table
  • convergence
  • corpora
  • corpus size
  • data sets
  • distribution
  • dutch
  • dutch verb-particle
  • f-measure
  • fact
  • frequency distribution
  • hapax legomena
  • hapax legomenon
  • knowledge
  • linguistics
  • log-likelihood
  • log-likelihood ratio
  • marketing
  • measure
  • measures
  • medline
  • method
  • mutual information
  • newspaper corpus
  • precision
  • priori
  • probability
  • probability distribution
  • procedure
  • psycholinguistics
  • seed
  • seed term
  • semantic
  • semantic association
  • subcorpus
  • technique
  • term
  • terms
  • tokens
  • window size
  • word
  • word association
  • word frequencies
  • word frequency
  • word types
  • words

This page last edited on 10 May 2017.
