ACL RD-TEC 1.0 Summarization of J00-3001
Paper Title:
EXTRACTING THE LOWEST-FREQUENCY WORDS: PITFALLS AND POSSIBILITIES
Authors: Marc Weeber and R. Harald Baayen and Rein Vos
Primarily assigned technology terms:
- algorithm
- approximation
- automatic extraction
- collocation extraction
- collocation-based term extraction
- computational linguistics
- database
- extraction application
- extraction method
- extraction procedure
- extraction system
- extraction systems
- extraction technique
- information extraction
- information extraction system
- information retrieval
- lexical extraction
- optimization
- statistical analysis
- statistical methods
- surveillance
- term extraction
- word extraction
- word extraction procedure
Other assigned terms:
- analogy
- approach
- case
- collocation
- compounds
- contingency table
- convergence
- corpora
- corpus size
- data sets
- distribution
- dutch
- dutch verb-particle
- f-measure
- fact
- frequency distribution
- hapax legomena
- hapax legomenon
- knowledge
- linguistics
- log-likelihood
- log-likelihood ratio
- marketing
- measure
- measures
- medline
- method
- mutual information
- newspaper corpus
- precision
- priori
- probability
- probability distribution
- procedure
- psycholinguistics
- seed
- seed term
- semantic
- semantic association
- subcorpus
- technique
- term
- terms
- tokens
- window size
- word
- word association
- word frequencies
- word frequency
- word types
- words