ACL RD-TEC 1.0 Summarization of W98-1104
Paper Title:
USING SUFFIX ARRAYS TO COMPUTE TERM FREQUENCY AND DOCUMENT FREQUENCY FOR ALL SUBSTRINGS IN A CORPUS
Authors: Mikio Yamamoto and Kenneth W. Church
Primarily assigned technology terms:
Other assigned terms:
- approach
- array
- case
- characters
- cluster
- concept
- corpora
- cpu time
- data structure
- document
- document frequency
- english corpus
- implementation
- index
- inverse document frequency
- japanese corpus
- large corpora
- linguistics
- measures
- method
- mutual information
- ngram
- phrase
- procedure
- ridf value
- substring
- suffix
- suffixes
- term
- term frequency
- terms
- text
- words