ACL RD-TEC 1.0 Summarization of C00-1059
Paper Title:
CORPUS-DEPENDENT ASSOCIATION THESAURI FOR INFORMATION RETRIEVAL
CORPUS-DEPENDENT ASSOCIATION THESAURI FOR INFORMATION RETRIEVAL
Authors: Hiroyuki Kaji and Yasutsugu Morimoto and Toshiko Aizono and Noriyuki Yamasaki
Primarily assigned technology terms:
- agglomerative clustering
- algorithm
- automatic generation
- clustering
- clustering algorithm
- correlation analysis
- data extraction
- disambiguation
- disambiguation method
- disanabiguation
- document clustering
- generation method
- global-statistics-based disambiguation
- information retrieval
- information retrieval systems
- local-statistics-based disambiguation
- matching
- navigation
- pattern matching
- ratio test
- resource development
- retrieval systems
- scatter\/gather document clustering
- search
- self-organizing map
- shortest path
- statistical disambiguation
- term clustering
- term extraction
- text retrieval
- thcsaurus generation
- thesaurus generation
- tile
Other assigned terms:
- ambiguity
- case
- cluster
- clusters
- co-occurrence
- co-occurrence frequency
- co-occurrence information
- collocation
- compound noun
- concept
- concepts
- correlation
- determiner
- distribution
- document
- document frequency
- domain-specific thesaurus
- function words
- generation
- hyponyms
- information need
- information space
- information structure
- inverse document frequency
- japanese compound
- linguistic
- log-likelihood
- log-likelihood ratio
- measure
- method
- mutual information
- noun phrase
- noun phrases
- nouns
- occurrence frequency
- part-of-speech
- patent
- phrase
- procedure
- query
- semantic
- semantic categories
- sentences
- statistics
- structural ambiguity
- suffix
- synonyms
- technology
- term
- term frequency
- terms
- text
- text corpus
- thesaurus
- tile structure
- topics
- user
- window size
- word
- word association
- word sequence
- words