ACL RD-TEC 1.0 Summarization of W97-0803
Paper Title:
EXTENDING A THESAURUS BY CLASSIFYING WORDS
EXTENDING A THESAURUS BY CLASSIFYING WORDS
Authors: Tokunaga Takenobu and Fujii Atsushi and Sakurai Naoyuki and Tanaka Hozumi
Primarily assigned technology terms:
- algorithm
- beam search
- categorization
- classification
- clustering
- clustering algorithm
- code assignment
- coding
- coding system
- computer-based natural language processing
- computing
- cross validation
- disambiguation
- document categorization
- document categorization \
- full parsing
- hierarchical clustering
- k-nearest neighbor
- k-nn
- language processing
- natural language processing
- nlp
- nlp systems
- parsing
- processing
- ranking
- reasoning
- search
- search process
- searching
- sense disambiguation
- tree search
- validation
- word classification
- word sense disambiguation
Other assigned terms:
- 10-fold cross validation
- accusative case
- adjective
- approach
- beam
- case
- characters
- cluster
- clusters
- co-occurrence
- co-occurrences
- compound noun
- computational overhead
- conditional independence
- corpora
- data sparseness
- data sparseness problem
- dictionary
- distribution
- document
- estimation
- evaluation data
- fact
- grammatical relations
- grammatical role
- heuristics
- hierarchical structure
- katakana
- knowledge
- large corpora
- large thesaurus
- leaf
- linguistic
- linguistic knowledge
- meaning
- method
- morphemes
- natural language
- nouns
- part of speech
- prior probability
- probabilistic model
- probabilities
- probability
- probability value
- process
- relation
- relative frequency
- search space
- search strategy
- sparseness problem
- tags
- target word
- terms
- test data
- text
- theorem
- thesaurus
- training
- training data
- tree
- tree structure
- trees
- verb
- vocabulary
- vocabulary size
- word
- word classes
- word co-occurrence
- word sense
- word senses
- wordnet
- words