ACL RD-TEC 1.0 Summarization of W02-1026
Paper Title:
MANIPULATING LARGE CORPORA FOR TEXT CLASSIFICATION
Authors: Fumiyo Fukumoto and Yoshimi Suzuki
Primarily assigned technology terms:
- bayes classifier
- binary classification
- binary classifier
- classification
- classification method
- classifier
- classifiers
- cross validation
- internet
- learning
- learning task
- learning techniques
- machine learning
- machine learning techniques
- multi-label classification
- naive bayes
- naive bayes classifier
- optimization
- pattern recognition
- recognition
- support vector machines
- tagger
- text classification
- validation
Other assigned terms:
- approach
- case
- category level
- classification problem
- corpora
- distribution
- document
- document length
- error rate
- evaluation methodology
- feature
- hierarchical structure
- hypothesis
- index
- labeling
- large corpora
- leaf
- measure
- measures
- method
- methodology
- optimization problem
- part-of-speech
- positive and negative examples
- precision
- probabilities
- probability
- probability value
- procedure
- process
- reuters corpus
- support vector
- svms
- technique
- test data
- test set
- text
- time complexity
- training
- training data
- training documents
- training example
- training examples
- training set
- tree
- vector space
- vocabulary
- web content
- word
- words