ACL RD-TEC 1.0 Summarization of W04-3239
Paper Title:
A BOOSTING ALGORITHM FOR CLASSIFICATION OF SEMI-STRUCTURED TEXT
A BOOSTING ALGORITHM FOR CLASSIFICATION OF SEMI-STRUCTURED TEXT
Authors: Taku Kudo and Yuji Matsumoto
Primarily assigned technology terms:
- 5-fold cross validation
- adaboost
- algorithm
- analyzer
- bag-of-word representation
- bag-of-words kernel
- binary classification
- binary classifier
- boosting
- boosting algorithm
- categorization
- classification
- classification algorithm
- classifier
- classifiers
- computing
- convolution kernels
- cross validation
- cross-validation
- feature representation
- identification
- information extraction
- internet
- japanese morphological analyzer
- kernel
- kernels
- learner
- learning
- learning algorithm
- learning algorithms
- machine learning
- machine learning algorithms
- modality identification
- morphological analyzer
- nlp
- parse re-ranking
- processing
- pruning
- re-ranking
- review classification
- search
- sentence classification
- smoothing
- support vector machines
- text classification
- text processing
- topic identification
- topic-based text classification
- tree classification
- tree kernel
- validation
- weak learner
- world wide web
Other assigned terms:
- adjective
- annotation
- approach
- chunk
- classification problem
- classification tasks
- coefficient
- concept
- convergence
- data structure
- dependency tree
- document
- domain knowledge
- error rate
- experimental setting
- f-measure
- feature
- feature set
- feature space
- feature vector
- head word
- heuristic
- hypotheses
- hypothesis
- implementation
- knowledge
- labeling
- lattice
- leaf
- linear combination
- mapping
- maps
- meaning
- method
- modality
- n-gram
- n-grams
- ngram
- opinion
- opinions
- parse
- parts-of-speech
- positive and negative examples
- process
- relation
- search problem
- search space
- sentence
- sentences
- sparse data
- statistical significance
- structural information
- structural representation
- subtree
- subtrees
- support vector
- svms
- syntactic relations
- taxonomy
- terms
- text
- theorem
- topics
- training
- training data
- training examples
- tree
- trees
- word
- word boundaries
- word order
- word sequence
- words
- xml document