ACL RD-TEC 1.0 Summarization of N01-1025
Paper Title:
CHUNKING WITH SUPPORT VECTOR MACHINES
Authors: Taku Kudo and Yuji Matsumoto
Primarily assigned technology terms:
- base phrase identification
- basenp identification
- beam search
- binary classification
- boosting
- brill tagger
- categorization
- chunking
- classification
- classifier
- classifiers
- cross validation
- cross validation method
- dependency structure analysis
- deterministic parsing
- dp matching
- dynamic programming
- empirical risk estimation
- entity extraction
- entity identification
- feature selection
- hidden markov
- hidden markov model
- identification
- japanese named entity extraction
- japanese named entity identification
- kernel
- language processing
- learning
- learning algorithms
- learning approaches
- learning program
- learning techniques
- machine learning
- machine learning algorithms
- machine learning approaches
- machine learning techniques
- markov model
- matching
- maximum entropy
- maximum entropy model
- named entity extraction
- named entity identification
- natural language processing
- nlp
- noun phrase identification
- np chunking
- optimization
- pairwise classification
- pairwise voting
- parsing
- part-of-speech tagging
- phrase identification
- polynomial kernel
- pos tagging
- processing
- risk estimation
- search
- statistical learning
- statistical learning theory
- structure analysis
- support vector machines
- tagger
- tagging
- text categorization
- tokenization
- validation
- voting
- voting scheme
- weighted voting
- weighted voting scheme
- weighting
- weighting method
Other assigned terms:
- annotated corpora
- approach
- base noun
- base noun phrase
- basenp
- beam
- binary classification task
- bunsetsu
- case
- chunk
- chunk representation
- chunks
- classification task
- computational overhead
- corpora
- data set
- data sets
- dependency structure
- dimensionality
- distribution
- entropy
- entropy models
- error rate
- estimation
- experimental results
- fact
- feature
- feature sets
- feature vector
- feature vectors
- heuristics
- hypotheses
- identification task
- kernel function
- maximum entropy models
- measure
- method
- named entity
- natural language
- nlp tasks
- noise
- noun phrase
- optimization problem
- part-of-speech
- part-of-speech tags
- penn treebank
- phrase
- pos tag
- precision
- probability
- procedure
- representation system
- representations
- sbar
- stem
- support vector
- svms
- svms training
- tags
- technique
- term
- test data
- test data set
- text
- theorem
- theory
- tokens
- training
- training data
- training data set
- training samples
- treebank
- vector space
- verb
- word
- words
- wsj corpus