ACL RD-TEC 1.0 Summarization of W05-0608
Paper Title:
DOMAIN KERNELS FOR TEXT CATEGORIZATION
Authors: Alfio Gliozzo and Carlo Strapparava
Primarily assigned technology terms:
- algorithm
- bag-of-words feature representation
- bootstrapping
- bootstrapping process
- categorization
- classification
- classifier
- clustering
- clustering algorithm
- co-training
- co-training algorithm
- comparative evaluation
- computational linguistics
- computational natural language learning
- computer science
- computing
- decomposition
- disambiguation
- domain kernels
- expectation maximization
- feature mapping
- feature representation
- feature selection
- hardware
- idf term weighting
- indexing
- information retrieval
- kernel
- kernels
- knowledge acquisition
- language learning
- language processing
- latent semantic analysis
- latent semantic indexing
- latent semantic kernel
- learning
- learning algorithm
- learning method
- learning process
- learning techniques
- linear kernel
- machine learning
- machine learning techniques
- mapping function
- modeling
- natural language learning
- natural language processing
- nlp
- preprocessing
- processing
- reporting
- semantic analysis
- semantic indexing
- semantic web
- semi-supervised learning
- sense disambiguation
- similarity estimation
- singular value decomposition
- splitting
- supervised classification
- supervised learning
- support vector machine
- svm approach
- term clustering
- term weighting
- text categorization
- text classification
- text clustering
- tuning
- unsupervised technique
- user modeling
- vector space model
- weighting
- word sense disambiguation
Other assigned terms:
- acquisition technique
- adjective
- adverb
- ambiguity
- analogy
- approach
- association for computational linguistics
- background knowledge
- case
- category label
- cluster
- clusters
- computational linguistics domain
- concepts
- corpora
- cosine similarity
- data set
- data sets
- device
- dimensionality
- document
- document frequency
- document similarity
- domain model
- error rate
- estimation
- external knowledge
- external knowledge source
- fact
- feature
- feature space
- hyperonymy
- hypothesis
- implementation
- index
- inverse document frequency
- kernel function
- knowledge
- knowledge acquisition bottleneck
- labeled training data
- large corpora
- latent semantic
- latent semantic space
- learning schema
- lemma
- lexical ambiguity
- linguistics
- mapping
- maps
- measure
- method
- methodology
- natural language
- nlp tasks
- parameter settings
- part of speech
- parts of speech
- positive and negative examples
- precision
- probabilistic approach
- process
- reuters corpus
- schema
- semantic
- semantic domain
- semantic space
- semi-supervised approach
- sentences
- similarity function
- similarity scores
- support vector
- svm implementation
- svms
- synonymy
- technique
- term
- terms
- test data
- test set
- text
- topics
- training
- training and test data
- training corpora
- training data
- training examples
- user
- vector space
- verb
- vocabulary
- word
- word sense
- words