ACL RD-TEC 1.0 Summarization of W06-2805
Paper Title:
LEARNING TO RECOGNIZE BLOGS: A PRELIMINARY EXPLORATION
Authors: Erik Elgersma and Maarten de Rijke
Primarily assigned technology terms:
- algorithm
- attribute selection
- binary blog classification
- binary classification
- blog classification
- bootstrap
- bootstrapping
- bootstrapping method
- classification
- classification process
- classifier
- classifiers
- co-training
- corpus creation
- crawler
- crawling
- cross-validation
- date detection
- decision tree
- detection algorithm
- internet
- iterative process
- learner
- learning
- learning algorithm
- learning algorithms
- machine learner
- machine learning
- machine learning algorithms
- model building
- ranking
- recognition
- reporting
- resampling
- search
- support-vector learning
- ten-fold cross-validation
- trend analysis
- weka
Other assigned terms:
- annotated dataset
- annotated training set
- approach
- blog content
- case
- classification problem
- classification research
- classification task
- data set
- document
- document frequency
- fact
- feature
- feature set
- genre
- heuristics
- implementation
- large training
- learning model
- method
- precision
- process
- style
- support vector
- technology
- term
- terms
- test data
- test set
- text
- tokens
- toolkit
- training
- training and test data
- training data
- training material
- training set
- tree
- user
- web page