ACL RD-TEC 1.0 Summarization of W06-3809
Paper Title:
RANDOM-WALK TERM WEIGHTING FOR IMPROVED TEXT CLASSIFICATION
RANDOM-WALK TERM WEIGHTING FOR IMPROVED TEXT CLASSIFICATION
Authors: Samer Hassan and Carmen Banea
Primarily assigned technology terms:
- algorithm
- bayes classifier
- categorization
- classification
- classification algorithm
- classification system
- classifier
- classifiers
- computer science
- cross validation
- disambiguation
- document summarization
- encoding
- extraction application
- extraction tool
- feature selection
- feature weighting
- frequency weighting
- graph-based ranking
- information retrieval
- k-nearest neighbor
- k-neighbors
- kernel
- kernels
- keyword extraction
- knn
- language processing
- learning
- learning algorithms
- learning approach
- learning task
- linear kernel
- machine learning
- machine learning algorithms
- machine learning approach
- maximum likelihood
- modeling
- natural language processing
- pagerank random-walk
- processing
- quantification
- random walk
- random-walk
- ranking
- ranking algorithm
- retrieval system
- rocchio algorithm
- scoring
- sense disambiguation
- summarization
- support vector machines
- term weighting
- text categorization
- text classification
- text classifier
- text processing
- textrank keyword extraction
- tf weighting
- validation
- vertex selection
- voting
- weighting
- word sense disambiguation
Other assigned terms:
- 10-fold cross validation
- abbreviations
- approach
- case
- classification task
- co-occurrence
- co-occurrence relation
- coefficient
- conditional probability
- contingency table
- convergence
- correlation
- correlation coefficient
- cosine similarity
- data set
- data sets
- dependency relation
- distribution
- document
- error rate
- evaluations
- fact
- feature
- feature vector
- feature vectors
- feature weights
- formal language
- implementation
- independence assumption
- keyword
- labeling
- language model
- language models
- language processing task
- language processing tasks
- likelihood
- logic
- meaning
- measure
- measures
- message
- method
- multinomial model
- n-grams
- natural language
- natural language processing tasks
- natural language texts
- nouns
- pagerank
- probabilities
- probability
- process
- processing tasks
- punctuation
- relation
- scoring scheme
- support vector
- symbols
- system performance
- technique
- term
- term co-occurrence
- term frequency
- terms
- text
- text classification task
- theorem
- topics
- training
- training document
- training documents
- training examples
- unigram
- unlabeled examples
- vertex
- weighting scheme
- window size
- word
- word features
- word sense
- words