ACL RD-TEC 1.0 Summarization of N06-1061
Paper Title:
LANGUAGE MODEL-BASED DOCUMENT CLUSTERING USING RANDOM WALKS
LANGUAGE MODEL-BASED DOCUMENT CLUSTERING USING RANDOM WALKS
Primarily assigned technology terms:
- algorithm
- approximation
- bayesian smoothing
- clustering
- clustering algorithm
- complete-link clustering
- computing
- database
- disambiguation
- document clustering
- document representation
- document retrieval
- extractive summarization
- graph construction
- graph representation
- graph-based inference
- hierarchical clustering
- information retrieval
- information-theoretic clustering
- k-means
- language modeling
- language modeling approach
- mapping function
- maximum likelihood
- ml estimation
- modeling
- nlp
- parameter tuning
- parsing
- processing
- random walk
- reporting
- retrieval system
- sense disambiguation
- single-link clustering
- smoothing
- smoothing technique
- summarization
- tuning
- vector representation
- vector space representation
- word clustering
- word sense disambiguation
Other assigned terms:
- analogy
- approach
- citation
- cluster
- clusters
- confusion matrix
- corpora
- cosine similarity
- data set
- data sets
- distribution
- document
- document collection
- document vector
- document vectors
- estimation
- events
- experimental setting
- f-measure
- generation
- interpretation
- joint distribution
- language model
- language models
- likelihood
- mapping
- maps
- measure
- method
- mutual information
- phrase
- phrase attachment
- precision
- prepositional phrase
- prepositional phrase attachment
- probabilities
- probability
- probability distribution
- probability distributions
- procedure
- queries
- query
- relation
- representations
- scalability
- seed
- semantic
- semantic relatedness
- semantic similarity
- semantic structure
- sentences
- similarity function
- similarity metric
- similarity metrics
- smoothing parameter
- subtree
- subtrees
- technique
- term
- terms
- text
- toolkit
- transition probability
- tree
- unigram
- vector space
- word
- word sense
- words