ACL RD-TEC 1.0 Summarization of W03-1805
Paper Title:
A LANGUAGE MODEL APPROACH TO KEYPHRASE EXTRACTION
Authors: Takashi Tomokiyo and Matthew Hurst
Primarily assigned technology terms:
- algorithm
- analysis tool
- boundary segmentation
- chi-square test
- classifier
- collocation discovery
- document analysis
- extraction procedure
- keyphrase extraction
- keyphrase finding
- learning
- likelihood estimation
- linear interpolation
- parameter tuning
- phrase boundary segmentation
- phrase extension
- ranking
- ratio test
- scoring
- scoring function
- segmentation
- segmentation algorithm
- smoothing
- smoothing technique
- smoothing techniques
- spelling
- supervised learning
- tuning
Other assigned terms:
- approach
- background corpus
- background information
- background model
- bigram
- cache
- case
- citation
- cohesion
- collocation
- concept
- corpora
- correlation
- cross entropy
- data set
- data sparseness
- discourse
- discourse model
- distribution
- document
- document set
- empirical results
- entropy
- estimation
- events
- experimental results
- exponential model
- geometric mean
- human judgment
- hypothesis
- interpolation
- keyphrase
- keyword
- kl divergence
- knowledge
- labeling
- language model
- language models
- learning problem
- likelihood
- likelihood ratio
- linguistic
- linguistic filter
- local context
- log-likelihood
- log-likelihood ratio
- measure
- message
- method
- mutual information
- n-gram
- n-gram model
- n-gram models
- ngram
- noun phrases
- null hypothesis
- opinion
- perplexity
- phrase
- phrase boundary
- pointwise mutual information
- portability
- prior probability
- probability
- probability value
- procedure
- punctuation
- query
- relative frequency
- relative frequency ratio
- sparse data
- statistic
- statistics
- stopword list
- style
- supervised learning problem
- symbols
- technique
- term
- terms
- test data
- text
- tokens
- training
- training corpus
- trigram
- trigram language model
- unigram
- unigram language model
- unigram model
- user
- web site
- word
- word collocation
- word pair
- word sequence
- words