ACL RD-TEC 1.0 Summarization of N03-1032
Paper Title:
FREQUENCY ESTIMATES FOR STATISTICAL WORD SIMILARITY MEASURES
FREQUENCY ESTIMATES FOR STATISTICAL WORD SIMILARITY MEASURES
Authors: Egidio L. Terra and Charles L. A. Clarke
Primarily assigned technology terms:
- approximation
- co-occurrence frequency estimation
- comparative evaluation
- crawler
- disambiguation
- document retrieval
- estimation method
- frequency estimation
- grouping
- information retrieval
- language modeling
- language processing
- latent semantic analysis
- likelihood estimate
- likelihood ratio test
- maximum entropy
- maximum entropy model
- maximum likelihood
- measuring
- modeling
- natural language processing
- nlp
- parameter training
- parser
- parsers
- parsing
- part-of-speech tagger
- pos tagger
- processing
- ranking
- ratio test
- search
- semantic analysis
- sense disambiguation
- smoothing
- smoothing techniques
- syntax analysis
- tagger
- tagging
- tagging process
- word sense disambiguation
Other assigned terms:
- approach
- binomial distribution
- case
- co-occurrence
- co-occurrence frequency
- co-occurrences
- collocation
- comparative study
- conditional probabilities
- conditional probability
- contingency table
- corpora
- corpus size
- cosine measure
- distribution
- document
- entropy
- estimation
- fact
- feedback model
- foreign language
- hypotheses
- hypothesis
- independence assumption
- joint probability
- keyword
- knowledge
- large corpus
- latent semantic
- likelihood
- likelihood function
- likelihood ratio
- maximum likelihood estimate
- measure
- measures
- method
- mutual information
- n-gram
- n-gram model
- natural language
- natural language texts
- noise
- norm
- pairs of words
- paragraph
- part-of-speech
- part-of-speech tag
- passage
- perplexity
- pointwise mutual information
- precision
- probabilities
- probability
- process
- quantitative information
- queries
- seed
- semantic
- sentence
- similarity between words
- similarity measure
- similarity measures
- size of the corpus
- statistic
- statistics
- style
- synonym
- synonymy
- syntactic category
- syntactical information
- syntax
- target word
- terms
- test set
- text
- training
- transformation
- window size
- word
- word association
- word co-occurrence
- word frequencies
- word sense
- word senses
- word sequences
- word similarity
- word similarity measure
- words