ACL RD-TEC 1.0 Summarization of P03-1037
Paper Title:
PARAMETRIC MODELS OF LINGUISTIC COUNT DATA
PARAMETRIC MODELS OF LINGUISTIC COUNT DATA
Primarily assigned technology terms:
- algorithm
- bayes classifier
- bayes text classification
- classification
- classifier
- classifiers
- document classification
- em algorithm
- goodness-of-fit test
- language modeling
- likelihood estimate
- likelihood estimation
- loglinear
- maximum likelihood
- maximum likelihood estimation
- modeling
- naive bayes
- naive bayes classifier
- nlp
- parameter estimate
- parameter estimation
- parameterization
- stemmer
- text classification
- tokenization
Other assigned terms:
- approach
- authorship
- binomial distribution
- binomial model
- case
- characters
- chunk
- classification accuracy
- classification task
- classification tasks
- conditional model
- data set
- distribution
- document
- document length
- duration
- estimation
- events
- fact
- feature
- hypothesis
- independence assumption
- likelihood
- linguistic
- maximum likelihood estimate
- message
- method
- mixture models
- multinomial distribution
- multinomial model
- names
- negative binomial
- nlp applications
- null hypothesis
- phrase
- poisson distribution
- priori
- probabilities
- probability
- probability estimates
- procedure
- process
- proper names
- recipe
- statistic
- target word
- term
- terms
- test data
- test set
- text
- text classification task
- token frequency
- training
- training and test data
- training data
- vocabulary
- vocabulary size
- word
- word frequency
- word types
- words