ACL RD-TEC 1.0 Summarization of A94-1027
Paper Title:
A PROBABILISTIC MODEL FOR TEXT CATEGORIZATION: BASED ON A SINGLE RANDOM VARIABLE WITH MULTIPLE VALUES
A PROBABILISTIC MODEL FOR TEXT CATEGORIZATION: BASED ON A SINGLE RANDOM VARIABLE WITH MULTIPLE VALUES
Authors: Makoto Iwayama and Takenobu Tokunaga
Primarily assigned technology terms:
- approximation
- automatic text categorization
- binary estimation
- categorization
- category assignment
- category assignment strategy
- classification
- clustering
- document representation
- estimation method
- estimator
- feature selection
- feature selection method
- indexing
- information retrieval
- part-of-speech tagger
- preprocessing
- probability estimation
- random sampling
- ranking
- relevance weighting
- sampling
- scoring
- search
- selection method
- smoothing
- tagger
- term weighting
- text categorization
- thresholding
- weighting
Other assigned terms:
- case
- conditional independence
- derivation
- dictionary
- document
- empirical results
- estimation
- feature
- geometric mean
- grounding
- index
- method
- nouns
- part-of-speech
- posterior
- posterior probability
- precision
- prior probability
- probabilistic model
- probabilistic models
- probabilities
- probability
- probability theory
- process
- search strategy
- technology
- term
- terms
- test set
- text
- theorem
- theory
- topics
- training
- training data
- training documents
- training set
- transformation
- word
- words