We treat the problem of classifying documents as that of conducting
<term>
statistical hypothesis testing
</term>
over
<term>
finite mixture models
</term>
, and employ the
<term>
EM algorithm
</term>
to efficiently estimate
<term>
parameters
</term>
in a
<term>
finite mixture model
</term>
.
#29121We treat the problem of classifying documents as that of conducting statistical hypothesis testing over finite mixture models, and employ the EM algorithm to efficiently estimate parameters in a finite mixture model.
other,14-2-P97-1006,ak
We define for each category a
<term>
finite mixture model
</term>
based on
<term>
soft clustering
</term>
of
<term>
words
</term>
.
#29092We define for each category a finite mixture model based on soft clustering of words .