tech,6-1-A94-1011,bq |
of
<term>
NLP techniques
</term>
for
<term>
|
document classification
|
</term>
has not produced significant improvements
|
#19889
The use of NLP techniques for document classification has not produced significant improvements in performance within the standard term weighting statistical assignment paradigm (Fagan, 1987; Lewis, 1992bc; Buckley, 1993). |
other,29-3-A94-1011,bq |
methods
</term>
to derive notions of
<term>
|
noun group
|
</term>
,
<term>
verb group
</term>
, and so
|
#19974
A novel method for adding linguistic annotation to corpora is presented which involves using a statistical POS tagger in conjunction with unsupervised structure finding methods to derive notions of noun group, verb group, and so on, which is inherently extensible to more sophisticated annotation, and does not require a pre-tagged corpus to fit. |
tech,26-8-A94-1011,bq |
sophisticated representations
</term>
for
<term>
|
document classification
|
</term>
. This paper reports on work done
|
#20160
It therefore shows that statistical systems can exploit sophisticated representations of documents, and lends some support to the use of more linguistically sophisticated representations for document classification. |