other,29-3-A94-1011,ak methods </term> to derive notions of <term> noun group </term> , <term> verb group </term> , and so
other,11-5-A94-1011,ak <term> descriptors </term> from individual <term> phrases </term> rather than from the <term> weighted
other,40-4-A94-1011,ak descriptors ( keywords ) </term> than single <term> words </term> are . This leads us to consider the
model,6-6-A94-1011,ak investigate how sets of individually <term> high-precision rules </term> can result in a low <term> precision
other,8-4-A94-1011,ak distinguishing features of a more <term> linguistically sophisticated representation </term> of <term> documents </term> over a <term>
tech,3-1-A94-1011,ak practical translation use . The use of <term> NLP techniques </term> for <term> document classification </term>
other,22-8-A94-1011,ak lends some support to the use of more <term> linguistically sophisticated representations </term> for <term> document classification </term>
other,33-4-A94-1011,ak frequently individually good predictors of <term> document descriptors ( keywords ) </term> than single <term> words </term> are
other,12-4-A94-1011,ak sophisticated representation </term> of <term> documents </term> over a <term> word set based representation
tech,21-3-A94-1011,ak POS tagger </term> in conjunction with <term> unsupervised structure finding methods </term> to derive notions of <term> noun group
measure(ment),13-6-A94-1011,ak high-precision rules </term> can result in a low <term> precision </term> when used together , and develop
measure(ment),16-5-A94-1011,ak phrases </term> rather than from the <term> weighted sum </term> of a <term> word set representation
tech,6-1-A94-1011,ak use of <term> NLP techniques </term> for <term> document classification </term> has not produced significant improvements
other,18-7-A94-1011,ak particularly suitable for exploiting <term> linguistically sophisticated representations </term> , and show that a <term> statistically
tech,45-3-A94-1011,ak inherently extensible to more sophisticated <term> annotation </term> , and does not require a <term> pre-tagged
other,8-8-A94-1011,ak statistical systems </term> can exploit <term> sophisticated representations </term> of <term> documents </term> , and lends
other,15-4-A94-1011,ak </term> of <term> documents </term> over a <term> word set based representation </term> of them is that linguistically sophisticated
other,8-5-A94-1011,ak leads us to consider the assignment of <term> descriptors </term> from individual <term> phrases </term>
lr,52-3-A94-1011,ak annotation </term> , and does not require a <term> pre-tagged corpus </term> to fit . One of the distinguishing
other,20-5-A94-1011,ak from the <term> weighted sum </term> of a <term> word set representation </term> . We investigate how sets of individually
hide detail