tech,3-1-A94-1011,bq translation </term> use . The use of <term> NLP techniques </term> for <term> document classification </term>
tech,6-1-A94-1011,bq use of <term> NLP techniques </term> for <term> document classification </term> has not produced significant improvements
tech,18-1-A94-1011,bq in performance within the standard <term> term weighting statistical assignment paradigm </term> ( Fagan 1987 ; Lewis , 1992bc ; Buckley
tech,16-2-A94-1011,bq if the power of recently developed <term> NLP techniques </term> are to be successfully applied in
tech,24-2-A94-1011,bq </term> are to be successfully applied in <term> IR </term> . A novel method for adding <term>
other,5-3-A94-1011,bq </term> . A novel method for adding <term> linguistic annotation </term> to <term> corpora </term> is presented
lr,8-3-A94-1011,bq <term> linguistic annotation </term> to <term> corpora </term> is presented which involves using
tech,15-3-A94-1011,bq is presented which involves using a <term> statistical POS tagger </term> in conjunction with <term> unsupervised
tech,21-3-A94-1011,bq POS tagger </term> in conjunction with <term> unsupervised structure finding methods </term> to derive notions of <term> noun group
other,29-3-A94-1011,bq methods </term> to derive notions of <term> noun group </term> , <term> verb group </term> , and so
other,32-3-A94-1011,bq notions of <term> noun group </term> , <term> verb group </term> , and so on which is inherently extensible
other,45-3-A94-1011,bq inherently extensible to more sophisticated <term> annotation </term> , and does not require a <term> pre-tagged
lr,52-3-A94-1011,bq annotation </term> , and does not require a <term> pre-tagged corpus </term> to fit . One of the distinguishing
other,8-4-A94-1011,bq distinguishing features of a more <term> linguistically sophisticated representation of documents </term> over a <term> word set based representation
other,15-4-A94-1011,bq representation of documents </term> over a <term> word set based representation </term> of them is that <term> linguistically
other,23-4-A94-1011,bq representation </term> of them is that <term> linguistically sophisticated units </term> are more frequently individually
other,33-4-A94-1011,bq frequently individually good predictors of <term> document descriptors ( keywords ) </term> than single <term> words </term> are
other,40-4-A94-1011,bq descriptors ( keywords ) </term> than single <term> words </term> are . This leads us to consider the
other,8-5-A94-1011,bq leads us to consider the assignment of <term> descriptors </term> from individual <term> phrases </term>
other,11-5-A94-1011,bq <term> descriptors </term> from individual <term> phrases </term> rather than from the <term> weighted
hide detail