other,15-4-A94-1011,bq representation of documents </term> over a <term> word set based representation </term> of them is that <term> linguistically
tech,18-9-A94-1011,bq check </term> , which is creating a <term> PC based tool </term> to be used in the <term> technical
other,26-9-A94-1011,bq based tool </term> to be used in the <term> technical abstracting industry </term> . This paper proposes a model using
other,36-7-A94-1011,bq improved performance for sophisticated <term> representations </term> . It therefore shows that <term> statistical
tech,10-7-A94-1011,bq repeat results which show that standard <term> statistical models </term> are not particularly suitable for
tech,6-1-A94-1011,bq use of <term> NLP techniques </term> for <term> document classification </term> has not produced significant improvements
other,45-3-A94-1011,bq inherently extensible to more sophisticated <term> annotation </term> , and does not require a <term> pre-tagged
tech,21-3-A94-1011,bq POS tagger </term> in conjunction with <term> unsupervised structure finding methods </term> to derive notions of <term> noun group
tech,16-2-A94-1011,bq if the power of recently developed <term> NLP techniques </term> are to be successfully applied in
tech,26-7-A94-1011,bq representations </term> , and show that a <term> statistically fitted rule-based model </term> provides significantly improved performance
tech,3-1-A94-1011,bq translation </term> use . The use of <term> NLP techniques </term> for <term> document classification </term>
other,8-8-A94-1011,bq statistical systems </term> can exploit <term> sophisticated representations of documents </term> , and lends some support to the use
other,7-6-A94-1011,bq sets of individually high-precision <term> rules </term> can result in a <term> low precision
measure(ment),12-6-A94-1011,bq high-precision <term> rules </term> can result in a <term> low precision </term> when used together , and develop
lr,52-3-A94-1011,bq annotation </term> , and does not require a <term> pre-tagged corpus </term> to fit . One of the distinguishing
other,8-9-A94-1011,bq paper reports on work done for the <term> LRE project SmTA double check </term> , which is creating a <term> PC based
tech,15-3-A94-1011,bq is presented which involves using a <term> statistical POS tagger </term> in conjunction with <term> unsupervised
other,5-3-A94-1011,bq </term> . A novel method for adding <term> linguistic annotation </term> to <term> corpora </term> is presented
tech,26-8-A94-1011,bq sophisticated representations </term> for <term> document classification </term> . This paper reports on work done
other,22-8-A94-1011,bq lends some support to the use of more <term> linguistically sophisticated representations </term> for <term> document classification </term>
hide detail