C02-1087 classification apply VSM on their pre-processing stage . SOM does not reduce the
C02-1112 including . We plan to improve the pre-processing of our systems , the detection
C02-1033 the text , resulting from its pre-processing . A topic context is associated
C02-1087 have the label 0 because after pre-processing these articles are zero vectors
A97-1034 existing corpora without extensive pre-processing . • It does support the
A92-1025 identification as well as other forms of pre-processing . Because the pattern matcher
C02-1002 corpora . We briefly describe the pre-processing steps , a baseline iterative
A92-1025 Combining Information The statistical pre-processing methods and calculations of relevance
A92-1025 would not have worked without the pre-processing of relevant text , name and collocation
A97-1036 tractable by NLP software . This pre-processing can not usually be fully automated
C02-1027 processing in Bulgarian . First , the pre-processing modules for tokenisation , sentence
C02-1027 boundaries . LINGUA performs the pre-processing , needed as an input to the anaphora
A00-1031 the cleaning process , or during pre-processing , so the tagger can emit multiple
A88-1016 device ( not described here ) . * PRE-PROCESSING The main purpose of this first
A97-1036 formats that require extensive pre-processing to transform them into resources
C04-1031 expressions are identified in a pre-processing step in order to handle them
C00-2165 consists of three modules : the pre-processing module , the automatic tagging
C02-1002 translation equivalents requires special pre-processing : * sentence alignment ; we used
C02-1027 postediting of the output of the pre-processing tools were undertaken . The main
C00-2136 can be added . 3 Methodology 3.1 Pre-processing : Syntactic Analysis Before at
hide detail