Concordance

Query pre-processing 2,037 >
GDEX 2,037 (21.1 per million)

C02-1087	classification apply VSM on their	pre-processing	stage . SOM does not reduce the
C02-1112	including . We plan to improve the	pre-processing	of our systems , the detection
C02-1033	the text , resulting from its	pre-processing	. A topic context is associated
C02-1087	have the label 0 because after	pre-processing	these articles are zero vectors
A97-1034	existing corpora without extensive	pre-processing	. • It does support the
A92-1025	identification as well as other forms of	pre-processing	. Because the pattern matcher
C02-1002	corpora . We briefly describe the	pre-processing	steps , a baseline iterative
A92-1025	Combining Information The statistical	pre-processing	methods and calculations of relevance
A92-1025	would not have worked without the	pre-processing	of relevant text , name and collocation
A97-1036	tractable by NLP software . This	pre-processing	can not usually be fully automated
C02-1027	processing in Bulgarian . First , the	pre-processing	modules for tokenisation , sentence
C02-1027	boundaries . LINGUA performs the	pre-processing	, needed as an input to the anaphora
A00-1031	the cleaning process , or during	pre-processing	, so the tagger can emit multiple
A88-1016	device ( not described here ) . *	PRE-PROCESSING	The main purpose of this first
A97-1036	formats that require extensive	pre-processing	to transform them into resources
C04-1031	expressions are identified in a	pre-processing	step in order to handle them
C00-2165	consists of three modules : the	pre-processing	module , the automatic tagging
C02-1002	translation equivalents requires special	pre-processing	: * sentence alignment ; we used
C02-1027	postediting of the output of the	pre-processing	tools were undertaken . The main
C00-2136	can be added . 3 Methodology 3.1	Pre-processing	: Syntactic Analysis Before at


	in Help