Concordance

Query N04-4028 20 >
Sort Left 20 (932.4 per million)

measure(ment),7-2-N04-4028,bq	Despite the successes of these systems , <term>	accuracy	</term> will always be imperfect . For many	#6781 Despite the successes of these systems, accuracy will always be imperfect.
tech,0-1-N04-4028,bq	baseline </term> on all three aspects . <term>	Information extraction techniques	</term> automatically create <term> structured	#6754 Results indicate that the system yields higher performance than a baseline on all three aspects. Information extraction techniques automatically create structured databases from unstructured data sources, such as the Web or newswire documents.
tech,19-4-N04-4028,bq	conditional random field ( CRF ) </term> , a <term>	probabilistic model	</term> which has performed well on <term>	#6830 The information extraction system we evaluate is based on a linear-chain conditional random field (CRF), a probabilistic model which has performed well on information extraction tasks because of its ability to capture arbitrary, overlapping features of the input in a Markov model.
model,44-4-N04-4028,bq	<term> features </term> of the input in a <term>	Markov model	</term> . We implement several techniques	#6855 The information extraction system we evaluate is based on a linear-chain conditional random field (CRF), a probabilistic model which has performed well on information extraction tasks because of its ability to capture arbitrary, overlapping features of the input in a Markov model.
tech,10-4-N04-4028,bq	system </term> we evaluate is based on a <term>	linear-chain conditional random field ( CRF )	</term> , a <term> probabilistic model </term>	#6821 The information extraction system we evaluate is based on a linear-chain conditional random field (CRF), a probabilistic model which has performed well on information extraction tasks because of its ability to capture arbitrary, overlapping features of the input in a Markov model.
measure(ment),19-5-N04-4028,bq	multi-field records </term> , obtaining an <term>	average precision	</term> of 98 % for retrieving correct <term>	#6877 We implement several techniques to estimate the confidence of both extracted fields and entire multi-field records, obtaining an average precision of 98% for retrieving correct fields and 87% for multi-field records.
other,10-5-N04-4028,bq	the <term> confidence </term> of both <term>	extracted fields	</term> and entire <term> multi-field records	#6868 We implement several techniques to estimate the confidence of both extracted fields and entire multi-field records, obtaining an average precision of 98% for retrieving correct fields and 87% for multi-field records.
other,27-5-N04-4028,bq	</term> of 98 % for retrieving correct <term>	fields	</term> and 87 % for <term> multi-field records	#6885 We implement several techniques to estimate the confidence of both extracted fields and entire multi-field records, obtaining an average precision of 98% for retrieving correct fields and 87% for multi-field records.
other,5-1-N04-4028,bq	techniques </term> automatically create <term>	structured databases	</term> from <term> unstructured data sources	#6759 Information extraction techniques automatically create structured databases from unstructured data sources, such as the Web or newswire documents.
other,21-3-N04-4028,bq	system has in the correctness of each <term>	extracted field	</term> . The <term> information extraction	#6808 For many reasons, it is highly desirable to accurately estimate the confidence the system has in the correctness of each extracted field.
other,14-5-N04-4028,bq	<term> extracted fields </term> and entire <term>	multi-field records	</term> , obtaining an <term> average precision	#6872 We implement several techniques to estimate the confidence of both extracted fields and entire multi-field records, obtaining an average precision of 98% for retrieving correct fields and 87% for multi-field records.
other,32-5-N04-4028,bq	correct <term> fields </term> and 87 % for <term>	multi-field records	</term> . We present a novel approach for	#6890 We implement several techniques to estimate the confidence of both extracted fields and entire multi-field records, obtaining an average precision of 98% for retrieving correct fields and 87% for multi-field records.
other,8-1-N04-4028,bq	<term> structured databases </term> from <term>	unstructured data sources	</term> , such as the <term> Web </term> or <term>	#6762 Information extraction techniques automatically create structured databases from unstructured data sources, such as the Web or newswire documents.
other,26-4-N04-4028,bq	</term> which has performed well on <term>	information extraction tasks	</term> because of its ability to capture	#6837 The information extraction system we evaluate is based on a linear-chain conditional random field (CRF), a probabilistic model which has performed well on information extraction tasks because of its ability to capture arbitrary, overlapping features of the input in a Markov model.
other,17-1-N04-4028,bq	</term> , such as the <term> Web </term> or <term>	newswire documents	</term> . Despite the successes of these	#6771 Information extraction techniques automatically create structured databases from unstructured data sources, such as the Web or newswire documents.
other,38-4-N04-4028,bq	to capture arbitrary , overlapping <term>	features	</term> of the input in a <term> Markov model	#6849 The information extraction system we evaluate is based on a linear-chain conditional random field (CRF), a probabilistic model which has performed well on information extraction tasks because of its ability to capture arbitrary, overlapping features of the input in a Markov model.
tech,1-4-N04-4028,bq	each <term> extracted field </term> . The <term>	information extraction system	</term> we evaluate is based on a <term> linear-chain	#6812 The information extraction system we evaluate is based on a linear-chain conditional random field (CRF), a probabilistic model which has performed well on information extraction tasks because of its ability to capture arbitrary, overlapping features of the input in a Markov model.
other,15-1-N04-4028,bq	unstructured data sources </term> , such as the <term>	Web	</term> or <term> newswire documents </term>	#6769 Information extraction techniques automatically create structured databases from unstructured data sources, such as the Web or newswire documents.
other,12-3-N04-4028,bq	desirable to accurately estimate the <term>	confidence	</term> the system has in the correctness	#6799 For many reasons, it is highly desirable to accurately estimate the confidence the system has in the correctness of each extracted field.
other,7-5-N04-4028,bq	several techniques to estimate the <term>	confidence	</term> of both <term> extracted fields </term>	#6865 We implement several techniques to estimate the confidence of both extracted fields and entire multi-field records, obtaining an average precision of 98% for retrieving correct fields and 87% for multi-field records.

