other,17-1-N04-4028,bq <term> Information extraction techniques </term> automatically create <term> structured databases </term> from <term> unstructured data sources </term> , such as the <term> Web </term> or <term> newswire documents </term> .
tech,19-4-N04-4028,bq The <term> information extraction system </term> we evaluate is based on a <term> linear-chain conditional random field ( CRF ) </term> , a <term> probabilistic model </term> which has performed well on <term> information extraction tasks </term> because of its ability to capture arbitrary , overlapping <term> features </term> of the input in a <term> Markov model </term> .
other,26-4-N04-4028,bq The <term> information extraction system </term> we evaluate is based on a <term> linear-chain conditional random field ( CRF ) </term> , a <term> probabilistic model </term> which has performed well on <term> information extraction tasks </term> because of its ability to capture arbitrary , overlapping <term> features </term> of the input in a <term> Markov model </term> .
other,10-5-N04-4028,bq We implement several techniques to estimate the <term> confidence </term> of both <term> extracted fields </term> and entire <term> multi-field records </term> , obtaining an <term> average precision </term> of 98 % for retrieving correct <term> fields </term> and 87 % for <term> multi-field records </term> .
other,7-5-N04-4028,bq We implement several techniques to estimate the <term> confidence </term> of both <term> extracted fields </term> and entire <term> multi-field records </term> , obtaining an <term> average precision </term> of 98 % for retrieving correct <term> fields </term> and 87 % for <term> multi-field records </term> .
other,14-5-N04-4028,bq We implement several techniques to estimate the <term> confidence </term> of both <term> extracted fields </term> and entire <term> multi-field records </term> , obtaining an <term> average precision </term> of 98 % for retrieving correct <term> fields </term> and 87 % for <term> multi-field records </term> .
measure(ment),19-5-N04-4028,bq We implement several techniques to estimate the <term> confidence </term> of both <term> extracted fields </term> and entire <term> multi-field records </term> , obtaining an <term> average precision </term> of 98 % for retrieving correct <term> fields </term> and 87 % for <term> multi-field records </term> .
other,5-1-N04-4028,bq <term> Information extraction techniques </term> automatically create <term> structured databases </term> from <term> unstructured data sources </term> , such as the <term> Web </term> or <term> newswire documents </term> .
other,12-3-N04-4028,bq For many reasons , it is highly desirable to accurately estimate the <term> confidence </term> the system has in the correctness of each <term> extracted field </term> .
tech,10-4-N04-4028,bq The <term> information extraction system </term> we evaluate is based on a <term> linear-chain conditional random field ( CRF ) </term> , a <term> probabilistic model </term> which has performed well on <term> information extraction tasks </term> because of its ability to capture arbitrary , overlapping <term> features </term> of the input in a <term> Markov model </term> .
other,15-1-N04-4028,bq <term> Information extraction techniques </term> automatically create <term> structured databases </term> from <term> unstructured data sources </term> , such as the <term> Web </term> or <term> newswire documents </term> .
other,21-3-N04-4028,bq For many reasons , it is highly desirable to accurately estimate the <term> confidence </term> the system has in the correctness of each <term> extracted field </term> .
other,32-5-N04-4028,bq We implement several techniques to estimate the <term> confidence </term> of both <term> extracted fields </term> and entire <term> multi-field records </term> , obtaining an <term> average precision </term> of 98 % for retrieving correct <term> fields </term> and 87 % for <term> multi-field records </term> .
other,8-1-N04-4028,bq <term> Information extraction techniques </term> automatically create <term> structured databases </term> from <term> unstructured data sources </term> , such as the <term> Web </term> or <term> newswire documents </term> .
tech,0-1-N04-4028,bq Results indicate that the system yields higher performance than a <term> baseline </term> on all three aspects . <term> Information extraction techniques </term> automatically create <term> structured databases </term> from <term> unstructured data sources </term> , such as the <term> Web </term> or <term> newswire documents </term> .
other,38-4-N04-4028,bq The <term> information extraction system </term> we evaluate is based on a <term> linear-chain conditional random field ( CRF ) </term> , a <term> probabilistic model </term> which has performed well on <term> information extraction tasks </term> because of its ability to capture arbitrary , overlapping <term> features </term> of the input in a <term> Markov model </term> .
measure(ment),7-2-N04-4028,bq Despite the successes of these systems , <term> accuracy </term> will always be imperfect .
other,27-5-N04-4028,bq We implement several techniques to estimate the <term> confidence </term> of both <term> extracted fields </term> and entire <term> multi-field records </term> , obtaining an <term> average precision </term> of 98 % for retrieving correct <term> fields </term> and 87 % for <term> multi-field records </term> .
model,44-4-N04-4028,bq The <term> information extraction system </term> we evaluate is based on a <term> linear-chain conditional random field ( CRF ) </term> , a <term> probabilistic model </term> which has performed well on <term> information extraction tasks </term> because of its ability to capture arbitrary , overlapping <term> features </term> of the input in a <term> Markov model </term> .
tech,1-4-N04-4028,bq The <term> information extraction system </term> we evaluate is based on a <term> linear-chain conditional random field ( CRF ) </term> , a <term> probabilistic model </term> which has performed well on <term> information extraction tasks </term> because of its ability to capture arbitrary , overlapping <term> features </term> of the input in a <term> Markov model </term> .
hide detail