other,10-1-P01-1004,bq we compare the relative effects of <term> segment order </term> , <term> segmentation </term> and <term>
tech,13-1-P01-1004,bq effects of <term> segment order </term> , <term> segmentation </term> and <term> segment contiguity </term>
other,15-1-P01-1004,bq </term> , <term> segmentation </term> and <term> segment contiguity </term> on the <term> retrieval performance
measure(ment),19-1-P01-1004,bq <term> segment contiguity </term> on the <term> retrieval performance </term> of a <term> translation memory system
tech,23-1-P01-1004,bq <term> retrieval performance </term> of a <term> translation memory system </term> . We take a selection of both <term>
tech,6-2-P01-1004,bq </term> . We take a selection of both <term> bag-of-words and segment order-sensitive string comparison methods </term> , and run each over both <term> character
lr,19-2-P01-1004,bq methods </term> , and run each over both <term> character - and word-segmented data </term> , in combination with a range of <term>
model,31-2-P01-1004,bq </term> , in combination with a range of <term> local segment contiguity models </term> ( in the form of <term> N-grams </term>
model,40-2-P01-1004,bq contiguity models </term> ( in the form of <term> N-grams </term> ) . Over two distinct <term> datasets
lr,3-3-P01-1004,bq N-grams </term> ) . Over two distinct <term> datasets </term> , we find that <term> indexing </term>
tech,8-3-P01-1004,bq <term> datasets </term> , we find that <term> indexing </term> according to simple <term> character
model,12-3-P01-1004,bq indexing </term> according to simple <term> character bigrams </term> produces a <term> retrieval accuracy
measure(ment),16-3-P01-1004,bq character bigrams </term> produces a <term> retrieval accuracy </term> superior to any of the tested <term>
model,24-3-P01-1004,bq </term> superior to any of the tested <term> word N-gram models </term> . Further , in their optimum <term>
other,5-4-P01-1004,bq </term> . Further , in their optimum <term> configuration </term> , <term> bag-of-words methods </term>
tech,7-4-P01-1004,bq optimum <term> configuration </term> , <term> bag-of-words methods </term> are shown to be equivalent to <term>
tech,15-4-P01-1004,bq </term> are shown to be equivalent to <term> segment order-sensitive methods </term> in terms of <term> retrieval accuracy
measure(ment),21-4-P01-1004,bq order-sensitive methods </term> in terms of <term> retrieval accuracy </term> , but much faster . We also provide
hide detail