other,19-6-P03-1051,bq | test corpus </term> containing 28,449 <term> | word tokens | </term> . We believe this is a state-of-the-art | #4762 The resulting Arabic word segmentation system achieves around 97% exact match accuracy on a test corpus containing 28,449 word tokens . |