other,19-6-P03-1051,ak | test corpus </term> containing 28,449 <term> | word tokens | </term> . We believe this is a state-of-the-art | #4764 The resulting Arabic word segmentation system achieves around 97% exact match accuracy on a test corpus containing 28,449 word tokens . |