tech,22-2-P03-1051,bq | unsupervised algorithm </term> to build the <term> | Arabic word segmenter | </term> from a large <term> unsegmented Arabic | #4660 Our method is seeded by a small manually segmented Arabic corpus and uses it to bootstrap an unsupervised algorithm to build the Arabic word segmenter from a large unsegmented Arabic corpus. |
measure(ment),10-6-P03-1051,bq | system </term> achieves around 97 % <term> | exact match accuracy | </term> on a <term> test corpus </term> containing | #4753 The resulting Arabic word segmentation system achieves around 97% exact match accuracy on a test corpus containing 28,449 word tokens. |