tech,22-2-P03-1051,ak | Our method is seeded by a <term> small manually segmented Arabic corpus </term> and uses it to bootstrap an <term> unsupervised algorithm </term> to build the <term> Arabic word segmenter </term> from a <term> large unsegmented Arabic corpus </term> . | #4662 Our method is seeded by a small manually segmented Arabic corpus and uses it to bootstrap an unsupervised algorithm to build the Arabic word segmenter from a large unsegmented Arabic corpus. | |
measure(ment),10-6-P03-1051,ak | The resulting <term> Arabic word segmentation system </term> achieves around 97 % <term> exact match accuracy </term> on a <term> test corpus </term> containing 28,449 <term> word tokens </term> . | #4755 The resulting Arabic word segmentation system achieves around 97% exact match accuracy on a test corpus containing 28,449 word tokens. |