other,17-1-P03-1051,bq | word </term> consists of a sequence of <term> | morphemes | </term> in the <term> pattern </term><term> | #4617 We approximate Arabic's rich morphology by a model that a word consists of a sequence of morphemes in the pattern prefix*-stem-suffix* (* denotes zero or more occurrences of a morpheme). |
other,15-4-P03-1051,bq | segmented corpus </term> of about 110,000 <term> | words | </term> . To improve the <term> segmentation | #4704 The language model is initially estimated from a small manually segmented corpus of about 110,000 words . |