P02-1023 data described in Section 4 for bigram model training . We divided the test set described
W96-0113 . He applied this algorithm to bigram model training from untagged Japanese text for
hide detail