C04-1066 chunking method for general unknown word identification in Japanese texts . Our method
C04-1066 based on n-gram model for unknown word identification . The method estimates how likely
C04-1066 character-based chunking for unknown word identification in Japanese text . A major advantage
C04-1066 . Firstly , we examine unknown word identification experiment in newspaper articles
C04-1066 . Patent Texts We also examine word identification experiment with patent texts
C92-4173 characters into computer . Without word identification , we can not hope to achieve
D08-1097 patterns to help question head word identification . Note that these patterns depend
D08-1111 showed that the precision of new word identification was more important than the recall
C04-1066 are critical clues for the long word identification , the backward direction is effective
A88-1013 for fluctuate ) . 7.2 Unknown Words Identification of truly unknown words ( those
C04-1066 man - ually . We perform unknown word identification on newspaper articles and patent
C02-1049 segmentation . The problem of unknown word identification is considered more difficult
C92-4173 segmentation in China mainland and as word identification abroad . In recent years , it
C92-4199 Wei-Chuan Li Chao-Huang Chang Abstract Word Identification has been an important and active
D10-1121 important step for our method . Our word identification module is similar to the work
C00-1026 new words . Hence the unknown word identification for Chinese became one of the
C04-1066 </figurecaption> <title> Japanese Unknown Word Identification by Character-based Chunking </title>
C94-1096 performance of word segmentation and word identification . For segmentation , segperf.exe
C02-1057 original Japanese sentence for the word identification , the correctness of the modification
C92-4173 entity word . The difficulty of word identification has rcsnlted fi'om a confusion
hide detail