According to our assumption , most of the words with similar <term> context features </term>
encodes <term> honorifics </term> ( respectful words ) . <term> Honorifics </term> are used extensively
ungrammatically , missing out or repeating words , breaking-off and restarting , speaking
lr,21-2-C90-3072,bq dictionaries of word forms </term> instead of <term> words </term> . This approach is sufficient for
other,1-2-P01-1009,bq </term> , and <term> besides </term> . These <term> words </term> appear frequently enough in <term>
other,1-8-C92-3165,bq practical systems . Detected <term> unknown words </term> can be incrementally incorporated
other,11-1-P01-1009,bq analysis </term> for a large class of <term> words </term> called <term> alternative markers </term>
other,11-4-P82-1035,bq </term> can be used to figure out <term> unknown words </term> from <term> context </term> , constrain
other,12-2-P06-2001,bq little <term> corpus </term> of 100,000 <term> words </term> , the system guesses correctly not
other,15-3-C04-1147,bq compute <term> similarity </term> between <term> words </term> or use <term> lexical affinity </term>
other,15-4-P03-1051,bq segmented corpus </term> of about 110,000 <term> words </term> . To improve the <term> segmentation
other,15-5-A92-1027,bq </term> based on the placement of <term> function words </term> , and by <term> heuristic rules </term>
other,16-2-P04-2005,bq <term> topic signature </term> is a set of <term> words </term> that tend to co-occur with it . <term>
other,18-3-I05-5003,bq of speech information </term> of the <term> words </term> contributing to the <term> word matches
other,18-4-H01-1042,bq language essays </term> in less than 100 <term> words </term> . Even more illuminating was the
other,19-2-C92-4199,bq is proposed for identifying <term> unknown words </term> , especially <term> personal names </term>
other,21-4-P82-1035,bq possible <term> word-senses </term> of <term> words with multiple meanings </term> ( <term> ambiguity
other,21-6-A94-1026,bq semantic categories </term> of the <term> adjoining words </term> . The method accurately determines
other,22-1-A94-1007,bq <term> but </term> and the equivalent <term> words </term> . <term> Syntactic analysis of the
other,23-5-J05-4003,bq <term> parallel corpus </term> ( 100,000 <term> words </term> ) and exploiting a large <term> non-parallel
hide detail