ACL RD-TEC 1.0 Summarization of I05-3026

Paper Title:
DESCRIPTION OF THE HKU CHINESE WORD SEGMENTATION SYSTEM FOR SIGHAN BAKEOFF 2005

Authors: Guohong Fu and Kang-Kwong Luke and Percy Ping-Wai WONG

Other assigned terms:

  • ambiguous segmentation
  • bigram
  • bigram model
  • characters
  • chinese characters
  • chinese text
  • chinese word
  • chinese words
  • corpora
  • dictionaries
  • dictionary
  • f measure
  • f-measure
  • grammar
  • lexicon
  • measure
  • measures
  • open test
  • out-of-vocabulary rate
  • part-of-speech
  • part-of-speech information
  • pfr corpus
  • precision
  • probability
  • process
  • segmentation bakeoff
  • sentence
  • sinica corpus
  • tagging task
  • tags
  • technology
  • test corpus
  • testing corpora
  • text
  • training
  • training corpora
  • training corpus
  • training data
  • word
  • word bigram model
  • word boundaries
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***