ACL RD-TEC 1.0 Summarization of W06-0117

Paper Title:
FRANCE TELECOM R&D BEIJING WORD SEGMENTER FOR SIGHAN BAKEOFF 2006

Authors: Wu Liu, Heng Li, Yuan Dong, Nan He, Haitao Luo, and Haila Wang

Other assigned terms:

  • abbreviations
  • anaphora
  • approach
  • association for computational linguistics
  • chinese language
  • chinese text
  • chinese word
  • chinese words
  • contextual information
  • contextual word
  • dictionary
  • entropy
  • f-score
  • knowledge
  • language model
  • lexicon
  • linguistics
  • method
  • named entities
  • named entity
  • names
  • ngram
  • ngram language model
  • organization names
  • person names
  • precision
  • rule template
  • segmentation bakeoff
  • statistical framework
  • statistical model
  • system description
  • tag information
  • tags
  • test corpus
  • text
  • theory
  • toolkit
  • training
  • training corpus
  • training data
  • trigram
  • trigram language model
  • window size
  • word
  • word information
  • word lists
  • word segmentation performance
  • word-based language model
  • words

This page last edited on 10 May 2017.
