ACL RD-TEC 1.0 Summarization of W99-0701

Paper Title:
UNSUPERVISED LEARNING OF WORD BOUNDARY WITH DESCRIPTION LENGTH GAIN

Authors: Chunyu Kit and Yorick Wilks

Other assigned terms:

  • approach
  • bias
  • binary tree
  • brown corpus
  • character sequence
  • characters
  • chunks
  • co-occurrence
  • co-occurrence frequency
  • corpora
  • correlation
  • dictionary
  • english text
  • experimental results
  • fact
  • implementation
  • index
  • information theory
  • knowledge
  • language data
  • lexical item
  • lexical items
  • linguistic
  • mdl principle
  • measure
  • method
  • minimum description length
  • n-gram
  • n-grams
  • names
  • natural language
  • natural language sentences
  • precision
  • proper names
  • ptb
  • right-hand side
  • segments
  • sentences
  • statistical data
  • tags
  • technology
  • terms
  • text
  • text corpora
  • text corpus
  • theory
  • time complexity
  • tokens
  • training
  • transformation
  • tree
  • tree structure
  • utterance
  • web pages
  • word
  • word boundaries
  • word boundary
  • words
  • written corpora
  • wsj corpus

Extracted Section Types:


This page last edited on 10 May 2017.
