ACL RD-TEC 1.0 Summarization of P06-1125

Paper Title:
A PHONETIC-BASED APPROACH TO CHINESE CHAT TEXT NORMALIZATION

Authors: Yunqing Xia, Kam-Fai Wong, and Wenjie Li

Other assigned terms:

  • approach
  • association for computational linguistics
  • backoff
  • case
  • characters
  • chinese characters
  • chinese corpus
  • chinese language
  • chinese language corpus
  • chinese text
  • corpora
  • data sparseness
  • data sparseness problem
  • dictionaries
  • dictionary
  • discourse
  • distribution
  • estimation
  • experimental results
  • f-1 measure
  • fact
  • feature
  • formalism
  • implementation
  • intention
  • language corpora
  • language corpus
  • language model
  • likelihood
  • linguistics
  • mapping
  • mapping model
  • mappings
  • meanings
  • measure
  • method
  • methodology
  • natural language
  • phonetic mapping model
  • phonetic similarity
  • pinyin
  • precision
  • probabilities
  • probability
  • sentence
  • sentences
  • sentential context
  • simplified chinese
  • source channel model
  • sparse data
  • sparse data problem
  • sparseness problem
  • statistical approach
  • statistics
  • technique
  • term
  • term distribution
  • terms
  • test set
  • text
  • text corpus
  • training
  • training data
  • training samples
  • translation model
  • trigram
  • trigram model
  • understanding
  • word
  • words

Extracted Section Types:


This page last edited on 10 May 2017.
