ACL RD-TEC 1.0 Summarization of P06-2005

Paper Title:
A PHRASE-BASED STATISTICAL MODEL FOR SMS TEXT NORMALIZATION

Authors: AiTi Aw and Min Zhang and Juan Xiao and Jian Su

Other assigned terms:

  • abbreviations
  • ambiguity
  • approach
  • association for computational linguistics
  • backoff
  • baseline performance
  • baseline score
  • bleu
  • bleu score
  • bleu scores
  • case
  • characters
  • convergence
  • conversation
  • copula verb
  • corpora
  • customization
  • data set
  • data sparseness
  • derivations
  • dictionary
  • discourse
  • distribution
  • edit distance
  • english language
  • english sentence
  • english text
  • grammar
  • heuristics
  • joint probability
  • language expression
  • language model
  • language modeling toolkit
  • lexical ambiguity
  • lexical unit
  • lexicon
  • likelihood
  • linguistic
  • linguistics
  • machine translation model
  • main verb
  • mapping
  • mapping model
  • mappings
  • meaning
  • measure
  • measures
  • message
  • method
  • modeling toolkit
  • morpho-syntactic information
  • n-gram
  • noisy channel
  • normalization model
  • nouns
  • orthographic variation
  • parallel corpora
  • parallel corpus
  • paraphrases
  • particles
  • phrase
  • phrase level
  • phrase-based model
  • prior distribution
  • probabilities
  • probability
  • process
  • pronoun
  • pronunciation
  • punctuation
  • reordering
  • representations
  • semantic
  • semantic information
  • sentence
  • sentence boundaries
  • sentence pair
  • sentences
  • slang
  • sms text
  • source channel model
  • source text
  • statistical model
  • statistical translation model
  • statistics
  • style
  • target text
  • technique
  • text
  • text collection
  • text corpus
  • text structure
  • text style
  • tokens
  • toolkit
  • training
  • training corpus
  • training data
  • transformation
  • translation accuracy
  • translation model
  • translation output
  • translation problem
  • translation quality
  • understanding
  • unigram
  • verb
  • vocabulary
  • word
  • word-based language model
  • word-based model
  • words
  • written texts

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***