ACL RD-TEC 1.0 Summarization of P03-1020

Paper Title:
TRUECASING

Authors: Lucian Vlad Lita and Abe Ittycheriah and Salim Roukos and Nanda Kambhatla

Other assigned terms:

  • ace corpus
  • ambiguous word
  • approach
  • asr output
  • baseline score
  • beam
  • bigram
  • bleu
  • bleu score
  • bleu scores
  • boundary information
  • broadcast news
  • broadcast news data
  • case
  • case information
  • context information
  • corpora
  • data sets
  • detection task
  • distribution
  • entropy
  • estimation
  • evaluation method
  • evaluations
  • f-measure
  • fact
  • feature
  • feature space
  • interpolation
  • labeling
  • language model
  • language models
  • large training
  • lattice
  • lexical item
  • lexical items
  • local context
  • machine translation output
  • mapping
  • meaning
  • method
  • model parameters
  • morphological features
  • n-gram
  • n-grams
  • named entities
  • named entity
  • names
  • natural language
  • natural language text
  • nist
  • nlp tasks
  • nouns
  • organization names
  • perplexity
  • person names
  • precision
  • probabilities
  • probability
  • procedure
  • process
  • proper name
  • punctuation
  • qualitative analysis
  • random sample
  • semantic
  • semantic categories
  • sentence
  • sentence boundaries
  • sentence level
  • sentences
  • statistical approach
  • statistical model
  • statistics
  • surface form
  • system performance
  • technique
  • terms
  • test data
  • test set
  • text
  • text corpora
  • text segment
  • tokens
  • training
  • training corpus
  • training data
  • training examples
  • training material
  • transformation
  • transition probabilities
  • translation output
  • translations
  • trigram
  • trigram language model
  • unigram
  • unigram model
  • vocabulary
  • weighting scheme
  • word
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***