ACL RD-TEC 1.0 Summarization of H05-1057

Paper Title:
MATCHING INCONSISTENTLY SPELLED NAMES IN AUTOMATIC SPEECH RECOGNIZER OUTPUT FOR INFORMATION RETRIEVAL

Authors: Hema Raghavan and James Allan

Other assigned terms:

  • annotators
  • approach
  • asr output
  • association for computational linguistics
  • canonical form
  • case
  • characters
  • cluster
  • clusters
  • co-reference
  • community
  • contextual information
  • corpora
  • detection task
  • dictionary
  • document
  • document frequency
  • document vectors
  • edit distance
  • error rate
  • evaluation measures
  • evaluations
  • events
  • feature
  • foreign language
  • french
  • generative model
  • generative models
  • human judgments
  • ibm models
  • implementation
  • information retrieval community
  • inverse document frequency
  • language model
  • language modeling toolkit
  • levenshtein distance
  • lexicon
  • linguistics
  • machine translation model
  • mapping
  • mean average precision
  • meaning
  • measure
  • measures
  • method
  • modeling toolkit
  • named entities
  • named entity
  • names
  • natural language
  • nist
  • noisy channel
  • opinions
  • pairs of words
  • parallel corpus
  • parallel text
  • perplexity
  • person names
  • precision
  • probabilistic model
  • probabilities
  • probability
  • process
  • proper names
  • queries
  • query
  • recognition errors
  • retrieval performance
  • retrieval task
  • sentence
  • sentences
  • signal
  • size of the corpus
  • source language
  • source language text
  • statistical significance
  • string edit distance
  • target language
  • target language text
  • technique
  • technology
  • term
  • term frequency
  • test corpus
  • test set
  • text
  • toolkit
  • topics
  • training
  • training corpus
  • training set
  • transcript
  • transcriptions
  • transcripts
  • translation model
  • translation models
  • translation probabilities
  • translations
  • trec-7
  • understanding
  • user
  • vector space
  • vocabulary
  • word
  • word error rate
  • word error rates
  • words

Extracted Section Types:



This page last edited on 10 May 2017.

*** ***