ACL RD-TEC 1.0 Summarization of H05-1062

Paper Title:
ROBUST NAMED ENTITY EXTRACTION FROM LARGE SPOKEN ARCHIVES

Authors: Benoit Favre, Frédéric Bechet, and Pascal Nocéra

Other assigned terms:

  • ambiguity
  • ambiguity rate
  • annotation
  • approach
  • asr output
  • association for computational linguistics
  • broadcast news
  • broadcast news data
  • composition
  • corpora
  • correlation
  • development set
  • distribution
  • document
  • document content
  • document frequency
  • document retrieval evaluation
  • domain information
  • entity type
  • entity types
  • entropy
  • entropy models
  • error rate
  • evaluations
  • events
  • extraction process
  • f-measure
  • fact
  • feature
  • french
  • grammars
  • hmm-based model
  • hypotheses
  • hypothesis
  • implementation
  • index
  • index terms
  • inverse document frequency
  • knowledge
  • labeling
  • language model
  • language models
  • language resources
  • lattice
  • lattices
  • lexicon
  • likelihood
  • linguistics
  • maximum entropy models
  • measure
  • measures
  • message
  • message understanding conferences
  • metadata
  • method
  • n-best list
  • n-gram
  • n-gram model
  • named entities
  • named entity
  • names
  • natural language
  • nist
  • noise
  • noisy input
  • oracle
  • part-of-speech
  • part-of-speech tags
  • pauses
  • precision
  • probability
  • process
  • proper names
  • recognition errors
  • retrieval task
  • search space
  • sentence
  • sentence boundaries
  • slot
  • speech input
  • statistical model
  • symbols
  • tag sequence
  • tags
  • tagset
  • technology
  • term
  • terms
  • test corpora
  • test corpus
  • test data
  • test set
  • testing data
  • text
  • text corpora
  • toolkit
  • topics
  • training
  • training and testing data
  • training corpus
  • training data
  • training set
  • transcript
  • transcriptions
  • transcripts
  • trigram
  • understanding
  • vocabulary
  • word
  • word error rate
  • word lattice
  • word lattices
  • word lists
  • word sequence
  • word sequences
  • word string
  • words
  • written texts

Extracted Section Types:


This page last edited on 10 May 2017.
