ACL RD-TEC 1.0 Summarization of A00-1012

Paper Title:
EXPERIMENTS ON SENTENCE BOUNDARY DETECTION

Authors: Mark Stevenson and Robert Gaizauskas

Other assigned terms:

  • annotation
  • annotator
  • annotators
  • approach
  • asr output
  • boundary information
  • break
  • british national corpus
  • broadcast news
  • brown corpus
  • capitalization information
  • case
  • case information
  • characters
  • classification task
  • classification tasks
  • computational approach
  • detection task
  • disambiguation task
  • discourse
  • entropy
  • error rate
  • estimation
  • evaluation metrics
  • feature
  • human annotation
  • human annotators
  • human performance
  • input text
  • inter-annotator agreement
  • kappa
  • kappa statistic
  • kappa value
  • knowledge
  • lexical information
  • linguistic
  • markup
  • measure
  • measures
  • method
  • methodology
  • opinion
  • part of speech
  • part of speech tags
  • pause
  • penn treebank
  • phoneme
  • phrase
  • pitch
  • pre-pausal lengthening
  • precision
  • priori
  • probabilities
  • probability
  • probability distributions
  • process
  • prosodic information
  • punctuation
  • punctuation marks
  • recognition model
  • sense disambiguation problem
  • sentence
  • sentence boundaries
  • sentence boundary
  • sentences
  • speech information
  • speech tag
  • spoken language
  • statistic
  • statistics
  • suffix
  • symbol
  • symbols
  • tags
  • technologies
  • technology
  • television
  • terms
  • test corpus
  • test data
  • test set
  • text
  • tokens
  • training
  • training corpus
  • training example
  • training examples
  • training text
  • transcribed speech
  • transcriptions
  • transcripts
  • tree
  • treebank
  • trigram
  • trigram model
  • vocabulary
  • wall street journal text
  • word
  • word boundaries
  • word boundary
  • word error rate
  • word sense
  • words

This page last edited on 10 May 2017.
