ACL RD-TEC 1.0 Summarization of A97-1004

Paper Title:
A MAXIMUM ENTROPY APPROACH TO IDENTIFYING SENTENCE BOUNDARIES

Authors: Jeffrey C. Reynar and Adwait Ratnaparkhi

Other assigned terms:

  • abbreviation
  • alphabet
  • annotated corpora
  • annotated corpus
  • approach
  • brown corpus
  • characters
  • contextual information
  • corpora
  • decision rule
  • distribution
  • domain-specific information
  • domain-specific knowledge
  • entropy
  • error rate
  • feature
  • genre
  • joint probability
  • joint probability distribution
  • knowledge
  • lexica
  • lexicon
  • likelihood
  • part-of-speech
  • part-of-speech tags
  • penn treebank
  • portability
  • pos tag
  • pos tag information
  • probabilities
  • probability
  • probability distribution
  • procedure
  • punctuation
  • punctuation marks
  • roman alphabet
  • sentence
  • sentence boundaries
  • sentence boundary
  • sentences
  • set size
  • suffix
  • symbol
  • symbols
  • system performance
  • tag information
  • tags
  • test data
  • test set
  • text
  • tokens
  • training
  • training corpus
  • training data
  • training data.
  • tree
  • treebank
  • wall street journal text
  • word
  • words
  • wsj corpus

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***