ACL RD-TEC 1.0 Summarization of P05-1046

Paper Title:
UNSUPERVISED LEARNING OF FIELD SEGMENTATION MODELS FOR INFORMATION EXTRACTION

Authors: Trond Grenager and Dan Klein and Christopher Manning

Other assigned terms:

  • abbreviation
  • annotated training set
  • annotation
  • approach
  • break
  • case
  • citation
  • cluster
  • computer science research
  • content words
  • convergence
  • correlations
  • data set
  • development set
  • discourse
  • distribution
  • document
  • email
  • emission model
  • english text
  • entropy
  • estimation
  • evaluation method
  • evaluations
  • events
  • experimental results
  • fact
  • function words
  • generative model
  • grammar
  • heuristic
  • human reader
  • identity uncertainty
  • knowledge
  • labeling
  • likelihood
  • linguistic
  • linguistic phenomena
  • linguistic structure
  • mapping
  • markov models
  • method
  • model parameters
  • model structure
  • named entity
  • natural language
  • noise
  • part-of-speech
  • part-of-speech tagging task
  • parts-of-speech
  • penn treebank
  • probabilistic model
  • probability
  • probability distributions
  • procedure
  • process
  • punctuation
  • search procedure
  • search space
  • segments
  • simultaneity
  • statistics
  • structured text
  • style
  • syntax
  • system development
  • tagging task
  • technique
  • technology
  • term
  • test data
  • test set
  • text
  • tokens
  • topics
  • training
  • training data
  • training documents
  • training examples
  • training set
  • transition information
  • transition matrix
  • treebank
  • unlabeled examples
  • user
  • word
  • word distribution
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***