ACL RD-TEC 1.0 Summarization of P01-1005

Paper Title:
SCALING TO VERY VERY LARGE CORPORA FOR NATURAL LANGUAGE DISAMBIGUATION

Authors: Michele Banko and Eric Brill

Other assigned terms:

  • ambiguity
  • annotated corpus
  • annotation
  • approach
  • bias
  • case
  • checker
  • classification accuracy
  • classification task
  • community
  • corpora
  • corpus size
  • data set
  • data sets
  • disambiguation task
  • entropy
  • fact
  • grammar
  • human annotation
  • labeled training data
  • labeling
  • language classification task
  • language disambiguation task
  • large corpora
  • large training
  • large training corpora
  • linguistic
  • linguistic information
  • manual annotation
  • measure
  • method
  • natural language
  • nlp community
  • parse
  • part of speech
  • part-of-speech
  • probabilities
  • probability
  • representations
  • seed
  • sentence
  • sentences
  • set size
  • small training corpora
  • tags
  • target word
  • technique
  • test set
  • text
  • text corpora
  • training
  • training corpora
  • training corpus
  • training data
  • training instance
  • training material
  • training samples
  • training set
  • training set size
  • training size
  • training time
  • transcripts
  • trees
  • unlabeled corpus
  • unlabeled examples
  • wall street journal text
  • word
  • word sense
  • words

Extracted Section Types:


This page last edited on 10 May 2017.
