ACL RD-TEC 1.0 Summarization of W99-0608

Paper Title:
IMPROVING POS TAGGING USING MACHINE-LEARNING TECHNIQUES

Authors: Lluis Marquez, Horacio Rodriguez, Josep Carmona, and Josep Montolio

Other assigned terms:

  • 10-fold cross-validation
  • acronym
  • adjective
  • adverb
  • ambiguity
  • ambiguous word
  • ambiguous words
  • annotated corpora
  • approach
  • backoff
  • case
  • characters
  • chi-square statistic
  • classification problem
  • classification tasks
  • collocational information
  • compact representation
  • composition
  • constraint grammars
  • contextual information
  • contextual model
  • corpora
  • cross-validation experiment
  • data set
  • data sparseness
  • dictionary
  • distribution
  • error rate
  • experimental results
  • fact
  • feature
  • generation
  • grammars
  • human knowledge
  • implementation
  • independence assumption
  • index
  • information gain
  • interpolation
  • knowledge
  • language model
  • lexical information
  • lexicon
  • local context
  • method
  • morphological features
  • morphological information
  • n-gram
  • orthography
  • part of speech
  • part-of-speech
  • part-of-speech tags
  • penn treebank
  • penn treebank tag
  • penn treebank tag set
  • penn treebank tagset
  • pos tag
  • precision
  • prediction accuracy
  • probabilities
  • probability
  • probability distribution
  • procedure
  • process
  • proper noun
  • runtime
  • sentence
  • sentence level
  • statistic
  • style
  • suffix
  • suffixes
  • symbols
  • tag set
  • tagging accuracy
  • tags
  • tagset
  • target word
  • technique
  • terms
  • test set
  • text
  • training
  • training corpus
  • training data
  • training examples
  • training material
  • training set
  • tree
  • treebank
  • treebank tag set
  • trees
  • user
  • verb
  • vocabulary
  • word
  • word form
  • words
  • wsj corpus

This page last edited on 10 May 2017.