ACL RD-TEC 1.0 Summarization of J95-4004

Paper Title:
TRANSFORMATION-BASED ERROR-DRIVEN LEARNING AND NATURAL LANGUAGE PROCESSING: A CASE STUDY IN PART-OF-SPEECH TAGGING

Primarily assigned technology terms:

Other assigned terms:

  • adjective
  • adverb
  • affix
  • affixes
  • ambiguity
  • annotation
  • annotator
  • approach
  • association for computational linguistics
  • bigram
  • binary tree
  • british national corpus
  • case
  • characters
  • class information
  • classification problem
  • collocation
  • concept
  • contextual information
  • corpora
  • correlation
  • determiner
  • dictionary
  • entropy
  • error rate
  • fact
  • frequency counts
  • generation
  • grammar
  • grammar rules
  • grammars
  • index
  • knowledge
  • knowledge acquisition bottleneck
  • large corpora
  • lexical ambiguity
  • lexical association
  • lexical entries
  • lexical entry
  • lexicography
  • lexicon
  • likelihood
  • linguistic
  • linguistic behavior
  • linguistic information
  • linguistic knowledge
  • linguistic phenomena
  • linguistic structure
  • linguistics
  • manual rule construction
  • markov models
  • measure
  • measures
  • method
  • n-gram
  • names
  • natural language
  • nouns
  • ordered list
  • part of speech
  • part-of-speech
  • part-of-speech tag
  • part-of-speech tags
  • parts of speech
  • penn treebank
  • penn treebank corpus
  • personal pronoun
  • phrase
  • phrase attachment
  • phrase structure
  • predicate-argument
  • predicate-argument structure
  • preposition
  • prepositional phrase
  • prepositional phrase attachment
  • prepositions
  • probabilities
  • probability
  • procedure
  • pronoun
  • pronunciation
  • proper noun
  • queries
  • semantic
  • sentence
  • sparse data
  • statistical approach
  • statistics
  • stochastic model
  • structural ambiguity
  • style
  • suffix
  • suffixes
  • synonym
  • tag sequence
  • tagged corpora
  • tagged corpus
  • tagging accuracy
  • tags
  • technique
  • test corpus
  • test set
  • text
  • text corpora
  • text corpus
  • training
  • training corpus
  • training data
  • training material
  • training set
  • transformation
  • transition probabilities
  • tree
  • tree path
  • tree structure
  • treebank
  • treebank corpus
  • trees
  • trigram
  • unannotated text
  • untagged corpus
  • verb
  • verb form
  • vocabulary
  • wall street journal corpus
  • word
  • word class information
  • word classes
  • word corpus
  • word sequences
  • wordnet
  • words
  • wordsense
  • world knowledge

Extracted Section Types:


This page last edited on 10 May 2017.