ACL RD-TEC 1.0 Summarization of C04-1140

Paper Title:
HIGH-PERFORMANCE TAGGING ON MEDICAL TEXTS

Authors: Udo Hahn and Joachim Wermter

Other assigned terms:

  • abbreviations
  • adjective
  • annotated corpus
  • annotation
  • annotation effort
  • annotators
  • bigram
  • biology
  • break
  • case
  • co-occurrences
  • corpora
  • distribution
  • document
  • document collection
  • document structure
  • fact
  • feature
  • genre
  • german language
  • gold standard
  • grammar
  • hypothesis
  • interpolation
  • interpretation
  • knowledge
  • language corpora
  • language data
  • language model
  • language resources
  • lexicon
  • linguistic
  • linguistic data
  • manual annotation
  • markov models
  • measure
  • measures
  • medical corpora
  • medical corpus
  • medical terminology
  • n-gram
  • n-grams
  • negra
  • negra corpus
  • news corpus
  • newspaper corpus
  • newspaper language
  • noun phrase
  • null hypothesis
  • parse
  • part-of-speech
  • pathology
  • penn treebank
  • phrase
  • portability
  • pos category
  • pos tag
  • priori
  • probability
  • probability distribution
  • procedure
  • process
  • random sample
  • sentences
  • similarity measures
  • specialist lexicon
  • standard deviation
  • statistical model
  • statistics
  • sublanguage
  • suffix
  • tagger lexicon
  • tagging accuracy
  • tagging performance
  • tags
  • tagset
  • technologies
  • technology
  • terms
  • test data
  • test set
  • text
  • text corpus
  • textbook
  • tokens
  • training
  • training corpora
  • training set
  • training size
  • treebank
  • trees
  • trigram
  • understanding
  • unigram
  • vocabulary
  • word
  • word types
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***