ACL RD-TEC 1.0 Summarization of N04-2007

Paper Title:
A PRELIMINARY LOOK INTO THE USE OF NAMED ENTITY INFORMATION FOR BIOSCIENCE TEXT TOKENIZATION

Other assigned terms:

  • 10-fold cross-validation
  • acronym
  • ambiguous punctuation
  • approach
  • biology
  • bioscience text
  • break
  • case
  • characters
  • classification problem
  • dictionary
  • distribution
  • document
  • f-measure
  • feature
  • feature vectors
  • genia
  • genia corpus
  • implementation
  • information gain
  • knowledge
  • labeling
  • majority class baseline
  • markov models
  • measures
  • medline
  • methodology
  • named entities
  • named entity
  • names
  • noun phrase
  • ontology
  • orthography
  • part-of-speech
  • parts of speech
  • phrase
  • precision
  • proper names
  • punctuation
  • queries
  • query
  • sentence
  • sentence boundary
  • tags
  • technical terminology
  • term
  • terms
  • test set
  • testing set
  • text
  • tokens
  • tree
  • words

Extracted Section Types:


This page last edited on 10 May 2017.
