ACL RD-TEC 1.0 Summarization of W04-1218
Paper Title:
ADAPTING AN NER-SYSTEM FOR GERMAN TO THE BIOMEDICAL DOMAIN
ADAPTING AN NER-SYSTEM FOR GERMAN TO THE BIOMEDICAL DOMAIN
Primarily assigned technology terms:
- algorithm
- binary classification
- bootstrapping
- bootstrapping process
- boundary detection
- capitalization
- chunker
- classification
- classifier
- classifiers
- disambiguation
- dynamic programming
- dynamic programming approach
- entity recognition
- indexing
- kernel
- kernels
- language technology
- learning
- learning algorithm
- lexical bootstrapping
- machine learning
- machine learning algorithm
- markov model
- morphological analyser
- named entity recognition
- ne tagger
- nlp
- optimization
- parsing
- part-of-speech tagger
- polynomial kernel
- pos-tagging
- post-processing
- processing
- recognition
- search
- sense disambiguation
- shallow parsing
- speech tagger
- tagger
- transcription
- weak classifier
- word sense disambiguation
Other assigned terms:
- analyser
- annotated corpus
- approach
- bigram
- biomedical domain
- community
- context words
- dictionaries
- discourse
- discourse level
- discourse unit
- discourse units
- document
- evaluation data
- f-score
- fact
- feature
- feature set
- feature sets
- genia
- genia corpus
- handcrafted knowledge
- heuristic
- heuristics
- homonymy
- kernel function
- knowledge
- labeling
- lexical resources
- lexicon
- linguistic
- linguistic knowledge
- linguistic resources
- medline
- method
- morphological features
- n-grams
- named entity
- names
- nlp community
- part of speech
- part-of-speech
- person names
- polysemous word
- polysemy
- precision
- probabilities
- process
- processing time
- programming approach
- proper names
- relative frequency
- search term
- semantic
- semantic class
- semantic classes
- suffix
- svms
- technique
- technology
- term
- terms
- text
- training
- training data
- transformation
- unlabeled corpus
- word
- word corpus
- word form
- word sense
- words