ACL RD-TEC 1.0 Summarization of W05-0619
Paper Title:
INVESTIGATING THE EFFECTS OF SELECTIVE SAMPLING ON THE ANNOTATION TASK
Authors: Ben Hachey and Beatrice Alex and Markus Becker
Primarily assigned technology terms:
- active annotation
- active learning
- annotation tool
- biomedicine
- bionlp
- bootstrapping
- boundary identification
- chunking
- classification
- classifiers
- coding
- computational linguistics
- computational natural language learning
- corpus preparation
- digital library
- document classification
- entity recognition
- error analysis
- error rate reduction
- error reduction
- feature split
- identification
- instrumentation
- language learning
- language processing
- learner
- learning
- learning algorithms
- learning approach
- learning methods
- machine learning
- markov model
- matching
- measuring
- named entity recognition
- natural language learning
- natural language processing
- np chunking
- parsing
- part-of-speech tagging
- pearson correlation
- phrasal alignment
- processing
- random sampling
- rate reduction
- recognition
- sample selection
- sampling
- selective sampling
- splitting
- supervised learner
- supervised training
- tagger
- tagging
Other assigned terms:
- annotated corpora
- annotation
- annotation task
- annotator
- annotator accuracy
- annotators
- approach
- association for computational linguistics
- biomedical literature
- case
- class information
- coding scheme
- coefficient
- cognitive
- computational linguist
- conditional markov model
- confusion matrix
- corpora
- correlation
- correlation coefficient
- correlations
- data sets
- distribution
- document
- entity type
- entity types
- error rate
- evaluation metrics
- evaluations
- f-measure
- f-score
- fact
- feature
- feature set
- fmeasure
- gold standard
- human annotation
- human annotators
- inter-annotator agreement
- kappa
- kappa coefficient
- linguist
- linguistics
- manual annotation
- measures
- method
- methodology
- named entity
- names
- natural language
- negra
- part-of-speech
- pearson correlation coefficient
- penn treebank
- phrase
- phrase boundary
- phrase level
- phrase type
- precision
- probability
- probability distribution
- probability distributions
- query
- seed
- sentence
- sentence level
- sentences
- terms
- test data
- tokens
- training
- treebank
- understanding
- words