ACL RD-TEC 1.0 Summarization of W05-1301
Paper Title:
WEAKLY SUPERVISED LEARNING METHODS FOR IMPROVING THE QUALITY OF GENE NAME NORMALIZATION DATA
WEAKLY SUPERVISED LEARNING METHODS FOR IMPROVING THE QUALITY OF GENE NAME NORMALIZATION DATA
Primarily assigned technology terms:
- algorithm
- bagging
- biomedical text mining
- biomedical text processing
- bootstrap
- classification
- classifier
- classifiers
- co-training
- conditional random fields
- crfs
- cross validation
- crossvalidation
- database
- databases
- entity tagger
- entropy classifier
- error analysis
- exact matching
- expectation maximization
- expectation maximization algorithm
- factoring
- feature split
- gene name normalization
- gene tagger
- identification
- learning
- learning approaches
- learning methods
- learning system
- logistic regression
- machine learning
- matching
- maximal matching
- maximization algorithm
- maximum entropy
- maximum entropy classifier
- maximum entropy classifiers
- mining
- name normalization
- nlp
- normalization
- processing
- ranking
- re-training
- regression
- self-training
- splitting
- supervised learning
- tagger
- tagging
- text mining
- text processing
- tuning
- validation
- weakly supervised learning
Other assigned terms:
- affixes
- approach
- bias
- biomedical text
- case
- community
- concepts
- conditional probability
- data flow
- data set
- data sets
- determiners
- development set
- entity types
- entropy
- entropy models
- evaluation data
- f-measure
- fact
- feature
- feature set
- feature types
- flybase
- gaussian prior
- gene name
- genia
- genia corpus
- heuristic
- labeled training data
- labeling
- lexical entries
- lexicon
- log-likelihood
- maximum entropy models
- method
- methodology
- model parameters
- ontology
- phrase
- prefixes and suffixes
- prepositions
- probabilities
- probability
- process
- punctuation
- suffixes
- synonym
- system description
- system performance
- tags
- test data
- text
- tokens
- trained model
- training
- training corpus
- training data
- training instance
- training set
- training time
- words