ACL RD-TEC 1.0 Summarization of W06-2918
Paper Title:
USING GAZETTEERS IN DISCRIMINATIVE INFORMATION EXTRACTION
Authors: Andrew Smith and Miles Osborne
Primarily assigned technology terms:
- algorithm
- capitalization
- clustering
- computational linguistics
- computational natural language learning
- conditional random fields
- crf decoding
- crf training
- decision making
- decoding
- encoding
- entity recognition
- error analysis
- extraction systems
- feature encoding
- illustration
- information extraction
- information extraction systems
- language learning
- learning
- loglinear
- maximum likelihood
- modelling
- named entity recognition
- natural language learning
- np-chunking
- numerical optimisation
- optimisation
- partitioning
- recognition
- search
- training procedure
Other assigned terms:
- approach
- association for computational linguistics
- biomedical domain
- case
- clusters
- conditional probability
- conll-x
- data set
- development set
- distribution
- english dataset
- entity types
- events
- external knowledge
- f score
- fact
- feature
- feature set
- feature sets
- feature types
- gaussian prior
- gazetteer
- gazetteer information
- gene names
- heuristics
- interpretation
- knowledge
- labeling
- likelihood
- linguistics
- local context
- log-likelihood
- measure
- method
- model parameters
- n-grams
- named entity
- names
- natural language
- opinion
- parameter values
- predicates
- prior distribution
- probability
- probability distribution
- procedure
- punctuation
- search space
- sentence
- sentences
- statistical models
- tags
- terms
- test set
- tokens
- training
- training data
- training set
- word
- words