ACL RD-TEC 1.0 Summarization of P05-1060
Paper Title:
MULTI-FIELD INFORMATION EXTRACTION AND CROSS-DOCUMENT FUSION
Authors: Gideon Mann and David Yarowsky
Primarily assigned technology terms:
- add-lambda smoothing
- answer fusion
- automatic annotation
- bootstrap
- bootstrapping
- bootstrapping method
- capitalization
- conditional random field
- conditional random fields
- crfs
- cross-document information fusion
- cross-field bootstrapping
- data mining
- database
- extraction system
- extraction systems
- extractor
- fact extraction
- field extraction
- hmms
- information extraction
- information extraction systems
- information extraction tasks
- information fusion
- learning
- learning process
- mallet system
- mining
- multi-document information extraction
- multi-field extraction
- normalization
- question answering
- question answering systems
- smoothing
- statistical extraction
- summarization
- voting
Other assigned terms:
- annotation
- annotation scheme
- background model
- case
- conditional probability
- data set
- document
- document set
- estimation
- experimental results
- fact
- feature
- heuristic
- human knowledge
- hypothesis
- knowledge
- language model
- large corpus
- lexico-syntactic template
- manual annotation
- markup
- measure
- method
- n-gram
- names
- positive and negative examples
- precision
- probabilities
- probability
- probability score
- process
- question answering research
- regular expressions
- sentence
- sentences
- set size
- slot
- statistical information
- statistical models
- system performance
- template slot
- text
- training
- training data
- training documents
- training set
- training set size
- training text
- unigram
- unigram language model
- unigram model
- web pages
- word
- wordnet
- words