ACL RD-TEC 1.0 Summarization of N06-1038
Paper Title:
INTEGRATING PROBABILISTIC EXTRACTION MODELS AND DATA MINING TO DISCOVER RELATIONS AND PATTERNS IN TEXT
INTEGRATING PROBABILISTIC EXTRACTION MODELS AND DATA MINING TO DISCOVER RELATIONS AND PATTERNS IN TEXT
Authors: Aron Culotta and Andrew McCallum and Jonathan Betz
Primarily assigned technology terms:
- abstracting
- algorithm
- art extraction
- bootstrap
- bottom-up extraction
- bottom-up knowledge discovery
- classification
- classification approach
- classifier
- conditional random field
- conditional random fields
- crf training
- crfs
- data mining
- database
- database construction
- databases
- entity recognition
- entropy classifier
- error reduction
- expectation-maximization
- extraction systems
- extractor
- feature induction
- forward-backward algorithm
- induction
- inductive logic programming
- information extraction
- information extraction systems
- kernel
- knowledge bases
- knowledge discovery
- language processing
- learning
- learning method
- logic programming
- logistic regression
- machine learning
- matching
- maximum entropy
- maximum entropy classifier
- mining
- modeling
- normalization
- optimization
- pairwise classification
- parsing
- part-of-speech tagging
- pattern discovery
- pattern matching
- pos tagger
- processing
- pruning
- recognition
- regression
- regularization
- relation classification
- relation extraction
- relational database
- relational pattern discovery
- search
- sequence labeling
- subjective evaluation
- supervised machine learning
- tagger
- tagging
Other assigned terms:
- ambiguity
- ambiguous word
- annotator
- approach
- background knowledge
- case
- conditional distribution
- conditional probability
- contextual features
- distribution
- document
- encyclopedia
- entropy
- fact
- feature
- feature set
- human annotator
- implementation
- inductive logic
- inferences
- knowledge
- knowledge base
- labeled training data
- labeling
- language processing tasks
- length distribution
- likelihood
- likelihood function
- log-linear combination
- logic
- measure
- method
- methodology
- model parameter
- namedentity
- names
- natural language
- noise
- normalization factor
- opinions
- paragraphs
- part-of-speech
- precision
- priori
- probabilistic model
- probability
- procedure
- processing tasks
- queries
- recognition phase
- relation
- semantic
- sentence
- sentences
- sequence model
- signal
- syntactic information
- system performance
- technique
- technology
- test set
- testing set
- text
- training
- training data
- training database
- training set
- user
- verb
- wikipedia
- word
- words