ACL RD-TEC 1.0 Summarization of W06-1671
Paper Title:
LEARNING FIELD COMPATIBILITIES TO EXTRACT DATABASE RECORDS FROM UNSTRUCTURED TEXT
Authors: Michael Wick and Aron Culotta and Andrew McCallum
Primarily assigned technology terms:
- agglomerative clustering
- algorithm
- average-link clustering algorithm
- binary classifier
- binary feature function
- binary relation extraction
- classification
- classifier
- clustering
- clustering algorithm
- complex reasoning
- computational linguistics
- coreference resolution
- correlational clustering
- database
- entity recognizer
- error reduction
- extraction system
- extraction systems
- factoring
- graph partitioning
- information extraction
- information extraction systems
- iterative method
- knowledge discovery
- language processing
- learning
- link clustering
- logistic regression
- machine learning
- matching
- maximum-entropy
- maximum-likelihood
- modeling
- named-entity recognition
- natural language processing
- overlapping clustering
- partitioning
- pattern matching
- processing
- question answering
- reasoning
- recognition
- recognition systems
- recognizer
- record extraction
- regression
- relation extraction
- sampling
- segmentation
- supervised machine learning
- text analysis
- vector space model
Other assigned terms:
- approach
- association for computational linguistics
- attribute type
- binary feature
- binary relation
- binary relations
- case
- cluster
- clusters
- conditional distribution
- dependency trees
- distribution
- document
- edge weight
- fact
- feature
- field compatibility
- geometric mean
- grammar
- heuristic
- heuristics
- ie task
- inter-cluster edge weight
- knowledge
- labeled training data
- labeling
- likelihood
- linguistics
- log-likelihood
- logistic regression model
- mapping
- maps
- maximum-entropy model
- measure
- measures
- method
- model parameters
- named-entity
- names
- natural language
- parse
- positive and negative examples
- precision
- probabilistic grammar
- probabilistic model
- probabilities
- process
- regression model
- regular expressions
- relation
- schema
- sentence
- sentences
- string similarity
- syntactic information
- terms
- text
- tokens
- training
- training data
- training examples
- transitive closure
- transitivity
- trees
- vector space
- vertex
- web documents
- web pages