ACL RD-TEC 1.0 Summarization of P06-1102
Paper Title:
NAMES AND SIMILARITIES ON THE WEB: FACT EXTRACTION IN THE FAST LANE
Authors: Marius Paşca, Dekang Lin, Jeffrey Bigham, Andrei Lifchits, and Alpa Jain
Primarily assigned technology terms:
- bootstrapping
- computational linguistics
- computing
- entity recognizers
- fact extraction
- google search engine
- information extraction
- iterative acquisition
- iterative extraction
- large-scale fact extraction
- matching
- parsers
- parsing
- pattern acquisition
- processing
- ranking
- relative distance
- scoring
- search
- search engine
- syntactic parsing
- validation
- web search
Other assigned terms:
- approach
- association for computational linguistics
- case
- context window
- corpora
- distributional similarity
- document
- evaluation measures
- extraction patterns
- fact
- feature
- feature vector
- feature vectors
- frequency score
- hand-built ontology
- knowledge
- large corpus
- linear combination
- linguistics
- measure
- measures
- method
- mutual information
- named entities
- named entity
- names
- news corpus
- ontologies
- ontology
- part of speech
- part of speech tags
- part-of-speech
- part-of-speech tag
- part-of-speech tags
- phrase
- pointwise mutual information
- precision
- procedure
- process
- proper names
- queries
- relation
- search results
- seed
- sentence
- sentences
- similarity between words
- similarity measure
- similarity score
- similarity scores
- tags
- technique
- terms
- text
- text collection
- web documents
- web site
- word
- wordnet
- words