ACL RD-TEC 1.0 Summarization of W00-1323
Paper Title:
COMBINING LEXICAL AND FORMATTING CUES FOR NAMED ENTITY ACQUISITION FROM THE WEB
COMBINING LEXICAL AND FORMATTING CUES FOR NAMED ENTITY ACQUISITION FROM THE WEB
Authors: Christian Jacquemin and Caroline Bush
Primarily assigned technology terms:
- corpus harvesting
- data collection
- database
- discourse marker
- harvesting
- illustration
- induction
- knowledge acquisition
- language engineering
- learning
- learning techniques
- linguistic analysis
- machine learning
- machine learning techniques
- matching
- name tagging
- nlp
- parser
- parsers
- parsing
- pattern matching
- processing
- quantitative evaluation
- search
- search engine
- search engines
- shallow parser
- splitting
- string matching
- syntactic analysis
- tagging
- wrapper induction
Other assigned terms:
- abbreviations
- ambiguity
- anchor
- anchors
- approach
- case
- characters
- corpora
- corpus size
- determiner
- determiners
- discourse
- discourse markers
- distribution
- document
- evaluations
- fact
- grammar
- human inspection
- hypernym
- knowledge
- linguistic
- linguistic features
- linguistic pattern
- linguistics
- mark-up
- meaning
- medical science
- named entities
- named entity
- names
- paragraph
- part of speech
- phrase
- plural noun
- polysemy
- precision
- prepositional phrase
- procedure
- process
- queries
- query
- regular expressions
- relative clause
- sentence
- sentences
- statistics
- tagging task
- tags
- technique
- text
- translations
- type ambiguity
- usability
- verb
- web page
- web pages
- word
- words
- wrapper