ACL RD-TEC 1.0 Summarization of P03-2031
Paper Title:
AUTOMATIC ACQUISITION OF NAMED ENTITY TAGGED CORPUS FROM WORLD WIDE WEB
Authors: Joohui An and Seungwoo Lee and Gary Geunbae Lee
Primarily assigned technology terms:
- automatic annotation
- automatic annotation process
- automatic generation
- decision list learning
- entity recognition
- entity recognition systems
- internet
- internet search
- language processing
- learning
- learning approach
- learning method
- learning methods
- learning system
- machine learning
- machine learning approach
- matching
- named entity recognition
- natural language processing
- part-of-speech tagger
- processing
- recognition
- recognition systems
- search
- search engine
- search engines
- segmentation
- sentence separation
- splitting
- supervised learning
- tagger
- web search
- web search engine
- world wide web
Other assigned terms:
- ambiguity
- annotated corpus
- annotation
- annotation process
- approach
- boundary ambiguity
- case
- category label
- co-reference
- compound noun
- context features
- context information
- contextual information
- corpus size
- data sparseness
- data sparseness problem
- dictionary
- document
- experimental results
- fact
- functional word
- generation
- generation process
- heuristics
- human intervention
- knowledge
- language resource
- large corpus
- large training
- linguistic
- linguistic information
- method
- n-gram
- named entities
- named entity
- names
- natural language
- ne corpus
- nouns
- part-of-speech
- person names
- procedure
- process
- proper noun
- queries
- robot
- segments
- sentence
- sentence boundary
- sentence level
- sentences
- size of the corpus
- sparseness problem
- substring
- tagged corpus
- test corpus
- text
- training
- training corpus
- web documents
- word
- word window
- words