ACL RD-TEC 1.0 Summarization of W00-1203
Paper Title:
KNOWLEDGE EXTRACTION FOR IDENTIFICATION OF CHINESE ORGANIZATION NAMES
Authors: Keh-Jiann Chen and Chao-jan Chen
Primarily assigned technology terms:
- algorithm
- ambiguity resolution
- analyzer
- automatic extraction
- automatic knowledge extraction
- automatic learning
- capitalization
- categorization
- chinese morphology
- disambiguation
- disambiguation process
- extraction algorithm
- extraction technique
- identification
- identification system
- information extraction
- keyword extraction
- knowledge extraction
- language processing
- learning
- matching
- morphological analysis
- morphological analyzer
- morphology
- natural language processing
- personal computer
- processing
- reading
- segmentation
- segmentation process
- sentence processing
- statistical methods
- unknown word identification
- web spider
- word identification
- word segmentation
Other assigned terms:
- abbreviation
- abbreviations
- ambiguity
- approach
- characters
- chinese text
- ckip dictionary
- composition
- compounding
- compounds
- concepts
- context information
- contextual information
- corpora
- dictionaries
- dictionary
- domain corpus
- extraction process
- fact
- heuristic
- heuristic rules
- implementation
- key words
- keyword
- knowledge
- lexicon
- linguistic
- linguistic knowledge
- linguistics
- location name
- matching process
- meaning
- meanings
- method
- morpheme
- morphemes
- morphological knowledge
- morphological structure
- names
- natural language
- news corpus
- nouns
- organization names
- personal names
- precision
- process
- processing model
- proper name
- proper names
- research topic
- schema
- segments
- semantic
- semantic categories
- semantic category
- semantic composition
- semantic relations
- semantic types
- sentence
- syntactic categories
- technique
- testing corpus
- text
- text corpora
- training
- training corpus
- training set
- word
- word boundaries
- word strings
- words