ACL RD-TEC 1.0 Summarization of W03-1118
Paper Title:
TEXT CATEGORIZATION USING AUTOMATICALLY ACQUIRED DOMAIN ONTOLOGY
Authors: Shih-Hung Wu and Tzong-Han Tsai and Wen-Lian Hsu
Primarily assigned technology terms:
- acquisition process
- algorithm
- automatic method
- categorization
- chinese news categorization
- classifier
- database
- decision tree
- decision tree algorithm
- domain identification
- editing
- identification
- inference engine
- infomap ontology
- information retrieval
- iterative process
- keyword identification
- knowledge discovery
- knowledge management
- knowledge management system
- knowledge representation
- language understanding
- natural language understanding
- news categorization
- nlp
- ontology acquisition
- ontology-based approach
- qa system
- question answering
- segmentation
- statistical methods
- text categorization
- tf/idf
- threshold selection
- training process
- tree algorithm
- weighting
- word segmentation
Other assigned terms:
- adjective
- ambiguity
- approach
- characters
- chinese characters
- chinese corpus
- chinese sentence
- cluster
- co-occurrence
- concept
- concepts
- corpora
- correlation
- dictionary
- distribution
- document
- document frequency
- domain corpus
- domain knowledge
- domain ontology
- event structure
- events
- experimental results
- f-score
- feature
- frame
- hyponym
- hyponyms
- index
- inverse document frequency
- keyword
- knowledge
- latent semantic
- lexicon
- linguistic
- measure
- method
- morphological rules
- n-gram
- n-grams
- natural language
- natural language sentence
- nlp applications
- noise
- nouns
- ontologies
- ontology
- part-of-speech
- phrase
- phrase structure
- precision
- probability
- process
- representation framework
- representations
- root node
- seed
- semantic
- semantic index
- semantic relationship
- sentence
- sentences
- similarity score
- structure of a sentence
- synonym
- syntactic constraints
- taxonomy
- term
- term frequency
- testing set
- text
- topics
- training
- training corpus
- training data
- training set
- tree
- understanding
- verb
- word
- word boundary
- words