ACL RD-TEC 1.0 Summarization of W04-1111
Paper Title:
A STATISTICAL MODEL FOR HANGEUL-HANJA CONVERSION IN TERMINOLOGY DOMAIN
Authors: Jin-Xia Huang, Sun-Mee Bae, and Key-sun Choi
Primarily assigned technology terms:
- algorithm
- analyzer
- automatic evaluation
- candidate ranking
- computer science
- decision tree
- error analysis
- hangeul-hanja conversion
- hanja correspondence selection
- kana-kanji conversion
- noisy channel model
- pos tagger
- pos tagging
- pre-processing
- processing
- ranking
- recognition
- search
- similarity calculation
- similarity evaluation
- smoothing
- syntactic analyzer
- tagger
- tagging
- terminology
- tokenization
- translator
- transliteration
- viterbi
- viterbi algorithm
- word conversion
- word recognition
- word tokenization
Other assigned terms:
- affix
- approach
- automatic conversion
- bigram
- case
- case frame
- characters
- chinese characters
- chinese corpus
- chinese language
- chinese word
- co-occurrence
- coefficient
- collocation
- concept
- concept hierarchy
- conditional probability
- content words
- context information
- conversion method
- data sparseness
- data sparseness problem
- dictionaries
- dictionary
- dictionary data
- frame
- heuristic
- implementation
- interpolation
- japanese corpus
- japanese language
- kanji
- knowledge
- knowledge base
- korean language
- language model
- language resource
- language resource utilization
- language resources
- measure
- measures
- method
- modifier
- morph
- noise
- noisy channel
- nouns
- open test
- performance evaluation
- phrase
- pinyin
- pinyin input
- pos tag
- precision
- probabilities
- probability
- process
- relation
- sentence
- source sentence
- sparseness problem
- statistical approach
- statistical information
- statistical model
- suffix
- syllables
- system evaluation
- system performance
- tag restriction
- tags
- technical domain
- term
- terms
- test set
- testing data
- thesaurus
- transfer model
- transfer probability
- tree
- unigram
- user
- verb
- word
- word co-occurrence
- word dictionary
- word level
- word model
- word sequence
- words