ACL RD-TEC 1.0 Summarization of W00-0803
Paper Title:
CHINESE-JAPANESE CROSS LANGUAGE INFORMATION RETRIEVAL: A HAN CHARACTER BASED APPROACH
Authors: Maruf Hasan and Yuji Matsumoto
Primarily assigned technology terms:
- algorithm
- ambiguity resolution
- approximation
- automatic segmentation
- character clustering
- character encoding
- character indexing
- chinese information retrieval
- clir method
- clustering
- coding
- comparative analysis
- computer processing
- digital library
- dimensionality reduction
- disambiguation
- document indexing
- document translation
- encoding
- indexing
- indexing approach
- information processing
- information retrieval
- internet
- internet search
- language processing
- language search
- latent semantic indexing
- learning
- learning techniques
- machine learning
- machine learning techniques
- machine translation
- mapping algorithm
- monolingual information retrieval
- morphological analysis
- name detection
- natural language processing
- neural network
- phrase indexing
- preprocessing
- processing
- query expansion
- retrieving
- search
- search engine
- search engines
- segmentation
- semantic indexing
- sense disambiguation
- smoothing
- table lookup
- table lookup mapping
- text indexing
- unification
- vector space model
- weighting
- word segmentation
- word sense disambiguation
Other assigned terms:
- ambiguity
- approach
- asian language
- bilingual text
- blank space
- case
- characters
- chinese language
- chinese text
- co-occurrences
- cosine similarity
- data sparseness
- data sparseness problem
- dictionaries
- dictionary
- dimensionality
- document
- document collection
- document vectors
- electronic information
- encoding scheme
- entropy
- experimental results
- fact
- foreign words
- french
- index
- japanese language
- japanese text
- kanji
- katakana
- knowledge
- language information
- latent semantic
- linguistic
- linguistic knowledge
- mapping
- mapping table
- meaning
- method
- monolingual dictionary
- morphemes
- n-gram
- n-grams
- names
- natural language
- ordered list
- paragraph
- paragraphs
- phrase
- phrase level
- process
- pronunciation
- proper name
- proper names
- queries
- query
- query vector
- relation
- semantic
- semantic information
- semantic relation
- sentences
- sparseness problem
- syntactic information
- technique
- technology
- term
- terms
- text
- text collection
- thesaurus
- translations
- understanding
- vector space
- weighting scheme
- word
- word sense
- words
- written texts