ACL RD-TEC 1.0 Summarization of P98-1069
Paper Title:
AN IR APPROACH FOR TRANSLATING NEW WORDS FROM NONPARALLEL, COMPARABLE TEXTS
AN IR APPROACH FOR TRANSLATING NEW WORDS FROM NONPARALLEL, COMPARABLE TEXTS
Authors: Pascale Fung and Lo Yuen Yee
Primarily assigned technology terms:
- algorithm
- bilingual lexicon compilation
- chinese segmentation
- classification
- data collection
- disambiguation
- document analysis
- document classification
- document comparison
- extraction tool
- indexing
- information retrieval
- keyword weighting
- language processing
- language translation
- lexicon compilation
- machine translation
- machine translation system
- mt system
- natural language processing
- nlp
- online search
- pos tagging
- processing
- ranking
- ranking algorithm
- reasoning
- search
- search engine
- search engines
- segmentation
- sense disambiguation
- statistical methods
- statistical terminology translation
- tagging
- terminology
- terminology translation
- tf\/idf
- translation system
- vector space model
- weighting
- word extraction
- word translation
- world wide web
Other assigned terms:
- ambiguity
- approach
- bilingual corpus
- bilingual lexicon
- case
- chinese text
- chinese words
- coefficient
- collocate
- collocation
- community
- comparable corpus
- content words
- context words
- corpora
- cosine measure
- dice
- dice coefficient
- document
- document frequency
- document text
- electronic form
- english text
- english translation
- fact
- forest
- french
- function word
- inverse document frequency
- keyword
- language model
- language pairs
- large corpora
- lexicon
- linguistic
- meaning
- measure
- measures
- method
- names
- natural language
- parallel corpora
- parallel texts
- part-of-speech
- part-of-speech set
- precision
- proper names
- query
- query term
- seed
- seed words
- segments
- sentences
- similarity measure
- similarity measures
- similarity scores
- source language
- source language word
- statistical models
- synonyms
- target language
- term
- term frequency
- terms
- text
- translations
- vector space
- word
- word boundaries
- word frequencies
- word meaning
- word order
- word pair
- word vector
- words