ACL RD-TEC 1.0 Summarization of H05-1061
Paper Title:
MINING KEY PHRASE TRANSLATIONS FROM WEB CORPORA
MINING KEY PHRASE TRANSLATIONS FROM WEB CORPORA
Authors: Fei Huang and Ying Zhang and Stephan Vogel
Primarily assigned technology terms:
- algorithm
- candidate selection
- computational linguistics
- distance calculation
- dynamic programming
- feature combination
- human language
- human language technology
- key phrase extraction
- language processing
- language technology
- machine translation
- machine translation system
- machine transliteration
- matching
- matching algorithm
- mining
- mt engine
- natural language processing
- ne translation
- normalization
- phrase extraction
- phrase translation
- preprocessing
- processing
- query expansion
- query expansion and translation
- query translation
- retrieving
- romanization
- search
- search engine
- searching
- statistical machine translation
- statistical machine translation system
- string alignment
- string matching
- translation extraction
- translation system
- translator
- transliteration
- word translation
Other assigned terms:
- alignment accuracy
- alignment model
- alignment probability
- approach
- association for computational linguistics
- bilingual phrase
- case
- characters
- chinese characters
- chinese words
- chinese-english lexicon
- confidence measure
- confidence scores
- corpora
- correlation
- english translation
- english translation candidate
- english translations
- english web
- fact
- feature
- feature set
- hypotheses
- hypothesis
- ibm model
- ibm models
- key phrase
- language pairs
- language processing applications
- lexical translation
- lexicon
- likelihood
- linear combination
- linguistic
- linguistic structures
- linguistics
- maps
- meaning
- measure
- measures
- method
- mixed-language web page
- monolingual corpora
- names
- natural language
- natural language processing applications
- noun phrases
- part-of-speech
- phoneme
- phonetic similarity
- phrase
- phrase translation accuracy
- pinyin
- probabilities
- probability
- pronunciation
- punctuation
- punctuation marks
- queries
- query
- reference translations
- search space
- search strategy
- semantic
- semantic features
- semantic similarity
- target string
- target word
- technology
- terms
- test set
- translation accuracy
- translation candidate
- translation candidates
- translation model
- translation models
- translation pairs
- translation probabilities
- translation probability
- translation quality
- translations
- transliteration model
- web page
- web pages
- word
- words