ACL RD-TEC 1.0 Summarization of W05-0712
Paper Title:
AN INTEGRATED APPROACH FOR ARABIC-ENGLISH NAMED ENTITY TRANSLATION
Authors: Hany Hassan and Jeffrey Sorensen
Primarily assigned technology terms:
- algorithm
- alignment technique
- arabic-english named entity translation
- beam search
- block translation
- boosting
- capitalization
- cross-language information retrieval
- decoder
- decoding
- dynamic programming
- editing
- entity phrase translation
- entity translation
- entity transliteration
- finite state
- finite state transducer
- grouping
- hmm viterbi training
- identification
- information retrieval
- knowledge-based system
- language processing
- machine translation
- machine translation system
- matching
- modeling
- mt system
- multi-cost alignment
- multi-cost expansion
- named entity alignment
- named entity translation
- named entity transliteration
- natural language processing
- ne alignment
- ne detection
- ne identification
- ne phrase extraction
- ne phrase translation
- ne translation
- ne transliteration
- neural network
- phase alignment
- phonetic transliteration
- phrase extraction
- phrase translation
- phrase-based translation
- processing
- scoring
- search
- semantic translation
- single phase alignment
- single reference translation
- statistical machine translation
- statistical machine translation system
- statistical mt
- system integration
- transducer
- transducers
- translation system
- transliteration
- two-phase alignment
- viterbi
- viterbi training
- weighting
- word alignment
- word translation
- word-based translation
Other assigned terms:
- alignment accuracy
- alignment models
- approach
- beam
- bilingual dictionary
- bleu
- bleu score
- case
- co-occurrence
- content words
- corpora
- dictionary
- distance measure
- edit distance
- entity type
- events
- expansion technique
- generation
- hypotheses
- hypothesis
- language model
- language models
- language pair
- language pairs
- language processing applications
- maps
- measure
- method
- monolingual corpora
- n-gram
- n-grams
- named entities
- named entity
- names
- natural language
- natural language processing applications
- organization names
- parallel corpora
- parallel corpus
- person names
- phrase
- precision
- probabilities
- probability
- procedure
- process
- pronunciation
- reference translation
- semantic
- sentence
- sentences
- source language
- source sentence
- style
- target language
- target languages
- target sentence
- target word
- technique
- temporal expressions
- term
- test set
- training
- training data
- translation models
- translation module
- translation probabilities
- translation probability
- translation problem
- translation task
- translations
- transliteration approach
- trigram
- trigram language model
- trigram model
- unigram
- unigram model
- vocabulary
- word
- words