ACL RD-TEC 1.0 Summarization of P03-1041
Paper Title:
EFFECTIVE PHRASE TRANSLATION EXTRACTION FROM ALIGNMENT MODELS
Authors: Ashish Venugopal and Stephan Vogel and Alex Waibel
Primarily assigned technology terms:
- approximation
- clustering
- decoder
- decoding
- density estimation
- extraction method
- extraction technique
- giza
- harvesting
- hmm method
- hypothesis translation
- ibm translation
- identification
- link clustering
- machine translation
- maximum approximation
- maximum likelihood
- morphology
- normalization
- partitioning
- phrasal extraction
- phrasal lexicon
- phrasal translation
- phrase extraction
- phrase translation
- preprocessing
- pruning
- scoring
- scoring method
- segmentation
- sentence alignment
- sentence scoring
- splitting
- tagging
- translation extraction
- translation process
- validation
- viterbi
- weighting
- word alignment
Other assigned terms:
- aligned parallel corpus
- alignment model
- alignment models
- approach
- bayesian framework
- bilingual dictionary
- bleu
- bleu metric
- bleu score
- candidate translation
- chinese word
- conditional probability
- data consortium
- dictionary
- distribution
- english sentence
- estimation
- extraction process
- function words
- generation
- generation task
- gold standard
- hypotheses
- hypothesis
- ibm model
- ibm models
- index
- joint probability
- knowledge
- language model
- language pair
- lexica
- lexicon
- likelihood
- linguistics
- linguistics data
- measure
- measures
- method
- n-gram
- n-grams
- nist
- noise
- normal distribution
- parallel corpus
- perplexity
- phrase
- phrase boundary
- phrase level
- precision
- probabilities
- probability
- probability estimate
- probability estimates
- probability score
- process
- reference translations
- reordering
- scoring metric
- sentence
- sentence length model
- sentence level
- sentence pair
- sentences
- source language
- source sentence
- statistic
- statistical significance
- target language
- target sentence
- target word
- technique
- test set
- text
- toolkit
- training
- training corpus
- translation hypothesis
- translation lexicon
- translation model
- translation models
- translation pair
- translation pairs
- translation probabilities
- translation probability
- translation quality
- translations
- unigram
- viterbi path
- word
- word alignment model
- word level
- words