ACL RD-TEC 1.0 Summarization of P06-1002
Paper Title:
GOING BEYOND AER: AN EXTENSIVE ANALYSIS OF WORD ALIGNMENTS AND THEIR IMPACT ON MT
GOING BEYOND AER: AN EXTENSIVE ANALYSIS OF WORD ALIGNMENTS AND THEIR IMPACT ON MT
Authors: Necip Fazil Ayan and Bonnie J. Dorr
Primarily assigned technology terms:
- alignment combination
- alignment evaluation
- automated evaluation
- bipartite matching
- chinese-to-english translation
- computational linguistics
- computing
- decoder
- decoding
- giza
- hidden markov
- hidden markov models
- intrinsic evaluation
- language modeling
- learning
- learning techniques
- lexical weighting
- machine translation
- matching
- maximum entropy
- maximum weighted bipartite matching
- modeling
- mt system
- mt systems
- perceptron
- perceptron learning
- phrase extraction
- phrase selection
- quantitative analysis
- ranking
- recall-oriented alignment
- scoring
- search
- smoothing
- sri language modeling
- statistical machine translation
- statistical word alignment
- supervised alignment combination
- supervised learning
- weighted bipartite matching
- weighting
- word alignment
Other assigned terms:
- aligned corpus
- alignment error rate
- approach
- association for computational linguistics
- bleu
- bleu metric
- bleu score
- bleu scores
- brevity penalty
- case
- corpora
- correlation
- data sparsity
- distribution
- entropy
- error rate
- evaluation measures
- evaluation metric
- evaluation metrics
- evaluations
- feature
- gold standard
- heuristic
- ibm models
- implementation
- language model
- language modeling toolkit
- language pair
- language pairs
- linguistics
- log-linear model
- log-linear models
- markov models
- measure
- measures
- method
- modeling toolkit
- n-grams
- nist
- pairs of words
- parallel corpora
- phrase
- phrase level
- precision
- probabilities
- process
- relation
- search space
- sentence
- technique
- terms
- test data
- test set
- text
- toolkit
- training
- training and test data
- training corpus
- training data
- translation probabilities
- translation quality
- weighting scheme
- word
- word alignments
- word level
- words