ACL RD-TEC 1.0 Summarization of W05-0908
Paper Title:
ON SOME PITFALLS IN AUTOMATIC EVALUATION AND SIGNIFICANCE TESTING FOR MT
Authors: Stefan Riezler and John T. Maxwell
Primarily assigned technology terms:
- algorithm
- approximate randomization test
- automatic evaluation
- bootstrap
- bootstrap sampling
- computational linguistics
- decoding
- dependency-based parsing
- discriminative reranking
- estimator
- extrinsic evaluation
- feature selection
- incremental feature selection
- intrinsic evaluation
- language modeling
- language processing
- machine translation
- matching
- maximum-entropy
- measuring
- modeling
- natural language processing
- optimization
- parsing
- processing
- randomization
- randomization test
- randomization testing
- regularization
- reporting
- reranking
- sampling
- scoring
- selection technique
- significance testing
- smt system
- statistical hypothesis
- statistical machine translation
- statistical significance testing
- summarization
- word aligner
- word alignment
Other assigned terms:
- approximate randomization
- argumentation
- association for computational linguistics
- baseline model
- benchmark
- bigram
- bleu
- bleu score
- case
- coefficient
- dependency relations
- development set
- distribution
- error rate
- estimation
- evaluation measure
- evaluation measures
- evaluation metric
- evaluation metrics
- evaluation task
- evaluations
- extrinsic evaluation measures
- f-score
- fact
- feature
- feature sets
- grammatical relations
- hypothesis
- hypothesis test
- inferences
- knowledge
- language model
- lexical choice
- likelihood
- linguistics
- log-likelihood
- log-linear model
- meaning
- measure
- measures
- method
- mt evaluation
- n-gram
- n-grams
- natural language
- nist
- null hypothesis
- optimization criterion
- order variation
- parallel corpus
- parameter values
- parse
- phrase
- phrase-based system
- precision
- probability
- procedure
- reference translation
- reference translations
- relation
- semantic
- sentence
- sentences
- similarity measures
- statistic
- statistical significance
- statistics
- structural information
- system development
- technique
- technologies
- term
- test corpus
- test data
- test set
- textbook
- training
- training and test data
- training data
- training set
- translation quality
- translational adequacy
- translations
- trigram
- word
- word order
- word order variation
- words