ACL RD-TEC 1.0 Summarization of I05-5002
Paper Title:
AUTOMATICALLY CONSTRUCTING A CORPUS OF SENTENTIAL PARAPHRASES
AUTOMATICALLY CONSTRUCTING A CORPUS OF SENTENTIAL PARAPHRASES
Authors: William B. Dolan and Chris Brockett
Primarily assigned technology terms:
- algorithm
- author identification
- classifier
- classifiers
- clustering
- corpus extraction
- corpus selection
- document clustering
- identification
- learning
- learning algorithm
- machine translation
- machine translation systems
- paraphrase identification
- paraphrase recognition
- rating
- recognition
- search
- statistical learning
- summarization
- tagging
- translation systems
- validation
Other assigned terms:
- alignment error rate
- anaphora
- anaphors
- authorship
- clusters
- corpus coverage and quality
- data set
- document
- error rate
- heuristic
- heuristics
- information content
- inter-rater agreement
- kappa
- knowledge
- large corpus
- lexical content
- meanings
- methodology
- paraphrase
- paraphrase corpus
- paraphrases
- positive and negative examples
- precision
- search space
- seed
- sentence
- sentences
- stem
- tagging task
- technique
- technology
- text
- training
- training corpus
- translations
- words