ACL RD-TEC 1.0 Summarization of P06-2058
Paper Title:
OBFUSCATING DOCUMENT STYLOMETRY TO PRESERVE AUTHOR ANONYMITY
OBFUSCATING DOCUMENT STYLOMETRY TO PRESERVE AUTHOR ANONYMITY
Authors: Gary Kacmarcik and Michael Gamon
Primarily assigned technology terms:
- algorithm
- authorship detection
- authorship identification
- authorship obfuscation
- automated authorship attribution
- capitalization
- categorization
- classification
- classifier
- classifiers
- computational linguistics
- computing
- cross-validation
- cryptography
- decision tree
- decision trees
- detection method
- evaluation process
- feature modification
- feature selection
- feature selection process
- grammar checking
- identification
- learning
- learning algorithm
- machine learning
- machine learning algorithm
- machine translation
- morphological analysis
- morphology
- optimization
- parser
- processing
- ranking
- ranking algorithm
- selection process
- spelling
- text categorization
- thresholding
- tokenization
- word processing
Other assigned terms:
- approach
- association for computational linguistics
- author attribution
- authorship
- authorship attribution
- baseline model
- bias
- case
- classification accuracy
- corpora
- data set
- document
- document feature
- document sets
- fact
- feature
- feature set
- feature sets
- feature value
- feature vector
- feature vectors
- french
- function word
- function words
- genre
- grammar
- hypothesis
- implementation
- language usage
- large corpus
- likelihood
- linguistic
- linguistic expressions
- linguistics
- measure
- measures
- message
- method
- ordered list
- paragraph
- paraphrase
- probabilities
- probability
- process
- punctuation
- punctuation marks
- rewrite rules
- root node
- sentence
- style
- svms
- tags
- technique
- term
- term frequency
- terms
- test data
- test data set
- test set
- text
- tokens
- toolkit
- training
- training corpus
- training set
- tree
- trees
- word
- word frequencies
- words