ACL RD-TEC 1.0 Summarization of P06-2058

Paper Title:
OBFUSCATING DOCUMENT STYLOMETRY TO PRESERVE AUTHOR ANONYMITY

Authors: Gary Kacmarcik and Michael Gamon

Other assigned terms:

  • approach
  • association for computational linguistics
  • author attribution
  • authorship
  • authorship attribution
  • baseline model
  • bias
  • case
  • classification accuracy
  • corpora
  • data set
  • document
  • document feature
  • document sets
  • email
  • fact
  • feature
  • feature set
  • feature sets
  • feature value
  • feature vector
  • feature vectors
  • french
  • function word
  • function words
  • genre
  • grammar
  • hypothesis
  • implementation
  • language usage
  • large corpus
  • likelihood
  • linguistic
  • linguistic expressions
  • linguistics
  • measure
  • measures
  • message
  • method
  • ordered list
  • paragraph
  • paraphrase
  • probabilities
  • probability
  • process
  • punctuation
  • punctuation marks
  • rewrite rules
  • root node
  • sentence
  • style
  • svms
  • tags
  • technique
  • term
  • term frequency
  • terms
  • test data
  • test data set
  • test set
  • text
  • tokens
  • toolkit
  • training
  • training corpus
  • training set
  • tree
  • trees
  • word
  • word frequencies
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***