ACL RD-TEC 1.0 Summarization of C04-1088
Paper Title:
LINGUISTIC CORRELATES OF STYLE: AUTHORSHIP CLASSIFICATION WITH DEEP LINGUISTIC ANALYSIS FEATURES
LINGUISTIC CORRELATES OF STYLE: AUTHORSHIP CLASSIFICATION WITH DEEP LINGUISTIC ANALYSIS FEATURES
Primarily assigned technology terms:
- 5-fold cross validation
- abstracting
- algorithm
- authorship identification
- authorship verification
- automatic language analysis
- categorization
- classification
- classifier
- classifier algorithm
- classifiers
- cross validation
- cross-validation
- cutoff
- error reduction
- feature extraction
- feature selection
- identification
- intelligent thresholding
- kernel
- language analysis
- language analysis system
- learning
- learning algorithm
- learning system
- learning technique
- learning techniques
- linguistic analysis
- linguistic processing
- machine learning
- machine learning algorithm
- machine learning techniques
- newspaper style detection
- parser
- processing
- semantic analysis
- style assessment
- style classification
- style detection
- subcategorization
- support vector machines
- text categorization
- thresholding
- training process
- validation
- voting
- weighted voting
Other assigned terms:
- 10-fold cross-validation
- approach
- authorship
- authorship attribution
- bigram
- case
- characters
- classification accuracy
- classification task
- classification tasks
- context-free grammar
- correlation
- correlations
- dependency graphs
- document
- error rate
- events
- fact
- feature
- feature sets
- feature types
- feature vector
- feature vectors
- function word
- genre
- grammar
- hapax legomena
- lexeme
- likelihood
- likelihood ratio
- linguistic
- linguistic expression
- linguistic feature
- linguistic features
- linguistic structure
- measure
- measures
- methodology
- n-gram
- natural language
- ngram
- nouns
- parse
- part of speech
- part-of-speech
- personal pronouns
- precision
- process
- pronouns
- semantic
- semantic feature
- semantic features
- semantic graph
- semantic information
- semantic relationship
- sentence
- sentences
- statistics
- style
- subordinate clauses
- support vector
- svms
- syntactic features
- syntactic patterns
- tags
- technical domain
- technique
- text
- tokens
- training
- training set
- trigram
- verb
- word
- word frequencies
- word frequency
- words