ACL RD-TEC 1.0 Summarization of P99-1032
Paper Title:
DEVELOPMENT AND USE OF A GOLD-STANDARD DATA SET FOR SUBJECTIVITY CLASSIFICATIONS
DEVELOPMENT AND USE OF A GOLD-STANDARD DATA SET FOR SUBJECTIVITY CLASSIFICATIONS
Authors: Janyce M. Wiebe and Rebecca F. Bruce and Thomas P. O'Hara
Primarily assigned technology terms:
- algorithm
- broadcasting
- categorization
- classification
- classifier
- classifiers
- coding
- computer science
- cross validation
- cutoff
- discourse annotation
- discourse processing
- discourse tagging
- em algorithm
- feature representation
- feature selection
- information extraction
- information retrieval
- learning
- learning algorithms
- likelihood estimate
- machine learning
- machine learning algorithms
- machine translation
- maximum likelihood
- model selection
- naive bayes
- nlp
- parameter estimation
- processing
- rating
- reporting
- search
- search engine
- segmentation
- statistical techniques
- summarization
- tagging
- text categorization
- text processing
- validation
Other assigned terms:
- 10-fold cross validation
- adjective
- adverb
- annotated corpus
- annotation
- annotation process
- approach
- bias
- binary feature
- case
- clusters
- co-occurrence
- compound sentence
- conditional independence
- conjunct
- corpora
- correlations
- data set
- dialog
- discourse
- document
- estimation
- evaluations
- events
- fact
- feature
- generation
- human performance
- intention
- intercoder agreement
- intercoder reliability
- kappa
- kappa value
- knowledge
- latent class
- likelihood
- likelihood ratio
- linguistic
- linguistic theory
- linguistics
- maximum likelihood estimate
- method
- model fit
- opinion
- opinion category
- opinions
- paragraph
- polarity
- probability
- probability model
- procedure
- process
- pronoun
- punctuation
- punctuation marks
- segments
- semantic
- semantic classes
- sentence
- sentences
- statistic
- subjectivity
- switchboard-damsl
- tagging task
- tags
- term
- terms
- test set
- text
- theory
- tokens
- training
- training data
- treebank
- treebank corpus
- user
- word