ACL RD-TEC 1.0 Summarization of W04-1610

Paper Title:
AUTOMATIC ARABIC DOCUMENT CATEGORIZATION BASED ON THE NAÏVE BAYES ALGORITHM

Authors: Mohamed El Kourdi and Amine Bensaid and Tajje-eddine Rachidi

Other assigned terms:

  • ambiguity
  • arabic language
  • arabic morphology
  • arabic text
  • canonical form
  • case
  • classification accuracy
  • classification error
  • classification error rate
  • classification performance
  • confusion matrix
  • correlations
  • culture
  • data set
  • data sets
  • disjunction
  • document
  • document frequency
  • email
  • error rate
  • evaluation set
  • experimental setting
  • expert knowledge
  • extraction process
  • fact
  • feature
  • feature selection criterion
  • information gain
  • interpretation
  • knowledge
  • labeling
  • learning module
  • marketing
  • measure
  • method
  • minimum description length
  • natural language
  • non-concatenative language
  • paragraphs
  • posteriori probability
  • precision
  • probabilities
  • probability
  • process
  • statistic
  • statistics
  • stem
  • stems
  • technical documentation
  • technique
  • television
  • term
  • terms
  • test set
  • testing set
  • text
  • text documents
  • theorem
  • training
  • training documents
  • training set
  • transformation
  • tree
  • vocabulary
  • web documents
  • web site
  • web text
  • word
  • word morphology
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***