ACL RD-TEC 1.0 Summarization of W06-1710

Paper Title:
WEB CORPUS MINING BY INSTANCE OF WIKIPEDIA

Authors: RĂ¼diger Gleim and Alexander Mehler and Matthias Dehmer

Other assigned terms:

  • ambiguity
  • approach
  • argumentation
  • bag of words
  • case
  • categorization task
  • cluster
  • clusters
  • collocation
  • comparative study
  • computational complexity
  • corpora
  • distance matrix
  • document
  • document object model
  • document structure
  • edit distance
  • f-measure
  • fact
  • feature
  • feature vectors
  • genre
  • human reader
  • hypothesis
  • implementation
  • information gain
  • interpretation
  • large corpora
  • lexical content
  • linguistic
  • linguistics
  • mapping
  • maps
  • markup
  • measure
  • measures
  • method
  • polymorphism
  • probability
  • random order
  • relation
  • representations
  • segments
  • signal
  • support vector
  • svms
  • tags
  • test corpus
  • test set
  • text
  • text structure
  • textual unit
  • textual units
  • tokens
  • topics
  • training
  • training examples
  • training set
  • tree
  • tree node
  • trees
  • web content
  • web corpus
  • web documents
  • web page
  • web pages
  • webgenre
  • wikipedia
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***