ACL RD-TEC 1.0 Summarization of P06-1144

Paper Title:
MULTILINGUAL DOCUMENT CLUSTERING: AN HEURISTIC APPROACH BASED ON COGNATE NAMED ENTITIES

Authors: Soto Montalvo and Raquel Martínez and Arantza Casillas and Víctor Fresno

Other assigned terms:

  • adjective
  • anchor
  • approach
  • association for computational linguistics
  • bilingual dictionaries
  • bilingual dictionary
  • case
  • cluster
  • cluster similarity
  • clusters
  • comparable corpora
  • comparable corpus
  • corpora
  • customization
  • dictionaries
  • dictionary
  • document
  • document frequency
  • entity types
  • evaluation measure
  • evaluation metric
  • events
  • f-measure
  • fact
  • feature
  • grammatical categories
  • grammatical category
  • heuristic
  • knowledge
  • levenshtein edit-distance
  • levenshtein edit-distance function
  • linear combination
  • linguistic
  • linguistic resources
  • linguistics
  • mapping
  • maps
  • measure
  • measures
  • method
  • methodology
  • monolingual corpus
  • multilingual corpus
  • multilingual document
  • named entities
  • named entity
  • names
  • news corpus
  • noise
  • nouns
  • paragraph
  • parallel corpora
  • parallel corpus
  • precision
  • procedure
  • process
  • regular expressions
  • representations
  • russian
  • semantic
  • semantic similarity
  • statistic
  • statistics
  • style
  • technologies
  • technology
  • terms
  • text
  • thesaurus
  • training
  • user
  • verb
  • word
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***