ACL RD-TEC 1.0 Summarization of W06-1104
Paper Title:
AUTOMATICALLY CREATING DATASETS FOR MEASURES OF SEMANTIC RELATEDNESS
AUTOMATICALLY CREATING DATASETS FOR MEASURES OF SEMANTIC RELATEDNESS
Authors: Torsten Zesch and Iryna Gurevych
Primarily assigned technology terms:
- automatic extraction
- classification
- computational linguistics
- computer science
- computing
- corpus-based approach
- disambiguation
- german information retrieval
- indexing
- information retrieval
- information retrieval systems
- information retrieval task
- lemmatization
- lexical chaining
- measuring
- pos-tagging
- post-processing
- preprocessing
- rating
- retrieval system
- retrieval systems
- search
- selection process
- sense disambiguation
- terminology
- tf.idf-weighting
- tokenization
- weighting
- word sense disambiguation
Other assigned terms:
- annotators
- approach
- association for computational linguistics
- bias
- break
- case
- coefficient
- cohesion
- concept
- concepts
- corpora
- correlation
- correlation coefficient
- dictionaries
- dictionary
- distribution
- domain-specific corpora
- domain-specific vocabulary
- english corpus
- extraction process
- fact
- foreign words
- germanet
- gold standard
- human annotators
- human judgments
- hypernymy
- inter-subject correlation
- intra-subject correlation
- knowledge
- language pairs
- large corpora
- lexical chain
- lexical cohesion
- linguistic
- linguistics
- meaning
- meanings
- measure
- measures
- names
- nouns
- ontologies
- paragraph
- parts-of-speech
- polysemous word
- polysemous words
- procedure
- process
- psycholinguistics
- relation
- retrieval task
- semantic
- semantic information
- semantic relatedness
- semantic relations
- semantic similarity
- sense inventory
- similarity measures
- social science
- standard deviation
- synonyms
- synonymy
- system architecture
- technical terminology
- technical terms
- terms
- test set
- text
- topics
- training
- user
- verb
- vocabulary
- wikipedia
- word
- word classes
- word level
- word sense
- word senses
- wordnet
- words