ACL RD-TEC 1.0 Summarization of W06-1654
Paper Title:
RANDOM INDEXING USING STATISTICAL WEIGHT FUNCTIONS
RANDOM INDEXING USING STATISTICAL WEIGHT FUNCTIONS
Authors: James Gorman and James R. Curran
Primarily assigned technology terms:
- algorithm
- approximation
- computational linguistics
- context extraction
- cutoff
- decomposition
- dimensionality reduction
- dimensionality reduction technique
- extraction technique
- extractor
- frequency weighting
- incremental learning
- incremental sampling
- indexing
- language processing
- latent semantic analysis
- learning
- lemmatisation
- lexicon acquisition
- matching
- measuring
- natural language processing
- pos tagging
- processing
- random indexing
- random projection
- ranking
- sampling
- search
- semantic analysis
- singular value decomposition
- smoothing
- tagging
- thesaurus extraction
- vector comparison
- vector space model
- weighting
Other assigned terms:
- acquisition task
- approach
- association for computational linguistics
- beam
- bilingual corpora
- bilingual lexicon
- bilingual lexicons
- british national corpus
- co-occurrence
- co-occurrence matrix
- concreteness
- context information
- context vector
- context vectors
- corpora
- corpus size
- cosine measure
- data set
- data sets
- dice
- dictionary
- dimensionality
- distance measure
- distribution
- distributional similarity
- document
- europarl corpora
- evaluation data
- evaluation measures
- foreign language
- frequency cut-off
- frequency distribution
- gold standard
- grammatical relation
- grammatical relations
- index
- large corpus
- latent semantic
- lemma
- lexical resources
- lexicon
- linguistics
- mapping
- meaning
- meanings
- measure
- measures
- method
- multi-word expression
- natural language
- nouns
- paragraph
- paragraphs
- precision
- probability
- projection
- relation
- semantic
- semantic similarity
- sentence
- similarity scores
- size of the corpus
- source language
- suffix
- synonym
- synonyms
- synonymy
- target language
- target languages
- technique
- term
- terms
- text
- thesaurus
- token frequency
- training
- translation candidate
- translations
- vector space
- vocabulary
- vocabulary size
- weighting scheme
- window-based context
- word
- word corpus
- wordnet
- words