ACL RD-TEC 1.0 Summarization of W03-1812
Paper Title:
AN EMPIRICAL MODEL OF MULTIWORD EXPRESSION DECOMPOSABILITY
Authors: Timothy Baldwin, Colin Bannard, Takaaki Tanaka, and Dominic Widdows
Primarily assigned technology terms:
- blocking
- chunker
- classification
- correlation analysis
- database
- decomposition
- disambiguation
- indexing
- information retrieval
- latent semantic analysis
- linear regression
- machine translation
- measuring
- modelling
- mwe detection
- mwe extraction
- parsing
- partitioning
- pos tagger
- predictor
- ranking
- regression
- regression test
- relative distance
- rescoring
- searching
- semantic analysis
- similarity method
- singular-value decomposition
- statistical analysis
- statistical machine translation
- tagger
- tagging
- vector space model
- voting
- weighted voting
- word-sense disambiguation
Other assigned terms:
- annotator
- approach
- array
- bigram
- british national corpus
- brown corpus
- case
- chunk
- collocation
- composition
- compositionality
- compound nominal
- compounds
- concept
- concepts
- content words
- corpora
- corpus frequency
- correlation
- data sparseness
- detection task
- determiners
- dictionaries
- dictionary
- distribution
- document
- fact
- generalisation
- generation
- grammar
- head noun
- hierarchical lexicon
- hierarchical structure
- hyponym
- hyponyms
- hyponymy
- hyponymy relation
- hypothesis
- idiom
- implementation
- inflection
- information content
- inheritance
- interpretation
- latent semantic
- lexemes
- lexical hierarchy
- lexical relations
- lexicon
- likelihood
- linguistic
- linguistic constraints
- logical form
- meaning
- measure
- measures
- method
- modifier
- monolingual corpora
- multiple inheritance
- multiword expressions
- mutual information
- mwes
- nn compound
- noun phrase
- nouns
- part-of-speech
- part-of-speech information
- particle
- particles
- parts-of-speech
- phrase
- polysemy
- precision
- prepositions
- process
- query
- relation
- semantic
- semantic content
- semantic distance
- semantic similarity
- semcor
- sentence
- sentential context
- similarity between words
- similarity measure
- similarity measures
- statistics
- stems
- synonyms
- synonymy
- synsets
- syntactic composition
- syntactic variation
- syntax
- tags
- technique
- technologies
- terms
- text
- thesaurus
- tokens
- topology
- training
- training data
- transitivity
- translations
- vector space
- verb
- verbal inflection
- word
- word boundaries
- word pair
- word senses
- word-net
- wordnet
- wordnet class
- wordnet hierarchy
- wordnet similarity
- words
- wsj corpus