ACL RD-TEC 1.0 Summarization of W06-1105
Paper Title:
COMPARISON OF SIMILARITY MODELS FOR THE RELATION DISCOVERY TASK
COMPARISON OF SIMILARITY MODELS FOR THE RELATION DISCOVERY TASK
Primarily assigned technology terms:
- agglomerative clustering
- algorithm
- approximation
- automatic content extraction
- boundary detection
- category matching
- classification
- classifiers
- clustering
- computational linguistics
- corpus modelling
- data analysis
- decomposition
- dimensionality reduction
- disambiguation
- encoding
- entity recognition
- extrinsic evaluation
- hierarchical clustering
- identification
- idf term weighting
- information extraction
- information retrieval
- kernels
- language processing
- latent dirichlet allocation
- latent semantic analysis
- lda dimensionality reduction
- learning
- matching
- measuring
- mining
- model selection
- modeling
- modelling
- named entity recognition
- natural language processing
- nlp
- optimisation
- preprocessing
- probabilistic lsa
- processing
- recognition
- relation discovery
- relation extraction
- relation identification
- rule engineering
- sampling
- semantic analysis
- sense disambiguation
- sense disambiguation task
- sense discrimination
- sentence boundary detection
- singular value decomposition
- statistical analysis
- supervised learning
- supervised relation extraction
- term weighting
- text mining
- textual dimensionality reduction
- tokenisation
- tree kernels
- vector representation
- weighting
- word sense discrimination
Other assigned terms:
- abbreviations
- ace corpus
- annotation
- approach
- association for computational linguistics
- broadcast news
- case
- class information
- cluster
- clusters
- co-occurrence
- co-occurrence matrix
- co-occurrences
- coefficient
- cognitive
- cognitive science
- computational complexity
- context feature
- context vectors
- context words
- correlation
- cosine similarity
- cosine similarity measure
- data sets
- development set
- dimensionality
- dirichlet allocation
- disambiguation task
- discourse
- discourse relations
- distribution
- document
- entity class
- entity type
- entropy
- estimation
- events
- f-score
- f-score performance
- fact
- feature
- feature matrix
- frame
- generalisation
- gold standard
- graphical representation
- hypothesis
- identification task
- implementation
- interpretation
- joint distribution
- kl divergence
- knowledge
- kullback-leibler divergence
- labeling
- language change
- latent semantic
- lexical semantics
- linear algebra
- linguistic
- linguistic knowledge
- linguistics
- mapping
- meaning
- measure
- measures
- method
- multinomial distribution
- named entities
- named entity
- names
- natural language
- null hypothesis
- penn treebank
- posterior
- precision
- preposition
- probabilistic model
- probability
- probability distributions
- probability estimates
- procedure
- process
- pronouns
- random sample
- relation
- representations
- semantic
- semantic features
- semantic similarity
- semantic space
- sentence
- sentence boundary
- similarity matrix
- similarity measure
- similarity measures
- skew divergence
- statistic
- statistical significance
- stop word list
- synonym
- syntax
- system performance
- technique
- term
- term co-occurrence
- terms
- test set
- text
- tf \* idf
- tokens
- topics
- training
- training data
- tree
- treebank
- uniform distribution
- vector space
- word
- word co-occurrence
- word features
- word order
- word sense
- words
- world knowledge