ACL RD-TEC 1.0 Summarization of W05-0609
Paper Title:
DISCRIMINATIVE TRAINING OF CLUSTERING FUNCTIONS: THEORY AND EXPERIMENTS WITH ENTITY IDENTIFICATION
DISCRIMINATIVE TRAINING OF CLUSTERING FUNCTIONS: THEORY AND EXPERIMENTS WITH ENTITY IDENTIFICATION
Authors: Xin Li and Dan Roth
Primarily assigned technology terms:
- algorithm
- classification
- classifier
- classifiers
- clustering
- clustering algorithm
- comparative evaluation
- computational linguistics
- computational natural language learning
- coreference resolution
- cross-validation
- data collection
- databases
- discriminative clustering
- discriminative training
- distance function
- distributional clustering
- domain question answering
- entity identification
- entity identifier
- entity tagger
- error reduction
- identification
- k-means
- k-means clustering
- kernels
- language learning
- language processing
- learner
- learning
- learning algorithm
- learning procedure
- learning process
- learning task
- matching
- measuring
- metric learning
- multi-class classification
- named entity tagger
- natural language learning
- natural language processing
- nlp
- normalization
- one clustering
- optimization
- optimization procedure
- pairwise classification
- perceptron
- processing
- question answering
- question answering system
- record linkage
- search
- semantic abstraction
- softtfidf
- string comparison
- supervised clustering
- supervised training
- tagger
- thresholding
- training procedure
- training process
- two-fold cross-validation
- unsupervised clustering
- unsupervised method
- unsupervised training
- weighting
Other assigned terms:
- annotation
- appearance similarity
- approach
- association for computational linguistics
- case
- cluster
- clusters
- coefficient
- concept
- convergence
- corpora
- data set
- data sets
- distance metric
- distributional similarity
- document
- domain knowledge
- entity type
- entity types
- euclidean distance
- experimental results
- fact
- feature
- feature set
- feature space
- formalization
- hypothesis
- hypothesis space
- identification task
- index
- information sources
- intention
- jensen-shannon divergence
- knowledge
- kullback-leibler divergence
- language models
- learning rate
- likelihood
- linguistics
- measure
- measures
- method
- named entity
- names
- natural language
- neighbor graph
- nlp tasks
- noise
- open domain
- optimization problem
- personal names
- procedure
- process
- semantic
- sentence
- sentences
- similarity between words
- similarity metric
- similarity metrics
- statistics
- test set
- text
- text documents
- theory
- tokens
- training
- training data
- training examples
- training set
- transitivity
- trec corpus
- understanding
- weighting scheme
- words