ACL RD-TEC 1.0 Summarization of W04-2402
Paper Title:
SEMANTIC LEXICON CONSTRUCTION: LEARNING FROM UNLABELED DATA VIA SPECTRAL ANALYSIS
SEMANTIC LEXICON CONSTRUCTION: LEARNING FROM UNLABELED DATA VIA SPECTRAL ANALYSIS
Primarily assigned technology terms:
- algorithm
- approximation
- automatic content extraction
- bayes classifier
- bootstrap
- bootstrapping
- bootstrapping method
- bracketing
- categorization
- centroid-based classifier
- chunker
- classification
- classifier
- classifiers
- co-training
- computing
- decomposition
- dimensionality reduction
- disambiguation
- disambiguation method
- entity detection
- entity recognizers
- expectation maximization
- extraction systems
- extractor
- factor analysis
- feature selection
- global optimization
- indexing
- information extraction
- information extraction systems
- inner product
- iterative algorithm
- iterative process
- latent semantic indexing
- learning
- learning process
- learning techniques
- lemmatizer
- lexicon construction
- list construction
- local maximization
- local optimization
- machine learning
- machine learning techniques
- maximum likelihood
- measuring
- model parameter estimation
- naive bayes
- naive bayes classifier
- naive bayes classifiers
- nlp
- noun phrase bracketing
- optimization
- parameter estimation
- phrase bracketing
- principal component analysis
- semantic indexing
- semantic lexicon construction
- semi-automatic construction
- sense disambiguation
- singular value decomposition
- spectral analysis
- supervised classification
- supervised learning
- text categorization
- tf-idf weighting
- transductive learning
- vector dimensionality reduction
- vector representation
- weighting
- word classification
- word selection
- word sense disambiguation
- word similarity measurement
Other assigned terms:
- annotation
- base noun
- base noun phrase
- case
- categorization task
- class information
- class membership
- classification problem
- classification task
- conditional independence
- corpora
- data sets
- data sparseness
- dimensionality
- distribution
- document
- entity class
- estimation
- events
- experimental results
- f-measure
- fact
- feature
- feature information
- feature space
- feature vector
- feature vectors
- gazetteer
- generation
- head word
- implementation
- independence assumption
- intelligence
- interpretation
- knowledge
- labeled training data
- labeling
- latent semantic
- lexical items
- lexicon
- likelihood
- linguistic
- local maximum
- measure
- method
- model parameter
- model parameters
- named entity
- names
- nlp tasks
- noun phrase
- nouns
- parameter settings
- phrase
- precision
- preposition
- probabilities
- probability
- probability distribution
- procedure
- process
- projection
- pronouns
- real-world knowledge
- research and development
- scalability
- seed
- seed words
- semantic
- semantic lexicon
- similarity measure
- statistics
- subjectverb
- synonym
- syntactic constructions
- syntactic features
- task performance
- term-document matrix
- terms
- test corpora
- test data
- text
- training
- training data
- unannotated corpus
- unlabeled examples
- web documents
- word
- word pair
- word sense
- word similarity
- word similarity measure
- words