ACL RD-TEC 1.0 Summarization of J98-2002
Paper Title:
GENERALIZING CASE FRAMES USING A THESAURUS AND THE MDL PRINCIPLE
GENERALIZING CASE FRAMES USING A THESAURUS AND THE MDL PRINCIPLE
Authors: Hang Li and Naoki Abe
Primarily assigned technology terms:
- acquisition process
- algorithm
- approximation
- cd-rom
- coding
- cognitive modeling
- collapsing
- computational linguistics
- computing
- cross-validation
- data compression
- decision trees
- disambiguation
- disambiguation method
- disambiguation problem
- dynamic programming
- dynamic programming technique
- encoding
- error-driven learning
- estimation method
- estimation process
- estimator
- extraction tool
- induction
- language processing
- learning
- learning method
- learning methods
- maximum-likelihood
- maximum-likelihood estimation
- mdl-based generalization
- model selection
- modeling
- natural language processing
- parameter estimation
- parsing
- pattern acquisition
- pp-attachment disambiguation
- processing
- programming technique
- qualitative evaluation
- recognition
- recursive algorithm
- semantic tagging
- smoothing
- statistical estimation
- structural disambiguation
- subcategorization
- supervised learning
- tagging
- terminology
- transformation-based error-driven learning
- tree-cut
- unsupervised learning
Other assigned terms:
- acyclic graph
- ambiguity
- approach
- association for computational linguistics
- association measure
- attachment site
- bias
- bracketed corpus
- case
- case frame
- class-based model
- co-occurrence
- coding scheme
- cognitive
- concept
- concepts
- conditional distribution
- conditional probability
- conditional probability distribution
- convergence
- corpora
- data set
- data sets
- data sparseness
- data sparseness problem
- distribution
- encoding scheme
- entropy
- estimation
- experimental results
- extraction problem
- fact
- frame
- generation
- heuristic
- heuristic rules
- human intervention
- hypothesis
- information theory
- interpretation
- knowledge
- language processing tasks
- large corpora
- large thesaurus
- leaf
- lexical association
- lexical semantic
- lexicon
- likelihood
- linguistic
- linguistic expressions
- linguistics
- mdl principle
- measure
- measures
- method
- methodology
- minimum description length
- model complexity
- n-gram
- n-gram models
- natural language
- natural language processing tasks
- noise
- nouns
- parsed corpus
- penn tree bank
- penn treebank
- penn treebank corpus
- phrase
- posterior
- posterior probability
- pp-attachment
- pp-attachment ambiguity
- ppattachment
- prepositional phrase
- prior probability
- priori
- probabilities
- probability
- probability distribution
- probability model
- probability value
- procedure
- process
- processing tasks
- proposition
- root node
- running time
- selectional association
- semantic
- semantic knowledge
- sentences
- slot
- sparseness problem
- standard deviation
- statistical significance
- statistics
- subgraph
- subtree
- subtrees
- taxonomy
- technique
- term
- terms
- test data
- theory
- thesaurus
- thesaurus tree
- training
- training data
- transformation
- transformation rules
- tree
- tree bank
- tree structure
- tree structures
- treebank
- treebank corpus
- trees
- uniform distribution
- uniform probability
- verb
- wall street journal corpus
- word
- word sense
- word senses
- word usage
- word-based model
- wordnet
- words