ACL RD-TEC 1.0 Summarization of N06-1020
Paper Title:
EFFECTIVE SELF-TRAINING FOR PARSING
EFFECTIVE SELF-TRAINING FOR PARSING
Authors: David McClosky and Eugene Charniak and Mark Johnson
Primarily assigned technology terms:
- algorithm
- bootstrapping
- co-training
- context-free parser
- error analysis
- error reduction
- factor analysis
- language modeling
- learner
- learning
- maxent
- maximum entropy
- maximum entropy model
- model interpolation
- modeling
- n-best parsing
- parser
- parser adaptation
- parser-reranker
- parsers
- parsing
- parsing algorithm
- pos-tagging
- regression
- regularization
- reranking
- search
- self-training
- sentence selection
- unsupervised learning
- weighting
Other assigned terms:
- baseline performance
- brown corpus
- case
- conjunct
- corpora
- data sets
- discriminative model
- distribution
- entropy
- evaluations
- events
- experimental results
- f-score
- fact
- feature
- genre
- geometric mean
- grammar
- hypothesis
- interpolation
- labeled training data
- language model
- learning problem
- method
- methodology
- n-best list
- noise
- oracle
- parse
- parse tree
- parsing accuracy
- parsing model
- parts of speech
- parts-of-speech
- partsof-speech
- penn treebank
- pp attachment
- prepositional phrases
- prepositional-phrase attachment
- prepositions
- probabilities
- probability
- probability distributions
- process
- sentence
- sentence boundaries
- sentence level
- sentences
- statistics
- syntactic information
- syntactic structure
- technique
- term
- terms
- text
- text corpus
- training
- training data
- training set
- tree
- treebank
- words