ACL RD-TEC 1.0 Summarization of W06-2902
Paper Title:
PORTING STATISTICAL PARSERS WITH DATA-DEFINED KERNELS
Authors: Ivan Titov and James Henderson
Primarily assigned technology terms:
- algorithm
- classifier
- classifiers
- computational linguistics
- computational natural language learning
- computing
- extractor
- joint training
- kernel
- kernels
- language learning
- language parsing
- learning
- left-corner parsing
- machine learning
- maximum entropy
- model interpolation
- natural language learning
- natural language parsing
- neural network
- one-to-one mapping
- parameterization
- parse reranking
- parser
- parser transferring
- parsers
- parsing
- perceptron
- perceptron algorithm
- probabilistic parser
- pruning
- reparameterization
- reranking
- statistical parser
- statistical parsers
- svm classifier
- validation
- voted perceptron
- voted perceptron algorithm
- weighting
- wsj parsing
Other assigned terms:
- annotated corpora
- approach
- association for computational linguistics
- baseline model
- benchmark
- brown corpus
- case
- conll-x
- corpora
- derivations
- development set
- distribution
- domain corpus
- domain knowledge
- domain vocabulary
- entropy
- estimation
- experimental results
- fact
- feature
- generative probability
- generative probability model
- history representation
- history-based model
- hypothesis
- implementation
- interpolation
- knowledge
- large corpus
- linguistics
- mapping
- meaning
- measure
- measures
- method
- model parameters
- model size
- natural language
- out-of-domain corpus
- parse
- parse tree
- parser performance
- parser portability
- parsing model
- parsing problem
- parsing strategy
- parsing task
- partial parse
- pcfg
- penn treebank
- portability
- probabilistic model
- probabilistic models
- probabilities
- probability
- probability distribution
- probability estimates
- probability model
- procedure
- sentence
- sentences
- statistics
- svms
- syntactic structure
- syntactic structures
- technique
- terms
- testing set
- text
- trained model
- training
- training data
- training phase
- training set
- training time
- tree
- treebank
- treebank wsj corpus
- trees
- vocabulary
- vocabulary size
- wall street journal corpus
- word
- words
- wsj corpus
- wsj dataset