ACL RD-TEC 1.0 Summarization of W01-0521
Paper Title:
CORPUS VARIATION AND PARSER PERFORMANCE
Primarily assigned technology terms:
Other assigned terms:
- approach
- argument structure
- base noun
- bigram
- brown corpus
- case
- clusters
- co-occurrences
- context-free grammar
- corpora
- corpus size
- correlation
- derivation
- distribution
- events
- fact
- feature
- framenet
- framenet project
- grammar
- head word
- hypothesis
- implementation
- interpolation
- interpolation scheme
- language models
- lexical bigram
- lexical information
- model size
- n-gram
- noun phrases
- pairs of words
- parse
- parse tree
- parser performance
- parsing model
- parsing models
- parsing task
- part of speech
- penn treebank
- precision
- preposition
- probabilities
- probability
- probability distributions
- probability estimate
- probability model
- punctuation
- semantic
- semantic categories
- sentence
- sentences
- speech tag
- statistical model
- statistics
- style
- subcategorization frames
- symbol
- syntactic category
- syntactic context
- syntactic features
- tags
- technique
- term
- test data
- test material
- test set
- text
- tokens
- training
- training and test data
- training data
- training material
- training set
- tree
- treebank
- verb
- verb argument
- vocabulary
- wall street journal corpus
- word
- word frequencies
- word pair
- words
- wsj corpora
- wsj corpus