ACL RD-TEC 1.0 Summarization of W02-2001
Paper Title:
EXTRACTING THE UNEXTRACTABLE: A CASE STUDY ON VERB-PARTICLES
EXTRACTING THE UNEXTRACTABLE: A CASE STUDY ON VERB-PARTICLES
Authors: Timothy Baldwin and Aline Villavicencio
Primarily assigned technology terms:
- algorithm
- attachment resolution
- brill tagger
- chunk parser
- chunk parsing
- chunker
- chunking
- classiflcation
- classifler
- classifler-based method
- disambiguation
- disambiguation method
- extraction method
- k-nearest neighbor
- language tools
- parser
- parsing
- parsing method
- partitioning
- regular expression
- relative distance
- rule-based method
- search
- searching
- tagger
- tagging
- unsupervised attachment disambiguation
- voting
- weighted voting
- weighting
Other assigned terms:
- adjective
- ambiguity
- annotation
- approach
- attachment ambiguity
- brown corpus
- case
- chunk
- chunk type
- chunks
- class distribution
- collocation
- corpora
- corpus study
- distribution
- english verb
- f-score
- fact
- feature
- feature set
- feature space
- feature vector
- feature vectors
- gold standard
- grammar
- lemma
- lemmata
- lexical type
- linguistic
- linguistic features
- log-likelihood
- log-likelihood ratio
- log-linear models
- manual annotation
- meaning
- method
- morph
- natural language
- natural language tools
- noise
- noun chunk
- parse
- parsed corpus
- parser output
- particle
- particlehood
- particles
- partof-speech
- penn treebank
- penn treebank annotation
- pos tag
- precision
- preposition
- preposition combination
- prepositions
- process
- pronominal noun
- punctuation
- punctuation mark
- random sample
- raw text corpora
- relative frequency
- segments
- sparse data
- subcategorisation
- syntactic structure
- tags
- tagset
- terms
- test data
- text
- text corpora
- token frequency
- training
- training and test data
- training data
- transitivity
- treebank
- treebank annotation
- treebank parse
- trees
- verb
- wall street journal corpus
- word
- word lemmata
- word level
- words