ACL RD-TEC 1.0 Summarization of W94-0106
Paper Title:
DO WE NEED LINGUISTICS WHEN WE HAVE STATISTICS? A COMPARATIVE ANALYSIS OF THE CONTRIBUTIONS OF LINGUISTIC CUES TO A STATISTICAL WORD GROUPING SYSTEM
DO WE NEED LINGUISTICS WHEN WE HAVE STATISTICS? A COMPARATIVE ANALYSIS OF THE CONTRIBUTIONS OF LINGUISTIC CUES TO A STATISTICAL WORD GROUPING SYSTEM
Primarily assigned technology terms:
- classification
- cluster analysis
- clustering
- clustermg
- collocation extraction
- collocation extraction \
- comparative analysis
- computational lexicography
- data collection
- data extraction
- disambiguation
- disambiguation problem
- estimator
- extraction method
- finite-state parser
- grouping
- identification
- information retrieval
- linear regression
- machine translation
- matching
- modeling
- morphology
- nlp
- nlp systems
- non-hierarchical clustermg
- parser
- part-of-speech tagging
- pattern matcher
- pattern matching
- post-processing
- predictor
- regression
- regular expression
- sense disambiguation
- smoothing
- spell-checking
- statistical approaches
- statistical methods
- statistical system
- tagging
- word classification
- word grouping
Other assigned terms:
- adjective
- adverbial modification
- ambiguous words
- approach
- binary features
- binomial distribution
- case
- cluster
- clusters
- collocation
- compounds
- concepts
- corpora
- corpus size
- correlation
- disambiguation system
- distribution
- experimental results
- f-measure
- f-score
- feature
- finite-state grammar
- frequency counts
- generation
- genre
- grammar
- heuristics
- hyponyms
- hypothesis
- knowledge
- lexicography
- lexicon
- linear regression model
- linguistic
- linguistic constraints
- linguistic feature
- linguistic features
- linguistic information
- linguistic knowledge
- linguistics
- meaning
- measure
- measures
- method
- methodology
- morphology module
- natural language
- nlp applications
- noise
- nouns
- null hypothesis
- parameter space
- part-of-speech
- part-of-speech information
- precision
- probabilities
- procedure
- process
- regression model
- run-time
- semantic
- semantic classes
- semantic information
- semantic relatedness
- sense disambiguation problem
- sentence
- sentence boundaries
- similarity scores
- size of the corpus
- statistical model
- statistical significance
- statistics
- stem
- suffix
- synonyms
- syntactic relationship
- technology
- terms
- test set
- text
- text corpus
- translations
- typographical errors
- word
- word classes
- word corpus
- words