ACL RD-TEC 1.0 Summarization of P06-1038
Paper Title:
EFFICIENT UNSUPERVISED DISCOVERY OF WORD CATEGORIES USING SYMMETRIC PATTERNS AND HIGH FREQUENCY WORDS
Authors: Dmitry Davidov and Ari Rappoport
Primarily assigned technology terms:
- agglomerative clustering
- algorithm
- category acquisition
- category discovery
- category merging
- category pruning
- clustering
- clustering algorithm
- coclustering
- computational linguistics
- corpus annotation
- corpus windowing
- crawling
- decomposition
- entity recognition
- graph representation
- identification
- induction
- information extraction
- k-means
- lexical acquisition
- lexical category acquisition
- lsa-based clustering
- matrix decomposition
- named entity recognition
- parsing
- pattern induction
- phrasing
- pos tagging
- pruning
- random selection
- ranking
- reasoning
- recognition
- search
- set-theoretic inference
- syntactic annotation
- tagging
- unsupervised lexical category acquisition
- vector representation
- web search
Other assigned terms:
- adjective
- annotation
- approach
- association for computational linguistics
- baseline clustering
- case
- category size
- clusters
- content words
- context feature
- corpora
- english corpus
- evaluation method
- evaluation methodology
- feature
- feature vectors
- heuristics
- human judgment
- human judgments
- hyponym
- index
- kappa
- large corpora
- lexical category
- lexical resources
- lexical semantic
- linguistic
- linguistic data
- linguistics
- meaning
- measure
- measures
- method
- methodology
- named entity
- names
- noise
- nouns
- pairs of words
- parts of speech
- precision
- process
- punctuation
- russian
- scalability
- seed
- semantic
- semantic categories
- similarity measures
- size of the corpus
- statistics
- subgraph
- subtree
- syntactic features
- syntactic information
- syntactic patterns
- tagged corpus
- target word
- technique
- terms
- text
- unannotated corpus
- verb
- vocabulary
- web corpus
- web pages
- window size
- word
- word senses
- word-net
- wordnet
- words