ACL RD-TEC 1.0 Summarization of W04-1806
Paper Title:
AUTOMATICALLY INDUCING ONTOLOGIES FROM CORPORA
AUTOMATICALLY INDUCING ONTOLOGIES FROM CORPORA
Authors: Interjeet Mani and Ken Samuel and Kris Concepcion and David Vogel
Primarily assigned technology terms:
- algorithm
- approximation
- automatic cataloguing
- automatic evaluation
- browser
- cataloguing
- classifiers
- clustering
- computational terminology
- computing
- constraint relaxation
- data-driven approach
- database
- databases
- evidence combination
- extrinsic evaluation
- finite-state parsing
- greedy approximation
- illustration
- induction
- information access
- information extraction
- learning
- ontology comparison
- ontology construction
- ontology editor
- ontology evaluation
- ontology induction
- parsing
- phrase analysis
- preprocessing
- processing
- query expansion
- querying
- scoring
- syntactic parsing
- tagger
- term discovery
- term scoring
- termdiscovery
- terminology
- tokenization
- top-down clustering
- various information access
Other assigned terms:
- approach
- background corpus
- background knowledge
- binomial model
- biology
- case
- cluster
- clusters
- community
- compounds
- concepts
- conditional probability
- corpora
- distribution
- document
- document frequency
- domain corpus
- domain knowledge
- domain-independence
- domain-specific knowledge
- evaluation experiment
- evaluations
- f-measure
- free distribution
- gene ontology
- grammar
- human intervention
- human judgments
- hypernym
- hypothesis
- implementation
- inferences
- information gain
- inter-annotator agreement
- inverse document frequency
- kappa
- knowledge
- language data
- leaf
- likelihood
- likelihood ratio
- linguistic
- machine-induced ontology
- measure
- measures
- medline
- message
- method
- mutual information
- names
- natural language
- news corpus
- null hypothesis
- ontologies
- ontology
- paragraph
- part-of-speech
- phrase
- pointwise mutual information
- precision
- probabilistic measure
- probability
- procedure
- process
- proper names
- protein names
- punctuation
- query
- relation
- seed
- semantic
- semantic relations
- semantic relationships
- standard deviation
- statistic
- statistics
- subsumption
- suffix
- synonyms
- system architecture
- taxonomy
- term
- term frequency
- term list
- terms
- text
- text collection
- tf \* idf
- thesaurus
- training
- training data
- transitive closure
- transitivity
- tree
- user
- web site
- word
- wordnet