ACL RD-TEC 1.0 Summarization of J03-3005

Paper Title:
USING THE WEB TO OBTAIN FREQUENCIES FOR UNSEEN BIGRAMS

Authors: Frank Keller and Mirella Lapata

Other assigned terms:

  • adjective
  • ambiguity
  • anaphora
  • annotation
  • approach
  • association measure
  • bias
  • bigram
  • british english
  • british national corpus
  • case
  • class-based approach
  • class-based model
  • clustering model
  • co-occurrence
  • co-occurrence frequency
  • co-occurrences
  • coefficient
  • compounds
  • concept
  • concept hierarchy
  • conditional model
  • conditional probabilities
  • conditional probability
  • conditional probability model
  • context-free grammar
  • corpora
  • corpus evidence
  • corpus frequency
  • corpus size
  • correlation
  • correlation coefficient
  • correlations
  • data set
  • data sets
  • data sparseness
  • data sparseness problem
  • development set
  • dictionary
  • distribution
  • distributional similarity
  • error rate
  • estimation
  • evaluations
  • fact
  • french
  • frequency counts
  • genre
  • grammar
  • head noun
  • heuristic
  • heuristics
  • human judgments
  • hypothesis
  • interpolation
  • interpretation
  • joint probability
  • joint probability model
  • language model
  • linguistic
  • linguistic data
  • linguistic phenomenon
  • linguistics
  • linguistics literature
  • manual annotation
  • measure
  • method
  • model parameters
  • n-gram
  • n-grams
  • nantc coefficient
  • nlp tasks
  • noise
  • nominal anaphora
  • nouns
  • parameter settings
  • part of speech
  • part-of-speech
  • pp attachment
  • predicate-argument
  • probabilities
  • probability
  • probability estimates
  • probability model
  • procedure
  • queries
  • query
  • questionnaire
  • random order
  • relation
  • selectional association
  • semantic
  • semantic class
  • semantic classes
  • semantic hierarchy
  • semantic similarity
  • sense ambiguity
  • sentence
  • sparse data
  • sparseness problem
  • spoken language
  • statistics
  • syntactic patterns
  • syntactic relations
  • syntax
  • tagged corpus
  • taxonomy
  • technique
  • terms
  • test data
  • test set
  • text
  • text corpus
  • theoretical linguistics
  • training
  • training corpus
  • training data
  • training set
  • transcripts
  • transformation
  • translations
  • tree
  • trigram
  • trigram language model
  • unigram
  • verb
  • webexp software package
  • word
  • word error rate
  • word sense
  • word sense ambiguity
  • word senses
  • word sequences
  • wordnet
  • wordnet taxonomy
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***