ACL RD-TEC 1.0 Summarization of P06-1085

Paper Title:
CONTEXTUAL DEPENDENCIES IN UNSUPERVISED WORD SEGMENTATION

Authors: Sharon Goldwater and Thomas L. Griffiths and Mark Johnson

Other assigned terms:

  • approach
  • association for computational linguistics
  • backoff
  • bigram
  • bigram language model
  • bigram model
  • boundary marker
  • cache
  • case
  • characters
  • childes database
  • collocation
  • conditional probability
  • convergence
  • corpora
  • correlation
  • dictionary
  • dirichlet distribution
  • distribution
  • f-score
  • fact
  • generative model
  • grammar
  • hypotheses
  • hypothesis
  • hypothesis space
  • language model
  • lexical entries
  • lexical entry
  • lexical items
  • lexicon
  • likelihood
  • linguistics
  • markov chain
  • measures
  • method
  • multinomial distribution
  • n-gram
  • n-gram model
  • n-gram models
  • natural language
  • parameter settings
  • phoneme
  • phonemes
  • phonemic representation
  • phonemic transcription
  • posterior
  • posterior distribution
  • posterior probability
  • precision
  • prior probability
  • priori
  • probabilistic models
  • probabilities
  • probability
  • probability distribution
  • procedure
  • process
  • process model
  • search procedure
  • segmentation accuracy
  • statistics
  • stems
  • technique
  • term
  • terms
  • text
  • token frequency
  • tokens
  • trigram
  • uniform distribution
  • unigram
  • unigram language model
  • unigram model
  • utterance
  • word
  • word boundaries
  • word boundary
  • word frequencies
  • word type
  • word types
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***