ACL RD-TEC 1.0 Summarization of W05-0708

Paper Title:
POS TAGGING OF DIALECTAL ARABIC: A MINIMALLY SUPERVISED APPROACH

Authors: Kevin Duh and Katrin Kirchhoff

Other assigned terms:

  • affix
  • affixation
  • affixes
  • ambiguous words
  • annotation
  • approach
  • bias
  • bilingual dictionary
  • case
  • chinese\/english corpora
  • cluster
  • clusters
  • conditional probability
  • contextual model
  • corpora
  • corpus frequency
  • corpus size
  • data set
  • data sets
  • data sparseness
  • development set
  • dialectal speech
  • dictionary
  • distribution
  • evaluation set
  • fact
  • generation
  • generation process
  • hmm model
  • interpolation
  • joint probability
  • knowledge
  • language corpora
  • lexical model
  • lexicon
  • lexicon entry
  • likelihood
  • mapping
  • maximum likelihood estimate
  • method
  • modern standard arabic
  • msa treebank
  • mutual information
  • n-gram
  • n-grams
  • natural language
  • noise
  • notational simplicity
  • noun phrase
  • opinions
  • parallel corpora
  • part-of-speech
  • particle
  • phrase
  • prepositions
  • probabilistic model
  • probabilistic models
  • probabilities
  • probability
  • probability distribution
  • probability distributions
  • process
  • punctuation
  • relative frequency
  • russian
  • sentence
  • spoken language
  • spoken language corpora
  • standard arabic
  • stem
  • stems
  • substring
  • suffix
  • tag sequence
  • tagger lexicon
  • tagging accuracy
  • tagging performance
  • tags
  • tagset
  • technique
  • technology
  • terms
  • text
  • tokens
  • toolkit
  • topics
  • training
  • training corpus
  • training data
  • training set
  • transcriptions
  • transcripts
  • treebank
  • treebank corpus
  • trigram
  • trigram model
  • unannotated corpora
  • verb
  • vocabulary
  • word
  • word alignments
  • word features
  • word fragments
  • word order
  • word sequences
  • word types
  • words

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***