ACL RD-TEC 1.0 Summarization of P05-1071

Paper Title:
ARABIC TOKENIZATION, PART-OF-SPEECH TAGGING AND MORPHOLOGICAL DISAMBIGUATION IN ONE FELL SWOOP

Authors: Nizar Habash and Owen Rambow

Other assigned terms:

  • affixation
  • affixes
  • ambiguity
  • annotation
  • approach
  • arabic orthography
  • arabic treebank
  • backoff
  • binary feature
  • buckwalter lexicon
  • case
  • confidence measure
  • confidence score
  • corpora
  • data consortium
  • dictionary
  • english penn treebank
  • exponential model
  • f-measure
  • fact
  • feature
  • feature sets
  • generation
  • gold standard
  • heuristics
  • implementation
  • inflection
  • interpretation
  • knowledge
  • large corpus
  • lexicon
  • linguistic
  • linguistic data
  • linguistic data consortium
  • linguistic features
  • linguistic knowledge
  • meaning
  • measure
  • measures
  • mood
  • morphological features
  • morphological variation
  • nouns
  • nunation
  • orthography
  • part-of-speech
  • part-of-speech tag
  • particles
  • parts-of-speech
  • penn treebank
  • phrase
  • pos tag
  • precision
  • prefixes and suffixes
  • prepositions
  • process
  • pronouns
  • punctuation
  • relation
  • representations
  • run-time
  • sentence
  • stem
  • stems
  • suffix
  • suffixes
  • support vector
  • symbols
  • tag set
  • tagging accuracy
  • tags
  • tagset
  • term
  • terms
  • test corpora
  • test corpus
  • text
  • tokens
  • training
  • training corpora
  • training corpus
  • training data
  • treebank
  • trigram
  • trigram model
  • unannotated corpus
  • unigram
  • verb
  • word
  • word classes
  • word form
  • word stem
  • words

Extracted Section Types:


This page last edited on 10 May 2017.
