ACL RD-TEC 1.0 Summarization of N04-4038
Paper Title:
AUTOMATIC TAGGING OF ARABIC TEXT: FROM RAW TEXT TO BASE PHRASE CHUNKS
Authors: Mona Diab, Kadri Hacioglu, and Daniel Jurafsky
Primarily assigned technology terms:
- algorithm
- analyzer
- arabic natural language processing
- automatic tagging
- bp chunking
- chunker
- chunking
- classification
- classifier
- data-driven approach
- kernel
- language processing
- learning
- learning algorithm
- learning approach
- learning approaches
- machine learning
- machine learning approaches
- machine-learning
- morphological analysis
- morphological analyzer
- morphology
- natural language processing
- nlp
- part of speech tagging
- phrase chunking
- polynomial kernel
- pos tagger
- pos tagging
- processing
- processing tools
- segmentation
- semantic chunking
- speech tagging
- supervised learning
- supervised learning algorithm
- supervised learning approach
- support vector machine
- support vector machines
- svm-tok tokenizer
- tagger
- tagging
- tagging system
- tokenization
- tokenizer
- transliteration
- viterbi
- viterbi algorithm
- word tokenization
Other assigned terms:
- adjective
- affix
- affixes
- approach
- arabic language
- arabic text
- arabic treebank
- break
- characters
- chunk
- chunk phrase type
- chunks
- classification problem
- classification task
- community
- confusion matrix
- derivational morphology
- determiner
- determiners
- development set
- dictionaries
- dictionary
- dictionary entries
- distribution
- english text
- evaluation metrics
- f-measure
- function words
- inflection
- inflectional morphology
- knowledge
- lexical level
- linguistic
- linguistic context
- meaning
- modern standard arabic
- morphemes
- natural language
- nlp applications
- nlp tasks
- nouns
- part of speech
- part-of-speech
- phrase
- phrase type
- pos tag
- precision
- prepositions
- process
- pronouns
- punctuation
- segments
- semantic
- semantic roles
- sentences
- standard arabic
- stem
- stems
- suffix
- support vector
- svms
- syntactic level
- tag set
- tagging scheme
- tagging task
- tags
- technique
- templatic morphology
- test data
- test set
- text
- tokens
- topics
- training
- training and test data
- training data
- training set
- tree
- treebank
- trees
- word
- word boundaries
- word form
- words