ACL RD-TEC 1.0 Summarization of W05-0701
Paper Title:
MEMORY-BASED MORPHOLOGICAL ANALYSIS GENERATION AND PART-OF-SPEECH TAGGING OF ARABICAuthors: Erwin Marsi and Antal
MEMORY-BASED MORPHOLOGICAL ANALYSIS GENERATION AND PART-OF-SPEECH TAGGING OF ARABIC
Authors: Erwin Marsi and Antalvan den Bosch and Abdelhadi Soudi
Primarily assigned technology terms:
- algorithm
- analyzer
- arabic morphological analyzer
- automatic generation
- brill tagger
- classification
- classifier
- computational linguistics
- cross-validation
- data preparation
- dictionary lookup
- distance function
- error-driven learning
- feature weighting
- finite state
- generation method
- identification
- inductive learning
- inductive learning algorithm
- instantiation
- k-nearest neighbor
- k-nearest neighbor classifier
- k-nn
- kernel
- lazy learning
- learning
- learning algorithm
- learning algorithms
- learning methods
- machine learning
- machine learning algorithms
- machine learning methods
- machine-learning
- majority voting
- matching
- memory-based learning
- memory-based tagging
- mining
- morpho-syntactic analysis
- morphological analysis
- morphological analyzer
- morphological segmentation
- morphology
- natural-language processing
- nearest neighbors
- neighbor classification
- parsing
- part-of-speech tagger
- part-of-speech tagging
- pos tagger
- pos tagging
- processing
- ranking
- reporting
- segmentation
- shallow parsing
- simple majority voting
- statistical tagger
- stem identification
- support vector machines
- tagger
- tagger generator
- taggers
- tagging
- text mining
- transliteration
- validation
- viterbi
- viterbi algorithm
- vocalization
- voting
- weighting
Other assigned terms:
- 10-fold cross-validation
- affixes
- annotation
- annotator
- annotators
- approach
- arabic morphology
- arabic text
- arabic treebank
- association for computational linguistics
- case
- characters
- classification tasks
- cross-validation experiment
- data consortium
- data set
- derivation
- determiners
- dictionary
- distribution
- f-score
- fact
- feature
- feature space
- feature-value vector
- finite state model
- fixed-length vector
- frequency list
- generation
- input string
- kernel function
- large corpus
- lexical entries
- lexicon
- linguistic
- linguistic constraints
- linguistic data
- linguistic data consortium
- linguistics
- linguists
- local context
- mapping
- maps
- method
- morpheme
- morphemes
- morphological structure
- natural-language
- nouns
- parameter settings
- part-of-speech
- part-of-speech tag
- pos tag
- precision
- prediction task
- prefixes and suffixes
- prepositions
- process
- pronouns
- punctuation
- segments
- semitic languages
- sentence
- sentences
- stem
- stems
- suffix
- suffixes
- support vector
- surface form
- symbol
- syntactic features
- tag sequence
- tagging accuracy
- tagging task
- tags
- terms
- test data
- test material
- test set
- text
- token frequency
- tokens
- training
- training corpus
- training data
- training material
- training set
- treebank
- trigram
- verb
- vowel
- vowel melody morpheme
- word
- word structure
- word types
- words