ACL RD-TEC 1.0 Summarization of P03-1050
Paper Title:
UNSUPERVISED LEARNING OF ARABIC STEMMING USING A PARALLEL CORPUS
Authors: Monica Rogati and Scott McCarley and Yiming Yang
Primarily assigned technology terms:
- algorithm
- approximation
- arabic information retrieval
- arabic retrieval
- arabic stemmer
- cutoff
- detection and tracking
- disambiguation
- information retrieval
- information retrieval task
- learning
- learning approach
- machine translation
- measuring
- morphology
- normalization
- plural detection
- preprocessing
- processing
- ranking
- scoring
- search
- sense disambiguation
- smoothing
- statistical machine translation
- statistical methods
- stem translation probability
- stemmer
- supervised learning
- task-based evaluation
- text processing
- topic detection
- topic detection and tracking
- translators
- transliteration
- unsupervised learning
- unsupervised stemmer
- unsupervised training
- word sense disambiguation
Other assigned terms:
- affix
- affixes
- annotation
- approach
- case
- concept
- conditional probabilities
- convergence
- corpora
- corpus size
- distribution
- document
- english sentence
- evaluation metric
- fact
- foreign word
- french
- genre
- joint probability
- knowledge
- learning paradigm
- lexical item
- linguistic
- linguistic knowledge
- mapping
- mean average precision
- meaning
- meanings
- minimum description length
- monolingual corpora
- monolingual corpus
- named entity
- opinions
- parallel corpus
- parallel sentence
- parallel text
- part of speech
- part of speech tags
- phrase
- precision
- prefixes and suffixes
- prior distribution
- probabilities
- probability
- probability distribution
- process
- queries
- query
- retrieval task
- scoring model
- sentence
- sentences
- speech tag
- statistics
- stem
- stems
- suffix
- suffixes
- tags
- target language
- test set
- testing set
- text
- tokens
- training
- training corpus
- training data
- training phase
- translation model
- translation probabilities
- translation probability
- trees
- unannotated text
- word
- word sense
- words