ACL RD-TEC 1.0 Summarization of P06-1084
Paper Title:
AN UNSUPERVISED MORPHEME-BASED HMM FOR HEBREW MORPHOLOGICAL DISAMBIGUATION
Authors: Meni Adler and Michael Elhadad
Primarily assigned technology terms:
- algorithm
- analyzer
- approximation
- automatic speech recognition
- backoff smoothing
- baum-welch algorithm
- capitalization
- classifiers
- computational linguistics
- decoder
- disambiguation
- disambiguation procedure
- editing
- encoding
- entity recognition
- entity recognition system
- error reduction
- expert system
- indexing
- information retrieval
- information retrieval system
- learning
- learning algorithm
- modeling
- morpheme segmentation
- morphological analysis
- morphological analyzer
- morphological analyzers
- morphological disambiguation
- morphological disambiguation system
- morphological generation
- morphology
- named entity recognition
- parameter estimation
- parser
- parsing
- pos tagger
- pos tagging
- processing
- re-estimation
- recognition
- recognition system
- recognizer
- retrieval system
- searching
- segmentation
- segmentation and pos tagging
- segmenter
- semi-supervised learning
- smoothing
- smoothing method
- smoothing techniques
- speech recognition
- speech recognizer
- supervised training
- tagger
- taggers
- tagging
- text encoding
- text representation
- transformation learning
- unsupervised learning
- viterbi
- word segmenter
Other assigned terms:
- acoustic likelihood
- affix
- affixes
- ambiguity
- ambiguous words
- annotated corpus
- approach
- arabic corpora
- arabic morphology
- association for computational linguistics
- backoff
- case
- chunk
- complex word
- confusion matrix
- contextual information
- corpora
- data set
- data sparseness
- data sparseness problem
- dictionary
- disambiguation system
- distribution
- english text
- estimation
- experimental results
- fact
- generation
- grammar
- hand-crafted grammar
- hebrew corpus
- hierarchical model
- hmm model
- index
- interpretation
- knowledge
- language model
- large scale corpus
- lattice
- lemma
- lexemes
- lexical items
- lexicon
- likelihood
- linguistic
- linguistics
- method
- modern hebrew
- morpheme
- morphemes
- morpho-lexical probabilities
- morphological features
- mwes
- named entity
- names
- nouns
- phrase
- probabilistic model
- probabilities
- probability
- procedure
- process
- proper names
- sentence
- sentence representation
- sparseness problem
- static knowledge
- statistics
- stochastic model
- supervised model
- symbol
- symbols
- syntactic constraints
- tag set
- tagged corpus
- tagged text
- tags
- tagset
- term
- test corpus
- text
- time complexity
- trained model
- training
- training corpora
- training corpus
- transformation
- transition probabilities
- uniform distribution
- unknown word model
- untagged corpus
- word
- word formation
- word formation rules
- word model
- word type
- word types
- word-based model
- words