ACL RD-TEC 1.0 Summarization of C04-1152
Paper Title:
EFFICIENT UNSUPERVISED RECURSIVE WORD SEGMENTATION USING MINIMUM DESCRIPTION LENGTH
EFFICIENT UNSUPERVISED RECURSIVE WORD SEGMENTATION USING MINIMUM DESCRIPTION LENGTH
Authors: Shlomo Argamon and Navot Akiva and Amihood Amir and Oren Kapah
Primarily assigned technology terms:
- algorithm
- approximate matching
- automatic word segmentation
- beam-search
- caching
- coding
- greedy algorithm
- greedy construction
- greedy search
- greedy segmentation
- heuristic algorithm
- incremental algorithm
- indexing
- language processing
- latent semantic analysis
- learning
- map estimation
- matching
- morphological analysis
- morphological segmentation
- morphology
- natural language processing
- postprocessing
- prefix trie
- processing
- recognition
- search
- search algorithm
- search method
- segmentation
- semantic analysis
- suffix trie
- suffix trie construction
- trie construction
- unsupervised learning
- word segmentation
Other assigned terms:
- affix
- affixes
- agglutinative language
- approach
- case
- character sequence
- characters
- composition
- corpora
- data structure
- data structures
- derivation
- dictionary
- distribution
- edit distance
- estimation
- experimental results
- heuristic
- inflection
- latent semantic
- measure
- method
- minimum description length
- morph
- morpheme
- mutual information
- n-gram
- natural language
- orthographic similarity
- prefixes and suffixes
- probabilities
- probability
- process
- schema
- semantic
- semantic context
- semantic similarity
- size of the corpus
- statistical models
- statistics
- stem
- stems
- suffix
- suffixes
- term
- terms
- tokens
- translations
- turkish corpora
- unigram
- vowel
- word
- word types
- words