ACL RD-TEC 1.0 Summarization of H94-1054
Paper Title:
JAPANESE WORD SEGMENTATION BY HIDDEN MARKOV MODEL
JAPANESE WORD SEGMENTATION BY HIDDEN MARKOV MODEL
Primarily assigned technology terms:
- algorithm
- boundary determination
- classification
- computing
- data extraction
- decision making
- dynamic programming
- dynamic programming algorithm
- extraction system
- factoring
- finite state
- finite state machines
- forward-backward algorithm
- hidden markov
- hidden markov model
- hidden markov models
- incremental processing
- japanese segmentation
- japanese word processing
- japanese word segmentation
- learning
- likelihood segmentation
- machine translation
- machine translation system
- markov model
- maximum likelihood
- maximum likelihood segmentation
- model development
- morphological processor
- morphology
- preprocessing
- processing
- processing technology
- processor
- programming algorithm
- re-training
- recognition
- s6gmentation
- search
- segmentation
- segmentation algorithm
- stochastic process
- supervised training
- text processing
- text segmenting
- training procedure
- translation system
- viterbi
- viterbi algorithm
- viterbi algorithm \
- word processing
- word processor
- word segmentation
Other assigned terms:
- alphabet
- annotated corpus
- approach
- automatic processing
- character type
- characters
- english text
- error rate
- fact
- generation
- grammar
- grammar rules
- hypothesis
- implementation
- japanese language
- japanese text
- japanese text \
- japanese word
- kanji
- katakana
- knowledge
- knowledge base
- labeling
- large corpus
- lexicon
- likelihood
- markov models
- meaning
- measure
- part of speech
- preprocessor
- priori
- probabilistic model
- probabilities
- probability
- procedure
- process
- segmentation accuracy
- segments
- sentence
- sentences
- set size
- stochastic model
- symbols
- technology
- test corpus
- test data
- text
- topology
- trained model
- training
- training data
- training material
- training set
- training set size
- word
- word boundaries
- word boundary
- word segmentation accuracy
- words