ACL RD-TEC 1.0 Summarization of N03-1029
Paper Title:
COMMA RESTORATION USING CONSTITUENCY INFORMATION
COMMA RESTORATION USING CONSTITUENCY INFORMATION
Authors: Stuart M. Shieber and Xiaopeng Tao
Primarily assigned technology terms:
- algorithm
- approximation
- automaton
- boundary detection
- boundary disambiguation
- categorization
- comma restoration
- decoding
- disambiguation
- hmm method
- hmms
- language modeling
- learning
- maximum entropy
- maximum likelihood
- modeling
- parser
- parsers
- parsing
- parsing technology
- part-of-speech tagging
- recognition
- recognition systems
- segmentation
- sentence boundary detection
- sentence boundary disambiguation
- smoothing
- speech recognition
- speech recognition systems
- statistical language modeling
- statistical parser
- statistical parsers
- statistical parsing
- tagging
- thresholding
- transformation-based learning
- viterbi
- viterbi decoding
Other assigned terms:
- approach
- bigram
- case
- constituency information
- constituent structure
- correlation
- data sparsity
- distribution
- duration
- end-user
- entropy
- entropy models
- error rate
- f-measure
- fact
- fmeasure
- hmm model
- joint probability
- k value
- language model
- language modeling toolkit
- language models
- likelihood
- linguistic
- maximum entropy models
- method
- modeling toolkit
- parse
- parsing model
- part of speech
- part of speech tags
- part-of-speech
- penn treebank
- precision
- probabilistic model
- probabilities
- probability
- prosodic features
- prosodic information
- prosody
- punctuation
- segments
- sentence
- sentence boundary
- sentences
- speech information
- speech model
- statistical parsing model
- syntactic information
- syntactic structure
- tags
- technology
- term
- text
- textual information
- toolkit
- training
- training data
- transcribed speech
- transcriptions
- transition probabilities
- tree
- treebank
- treebank parse
- trigram
- trigram language model
- trigram model
- unigram
- word
- word error rate
- words