ACL RD-TEC 1.0 Summarization of C00-2170
Paper Title:
JURILINGUISTIC ENGINEERING IN CANTONESE CHINESE: AN N-GRAM-BASED SPEECH TO TEXT TRANSCRIPTION SYSTEM
Authors: B K T'sou, K K Sin, S W K Chan, T B Y Lai, C Lun, K T Ko, G K K Chan, L Y L Cheung
Primarily assigned technology terms:
Other assigned terms:
- ambiguity
- baseline model
- bigram
- case
- character sequence
- characters
- chinese characters
- chinese text
- co-occurrence
- computational complexity
- conditional probability
- corpora
- corpus size
- data sets
- estimation
- experimental results
- homonymy
- intelligibility
- large training
- linguistic
- mandarin chinese
- measure
- measures
- n-gram
- n-gram model
- orthography
- probability
- procedure
- process
- sentence
- simplified chinese
- statistical data
- statistical model
- statistical models
- syllables
- system architecture
- terms
- testing corpora
- testing corpus
- testing data
- text
- text corpora
- tokens
- tone
- training
- training and testing data
- training corpora
- training corpus
- training data
- training set
- trigram
- unigram
- vocabulary
- word
- word morphology
- words