ACL RD-TEC 1.0 Summarization of W01-1623
Paper Title:
TOWARD A LARGE SPONTANEOUS MANDARIN DIALOGUE CORPUS
Primarily assigned technology terms:
Other assigned terms:
- abbreviations
- annotation
- case
- characters
- chinese characters
- conversation
- corpora
- determiners
- dialogue acts
- dialogue annotation
- dialogue corpus
- dialogues
- discourse
- discourse markers
- distribution
- high-frequency word
- information science
- intonation
- knowledge
- labeling
- large speech corpora
- lexical database
- linguistic
- linguistic features
- linguistic phenomena
- linguistic structures
- meaning
- negation
- particles
- pauses
- pinyin
- pronouns
- prosody
- segments
- semantic
- semantic meaning
- speech corpora
- speech data
- spoken language
- spontaneous conversation
- spontaneous dialogue
- stress
- tags
- text
- token frequency
- tokens
- transcript
- transcriptions
- turn-taking
- understanding
- utterance
- word
- words
- written texts