ACL RD-TEC 1.0 Summarization of J96-3004

Paper Title:
A STOCHASTIC FINITE-STATE WORD-SEGMENTATION ALGORITHM FOR CHINESE

Authors: Richard Sproat and William Gales and Chilin Shih and Nancy Chang

Primarily assigned technology terms:

Other assigned terms:

abbreviation
abbreviations
acronym
adjective
adverb
affix
affixation
affixes
ambiguity
approach
arithmetic mean
backoff
backoff model
bias
bigram
case
chinese sentence
chinese text
chinese word
class-based model
cluster
constraint satisfaction
contextual information
corpora
correlation
data consortium
dictionaries
dictionary
dictionary entries
dictionary entry
discourse
discourse context
distance matrix
empty string
english sentence
english text
essay
evaluation method
evaluations
fact
foreign words
foreign-name
frame
genre
grammar
grammars
grammatical features
grammatical information
heuristics
human judgments
human performance
hypotheses
implementation
independence model
interpretation
intonational phrase
knowledge
language model
language models
lattice
lexical information
lexical relations
lexical rules
lexicon
likelihood
linguistic
linguistic constraints
linguistic data
linguistic data consortium
linguistic information
linguistic knowledge
linguistic work
linguistics
machine-readable dictionary
main verb
mandarin chinese
mapping
mappings
maps
maximum likelihood estimate
meaning
meanings
measure
measures
method
modal verb
morpheme
morphemes
morphological information
morphological rules
mutual information
names
natural language
nlp application
nlp task
nouns
orthography
paragraphs
part of speech
part-of-speech
part-of-speech information
pause
personal names
phrase
pinyin
plural noun
precision
precision measure
probabilities
probability
probability estimate
procedure
process
pronunciation
proper noun
punctuation
question formation
relation
relaxation technique
segmentation problem
segments
semantic
semantic class
semantic classes
semantic features
semantic interpretation
sentence
sentences
similarity matrix
similarity measures
singular noun
source language
statistical information
stem
stems
style
suffix
suffixes
syllables
symbols
technique
term
terms
test corpora
test corpus
test set
text
text database
theorem
tokens
tone
toolkit
topics
training
training corpus
transcriptions
transitive closure
trees
unigram
unigram model
verb
vocabulary
word
word boundaries
word classes
word corpus
word frequencies
word frequency
word level
word meaning
words
writing system

Extracted Section Types:

Download the PDF file from the ACL Anthology.
Brwose this paper on the University of Michigan CLAIR Group's interface.