ACL RD-TEC 1.0 Summarization of P03-1006
Paper Title:
GENERALIZED ALGORITHMS FOR CONSTRUCTING STATISTICAL LANGUAGE MODELS
GENERALIZED ALGORITHMS FOR CONSTRUCTING STATISTICAL LANGUAGE MODELS
Authors: Cyril Allauzen and Mehryar Mohri and Brian Roark
Primarily assigned technology terms:
- algorithm
- approximation
- automaton
- backoff bigram
- class-based language modeling
- clustering
- computing
- difference method
- encoding
- extraction system
- final state
- grammar processing
- greedy approach
- illustration
- information extraction
- information extraction system
- language model automaton
- language modeling
- language processing
- large-vocabulary speech recognition
- maximum likelihood
- mining
- model building
- modeling
- natural language processing
- offline optimization
- optimization
- probability semiring
- processing
- processor
- recognition
- recognition system
- recognition systems
- recognizer
- regular expression
- shortest-distance
- shortest-distance algorithm
- smoothing
- smoothing techniques
- speech mining
- speech processing
- speech recognition
- speech recognition system
- speech recognition systems
- splitting
- transducer
- transducers
- transduction
- viterbi
- weight pushing
- weighted automata
- weighted determinization
- weighted difference method
- weighted transducer
Other assigned terms:
- alphabet
- approach
- automata
- backoff
- backoff model
- bigram
- bigram model
- cache
- case
- cluster
- compact representation
- composition
- conditional probabilities
- conditional probability
- distribution
- duration
- experimental results
- finite automaton
- finite set
- grammar
- grammars
- implementation
- information sources
- input string
- interpolation
- interpretation
- joint probability
- labeling
- language model
- language models
- language processing applications
- large-vocabulary speech
- lattice
- lemma
- likelihood
- likelihood probability
- method
- names
- natural language
- natural language processing applications
- negation
- np-hard problem
- numerical stability
- probabilistic model
- probabilities
- probability
- probability distribution
- probability distributions
- procedure
- projection
- proposition
- recursion
- regular expressions
- rewrite rules
- semiring
- sentence
- switchboard training corpus
- symbol
- symbols
- technique
- terms
- text
- theorem
- topology
- training
- training corpus
- trigram
- trigram model
- tropical semiring
- vocabulary
- vocabulary size
- word
- word lattice
- words