ACL RD-TEC 1.0 Summarization of P97-1017
Paper Title:
MACHINE TRANSLITERATION
MACHINE TRANSLITERATION
Authors: Kevin Knight and Jonathan Graehl
Primarily assigned technology terms:
- algorithm
- approximation
- arabic\/english transliteration
- backtransliterator
- character recognition
- computational linguistics
- corpus analysis
- decision trees
- dictionary lookup
- disambiguation
- em training
- extraction algorithm
- finite-state transducers
- forward transliteration
- hybrid neural-net\/expert-system
- japanese\/english machine translation
- language processing
- learning
- learning algorithm
- learning approach
- machine translation
- machine transliteration
- neural-net\/expert-system
- optical character recognition
- patternmatching
- processing
- pronouncer
- pruning
- recognition
- recognizer
- romanization
- scoring
- scoring method
- search
- shortest-path
- shortest-path extraction
- speech recognition
- speech recognizer
- spelling
- tile
- transducers
- translator
- translators
- transliteration
- transliteration process
- unigram scoring
- viterbi
- weighted finite-state transducers
- word processing
- word selection
Other assigned terms:
- accent
- alphabet
- approach
- auxiliary verbs
- back-transliteration
- bigram
- bigram model
- bilingual dictionaries
- bilingual dictionary
- bilingual glossary
- case
- characters
- composition
- conditional probabilities
- corpora
- device
- dictionaries
- dictionary
- distribution
- english translations
- evaluations
- feature
- frequency list
- generative model
- generative models
- glossary
- hypotheses
- katakana
- knowledge
- language model
- language pairs
- linguistics
- mapping
- mappings
- maps
- method
- names
- noise
- pause
- phoneme
- phonetic alphabet
- phrase
- probabilistic models
- probabilities
- probability
- probability distribution
- probability distributions
- process
- pronunciation
- pronunciation dictionary
- proper names
- recognition errors
- sequence model
- stress
- symbol
- symbols
- technical terms
- terms
- text
- textbook
- theory
- tokens
- training
- training corpora
- training set
- transformation
- translations
- trees
- unigram
- vocabulary
- vowel
- wall street journal corpus
- word
- word sequence
- word sequences
- words
- world knowledge