ACL RD-TEC 1.0 Summarization of W05-0805
Paper Title:
REVEALING PHONOLOGICAL SIMILARITIES BETWEEN RELATED LANGUAGES FROM AUTOMATICALLY GENERATED PARALLEL CORPORA
Primarily assigned technology terms:
- algorithm
- automatic generation
- clustering
- clustering algorithm
- clustering method
- computing
- data generation
- database
- databases
- em algorithm
- em-based clustering
- expectation maximization
- finite-state transducers
- identification
- induction
- language learning
- learner
- learning
- machine translation
- measuring
- probabilistic clustering
- processing
- qualitative evaluation
- re-estimation
- scoring
- search
- subcategorization
- syllabification
- transcription
- transducers
- translation search
- tuning
- unsupervised clustering
- unsupervised training
- verb subcategorization
- weighted finite-state transducers
Other assigned terms:
- alignment procedure
- approach
- back-transliteration
- bilingual dictionary
- class probability
- cluster
- corpora
- development set
- dictionaries
- dictionary
- distribution
- dutch
- english translation
- generation
- generation process
- generative model
- gold standard
- knowledge
- language pairs
- lexical entries
- lexical level
- lexicon
- likelihood
- linguistic
- linguistics
- measure
- measures
- method
- middle dutch
- modern english
- morphemes
- noise
- noisy input
- nucleus
- parallel corpora
- parallel texts
- phoneme
- phonemes
- phonetic representation
- possible translation
- probabilities
- probability
- probability distribution
- probability model
- procedure
- process
- pronunciation
- pronunciation dictionary
- relation
- similarity score
- similarity scores
- source language
- stem
- stems
- subcategorization frames
- suffix
- suffixes
- syllable structure
- syllables
- syntactic features
- syntax
- target language
- technique
- training
- training corpora
- training data
- training material
- training set
- transcriptions
- translation models
- translation task
- translations
- verb
- vowel
- word
- word form
- word lists
- word order
- word pair
- words