ACL RD-TEC 1.0 Summarization of P99-1067
Paper Title:
AUTOMATIC IDENTIFICATION OF WORD TRANSLATIONS FROM UNRELATED ENGLISH AND GERMAN CORPORA
Primarily assigned technology terms:
- ambiguity resolution
- chi-square test
- clustering
- co-occurrence counting
- computing
- decomposition
- human language
- human language acquisition
- identification
- jaccard coefficient
- language acquisition
- lemmatization
- lemmatizer
- matrix permutation
- pre-processing
- processing
- quantitative evaluation
- random selection
- similarity computation
- singular value decomposition
- spelling
- statistical methods
- syntactical analysis
- terminology
- translators
- vector comparison
- word clustering
Other assigned terms:
- ambiguity
- ambiguous word
- approach
- case
- co-occurrence
- co-occurrence matrix
- co-occurrences
- coefficient
- comparable corpora
- conditional probabilities
- conditional probability
- content words
- context feature
- corpora
- corpus frequency
- correlation
- cosine coefficient
- cosine measure
- dice
- dice coefficient
- dictionary
- disk
- english corpus
- english translation
- english translations
- english vocabulary
- feature
- function word
- function words
- german corpus
- german dictionary
- german text
- implementation
- inflectional language
- knowledge
- knowledge base
- language pair
- language resources
- lemma
- lexicon
- log-likelihood
- log-likelihood ratio
- mapping
- meaning
- measure
- measures
- method
- monolingual corpora
- morphological lexicon
- mutual information
- names
- natural languages
- non-parallel corpora
- parallel corpora
- parallel corpus
- parallel texts
- particles
- permutation
- probabilities
- probability
- procedure
- processing time
- pronouns
- random order
- seed
- seed words
- segments
- semantic
- similarity function
- similarity measures
- source language
- sparse data
- synonyms
- syntactic patterns
- target language
- terms
- test set
- text
- text segments
- tokens
- translation candidates
- translations
- verb
- vocabulary
- window size
- word
- word alignments
- word form
- word frequency
- word order
- word sequences
- word types
- words