ACL RD-TEC 1.0 Summarization of W03-0308
Paper Title:
TREQ-AL: A WORD ALIGNMENT SYSTEM WITH LIMITED LANGUAGE RESOURCES
Authors: Dan Tufiş, Ana-Maria Barbu, and Radu Ion
Primarily assigned technology terms:
- algorithm
- alignment system
- bootstrapping
- clustering
- data processing
- dictionary extraction
- disambiguation
- encoding
- extraction program
- extraction system
- extractor
- grouping
- linking
- machine translation
- parallel training
- post-processing
- preprocessing
- processing
- processing tools
- recognition
- relative distance
- segmentation
- sense disambiguation
- sentence-alignment
- splitting
- tagger
- tagging
- tokenization
- word alignment
- word clustering
- word sense disambiguation
- word-alignment
Other assigned terms:
- adjective
- alignment accuracy
- alignment error rate
- alignment procedure
- alignment task
- approach
- bilingual dictionary
- bitext
- case
- characters
- collocation
- compounds
- content words
- corpora
- data model
- dictionaries
- dictionary
- error rate
- evaluations
- f-measure
- feature
- functional word
- gold standard
- heuristics
- idiomatic expressions
- knowledge
- language model
- language models
- language resources
- lemma
- linguistic
- linguistic knowledge
- log-likelihood
- main verb
- mapping
- mappings
- measures
- names
- negation
- ontology
- parallel corpora
- parallel corpus
- parallel text
- parallel texts
- parallel training corpus
- polysemy
- precision
- preposition
- priori
- procedure
- proper names
- punctuation
- punctuation mark
- punctuation marks
- sentence
- source language
- symbol
- tagging accuracy
- tags
- tagset
- target languages
- target word
- terms
- test data
- text
- tokens
- training
- training and test data
- training corpus
- training data
- training time
- translation dictionary
- translation equivalence
- translation equivalent
- translation equivalents
- translation pair
- translations
- user
- verb
- word
- word alignment task
- word alignments
- word order
- word sense
- wordnet
- words