ACL RD-TEC 1.0 Summarization of W00-0506
Paper Title:
PRE-PROCESSING CLOSED CAPTIONS FOR MACHINE TRANSLATION
PRE-PROCESSING CLOSED CAPTIONS FOR MACHINE TRANSLATION
Authors: Davide Turcato and Fred Popowich and Paul McFetridge and Devlan Nicholson and Janine Toole
Primarily assigned technology terms:
- backtracking
- capitalization
- final state
- linguistic analysis
- linguistic techniques
- machine translation
- matching
- mt system
- mt systems
- name recognition
- name recognizer
- normalization
- parsing
- pre-processing
- processing
- proper name recognition
- recognition
- recognizer
- regular expression
- segmentation
- segmenter
- speech translation
- speech-to-speech translation
- tagger
- tagging
- tokenization
- tokenizer
- transcription
- transducer
- transducers
- translation process
- translation systems
Other assigned terms:
- abbreviations
- adverb
- ambiguity
- approach
- cache
- capitalization information
- case
- characters
- compound words
- compounding
- comprehension
- determiner
- device
- disjunction
- ellipsis
- evaluation measure
- evaluations
- fact
- feature
- formalism
- grammars
- grammaticality
- idiomatic expressions
- implementation
- interpretation
- knowledge
- language type
- lexical coverage
- lexicon
- linguistic
- linguistic information
- linguistic knowledge
- linguistic resources
- main clause
- meaning
- measure
- names
- negation
- noise
- noun phrase
- nouns
- part of speech
- part of speech tags
- parts of speech
- phrase
- political science
- precision
- predicates
- prepositional phrase
- process
- processing time
- proper name
- proper names
- punctuation
- questionnaire
- relation
- segments
- sentence
- sentence boundary
- sentences
- source text
- speech information
- speech tag
- spoken language
- stem
- symbol
- syntactic ambiguity
- syntactic categories
- syntactic relation
- system architecture
- tags
- technology
- terms
- text
- tokens
- transcribed speech
- translation direction
- translation output
- translation quality
- translations
- user
- utterance
- verb
- visual context
- word
- words