ACL RD-TEC 1.0 Summarization of C92-1063
Paper Title:
THE TYPOLOGY OF UNKNOWN WORDS: AN EXPERIMENTAL STUDY OF TWO CORPORA
THE TYPOLOGY OF UNKNOWN WORDS: AN EXPERIMENTAL STUDY OF TWO CORPORA
Authors: Xiaobo Ren and Francois Perrault
Primarily assigned technology terms:
- artificial intelligence
- classification
- computer-assisted translation
- data collection
- databases
- dictionary search
- electronic dictionary
- error correction
- error detection
- error detection and correction
- generation method
- hypothesis generation
- language processing
- machine translation
- machine translation system
- morphology
- natural language processing
- nlp
- nlp system
- nlp systems
- processing
- proof reading
- reading
- search
- spelling
- syntactic analysis
- tile
- tokenization
- translation system
- translators
- typographical error correction
Other assigned terms:
- abbreviations
- accent
- affixation
- affixes
- alphabet
- blank space
- case
- characters
- composition
- compositionality
- compounds
- computer program
- corpora
- derivation
- dictionaries
- dictionary
- distribution
- document
- english corpus
- fact
- foreign words
- french
- french corpus
- frequency distribution
- generation
- grammar
- hansard corpus
- hypotheses
- hypothesis
- inflectional morphology
- intelligence
- knowledge
- linguistic
- linguistic knowledge
- measure
- method
- natural language
- noise
- nouns
- part of speech
- probability
- process
- proper noun
- punctuation
- search space
- semantic
- semantic compositionality
- sentence
- source text
- spoken language
- style
- suffix
- technique
- technology
- text
- tokens
- transcript
- transcripts
- translations
- transposition
- typographical errors
- verb
- vocabulary
- word
- word types
- words