ACL RD-TEC 1.0 Summarization of W06-2403
Paper Title:
AUTOMATIC EXTRACTION OF CHINESE MULTIWORD EXPRESSIONS WITH A STATISTICAL TOOL
AUTOMATIC EXTRACTION OF CHINESE MULTIWORD EXPRESSIONS WITH A STATISTICAL TOOL
Authors: Scott S.L. Piao and Guangfan Sun and Paul Rayson and Qi Yuan
Primarily assigned technology terms:
- algorithm
- automatic extraction
- automatic mt
- chinese-english mt
- classification
- collocation extraction
- corpus-based approach
- database
- dictionary compilation
- extraction tool
- identification
- illustration
- lexicon compilation
- machine translation
- mt system
- mt systems
- multilingual processing
- mwe extraction
- nlp
- parser
- pattern-matching
- pos tagger
- pos-tagging
- processing
- rule-based system
- search
- search algorithm
- searching
- statistical approaches
- tagger
- terminology
- tokenisation
Other assigned terms:
- adjective
- adverb
- annotation
- approach
- bilingual dictionary
- case
- characters
- chinese corpus
- chinese language
- chinese word
- co-occurrence
- collocation
- collocational information
- community
- compositionality
- comprehensibility
- concept
- concepts
- conversation
- corpora
- dictionaries
- dictionary
- distribution
- dutch
- fact
- frequency list
- function words
- genre
- human communication
- human speech
- hypothesis
- idiomatic expression
- interpretation
- knowledge
- knowledge database
- large corpus
- lexicon
- likelihood
- linguistic
- linguistic knowledge
- literal translation
- log-likelihood
- mapping
- meaning
- measure
- measures
- monolingual dictionary
- multiword expressions
- mutual information
- mwes
- nlp tasks
- noise
- noun phrases
- parse
- part-of-speech
- phrase
- phrase type
- pos pattern
- precision
- prepositional phrases
- process
- punctuation
- punctuation marks
- semantic
- semantic lexicon
- sentences
- slang
- source language
- statistical measure
- stop word list
- syntactic patterns
- syntactic structures
- tagset
- target language
- target languages
- technical terminology
- technical terms
- terms
- test corpus
- test data
- text
- topics
- translation equivalent
- translation equivalents
- translations
- understanding
- verb
- verb phrase
- word
- word collocation
- word frequency
- word sequences
- words