ACL RD-TEC 1.0 Summarization of W03-0316
Paper Title:
POS-TAGGER FOR ENGLISH-VIETNAMESE BILINGUAL CORPUS
POS-TAGGER FOR ENGLISH-VIETNAMESE BILINGUAL CORPUS
Authors: Dinh Dien and Hoang Kiem
Primarily assigned technology terms:
- algorithm
- automatic training
- bootstrap
- bootstrapping
- chunker
- chunking
- classification
- corpus-based natural language processing
- direct projection
- disambiguation
- giza
- language processing
- learning
- learning method
- learning methods
- lexicalization
- machine learning
- natural language processing
- nlp
- parser
- partial disambiguation
- phrase-chunker
- pos tagger
- pos-tagging
- processing
- segmenter
- sense disambiguation
- sentence alignment
- spelling
- spelling checker
- tagger
- tagging
- transformation-based learning
- word alignment
- word segmenter
- word sense disambiguation
- word-alignment
Other assigned terms:
- adjective
- aligned corpus
- annotated corpus
- annotated training corpora
- annotated training corpus
- annotation
- approach
- bilingual corpora
- bilingual corpus
- case
- character code
- checker
- class-based model
- convergence
- conversation
- corpora
- dictionaries
- dictionary
- english corpus
- english pos-tags
- english-vietnamese parallel corpus
- fact
- foreign language
- french
- function words
- grammatical relation
- inheritance
- language typology
- lemma
- linguistic
- linguistic annotation
- linguistic information
- linguistics
- linguists
- mapping
- mapping table
- meanings
- method
- natural language
- nlp tasks
- parallel corpus
- part-of-speech
- parts of speech
- parts-of-speech
- penn tree bank
- penn treebank
- penn treebank tagset
- phrase
- pos tag
- pos-annotation
- precision
- probability
- process
- projection
- ptb
- relation
- rule set
- semantic
- sentence
- sentence boundary
- suffixes
- susanne corpus
- syntactic trees
- tags
- tagset
- technology
- text
- tone
- training
- training corpora
- training corpus
- training data
- training set
- transformation
- transformation rule
- transformation rules
- tree
- tree bank
- treebank
- trees
- unannotated corpora
- unannotated corpus
- unannotated text
- verb
- vietnamese word
- word
- word alignments
- word boundary
- word order
- word sense
- words