ACL RD-TEC 1.0 Summarization of W03-1504
Paper Title:
LOW-COST NAMED ENTITY CLASSIFICATION FOR CATALAN: EXPLOITING MULTILINGUAL RESOURCES AND UNLABELED DATA
LOW-COST NAMED ENTITY CLASSIFICATION FOR CATALAN: EXPLOITING MULTILINGUAL RESOURCES AND UNLABELED DATA
Authors: LluÃs Mà rquez and Adrià de Gispert and Xavier Carreras and LluÃs Padró
Primarily assigned technology terms:
- adaboost
- adaboost classifier
- algorithm
- binary classification
- boosting
- boosting algorithm
- bootstrapping
- bootstrapping algorithm
- bootstrapping procedure
- bootstrapping process
- classification
- classifier
- classifiers
- decision trees
- encoding
- entity classification
- greedy agreement
- iterative bootstrapping
- learning
- learning algorithm
- learning algorithms
- learning approaches
- learning process
- linguistic pre-processing
- machine translation
- machine translation system
- mt system
- named entity classification
- ne classification
- ne recognition
- nlp
- normalization
- pre-processing
- recogniser
- recognition
- segmentation
- supervised learning
- supervised learning algorithm
- supervised training
- translation system
- unsupervised learning
- validation
- voting
- voting scheme
- weighted voting
- weighted voting scheme
- weighting
Other assigned terms:
- acronym
- affixes
- annotated corpus
- annotation
- approach
- binary feature
- binary features
- case
- catalan
- characters
- classification error
- classification model
- classification problem
- concept
- confidence measure
- context information
- corpora
- data set
- determiners
- development set
- dictionary
- distribution
- empirical results
- fact
- feature
- gazetteer
- gazetteer information
- hand-tagged corpus
- knowledge
- knowledge base
- language change
- lexical features
- lexical information
- lexical knowledge
- lexical knowledge base
- lexical resources
- linguistic
- linguistic feature
- local context
- manual annotation
- mappings
- meaning
- measure
- measures
- named entities
- named entity
- names
- nlp tasks
- person names
- phrase
- posterior
- precision
- prefixes and suffixes
- prepositions
- procedure
- process
- punctuation
- punctuation mark
- punctuation marks
- recognition errors
- recognition module
- right-hand side
- seed
- sentences
- suffix
- suffixes
- synsets
- technique
- terms
- test set
- text
- training
- training data
- training set
- translation dictionary
- translation pairs
- translations
- tree
- trees
- vocabulary
- word
- word form
- word type
- words