ACL RD-TEC 1.0 Summarization of W04-3246
Paper Title:
LEARNING HEBREW ROOTS: MACHINE LEARNING WITH LINGUISTIC CONSTRAINTS
LEARNING HEBREW ROOTS: MACHINE LEARNING WITH LINGUISTIC CONSTRAINTS
Authors: Ezra Daya and Dan Roth and Shuly Wintner
Primarily assigned technology terms:
- algorithm
- arabic root extraction
- classification
- classifier
- classifiers
- computer science
- cross validation
- disambiguation
- distance function
- dynamic programming
- entity recognition
- error analysis
- feature engineering
- hmms
- identification
- inference algorithm
- information extraction
- information extraction tasks
- learning
- learning approach
- learning techniques
- machine learning
- machine learning approach
- machine learning techniques
- markov model
- matching
- morphological analysis
- morphological analyzers
- morphological disambiguation
- morphology
- multi-class classification
- named entity recognition
- nlp
- parameter tuning
- parsing
- perceptron
- pos tagging
- ranking
- recognition
- root extraction
- scoring
- scoring function
- shallow parsing
- tagging
- tuning
- validation
- vocalization
- winner-take-all mechanism
- word-formation
Other assigned terms:
- 10-fold cross validation
- adjective
- alphabet
- ambiguity
- approach
- case
- characters
- classification problem
- classification task
- confidence measure
- confidence scores
- contextual information
- data sparseness
- development set
- dictionaries
- dictionary
- distribution
- edit distance
- f-measure
- f-score
- fact
- feature
- feature set
- feature space
- feature types
- fmeasure
- human performance
- inflected form
- inflectional morphology
- information sources
- knowledge
- learning environment
- lexeme
- lexemes
- lexical items
- likelihood
- linguist
- linguistic
- linguistic constraints
- linguistic knowledge
- linguistics
- meaning
- measure
- measures
- method
- methodology
- morpheme
- morphemes
- named entity
- natural language
- orthography
- particle
- particles
- pos tagging problem
- precision
- predictive power
- preposition
- prepositions
- probabilities
- probability
- process
- ranking candidate
- semitic languages
- sequential model
- slot
- stem
- stems
- substring
- suffix
- suffixes
- tagging problem
- tags
- test corpus
- test data
- training
- training corpus
- training data
- training set
- verb
- word
- word type
- word types
- word-net
- words