ACL RD-TEC 1.0 Summarization of W06-1647
Paper Title:
LEXICON ACQUISITION FOR DIALECTAL ARABIC USING TRANSDUCTIVE LEARNING
Authors: Kevin Duh and Katrin Kirchhoff
Primarily assigned technology terms:
- algorithm
- analysis tool
- analyzer
- automatic analysis tool
- automatic error correction
- automatic learning
- binary classification
- classification
- classifier
- clustering
- clustering algorithm
- clustering method
- computational linguistics
- em algorithm
- em training
- error correcting
- error correction
- expectation-maximization
- graph transducer
- hidden markov
- hidden markov model
- hmm tagger
- hypothesizing
- inductive learning
- k-nearest-neighbor
- language processing
- learner
- learning
- learning algorithm
- learning algorithms
- learning framework
- learning methods
- learning process
- learning task
- lexicon acquisition
- lexicon learning
- machine learner
- machine learning
- machine learning algorithms
- machine learning methods
- markov model
- matching
- modeling
- modeling speech
- morphological analyzer
- morphological analyzers
- morphology
- multi-class classification
- natural language processing
- nlp
- nlp systems
- nlp technology
- optimization
- optimization algorithm
- pos tagger
- pos tagging
- processing
- semi-supervised learning
- smoothing
- spectral graph transducer
- supervised learning
- supervised training
- support vector machines
- tagger
- taggers
- tagging
- tagging system
- transcription
- transducer
- transducers
- transduction
- transductive clustering
- transductive learning
- transliteration
- tuning
- unsupervised learning
- unsupervised tagging
- unsupervised training
Other assigned terms:
- annotated corpora
- annotation
- annotation effort
- annotator
- approach
- arabic language
- association for computational linguistics
- case
- characters
- cluster
- clusters
- co-occurrence
- co-occurrence statistics
- contextual features
- conversational telephone speech
- corpora
- cosine distance
- data set
- data sets
- development set
- distance metric
- distribution
- estimation
- fact
- feature
- feature vectors
- frame
- hypotheses
- hypothesis
- joint distribution
- knowledge
- labeling
- lattice
- learning problem
- lexical choice
- lexicon
- likelihood
- linguistics
- manual annotation
- measure
- method
- modern standard arabic
- morphological information
- natural language
- ngram
- one-vs-rest scheme
- opinions
- oracle
- part-of-speech
- pos information
- pos tag
- precision
- probabilities
- probability
- procedure
- process
- sentences
- standard arabic
- statistics
- stem
- stems
- support vector
- svms
- syntax
- tag sequence
- tagging accuracy
- tags
- tagset
- technology
- term
- terms
- test data
- test set
- text
- token frequency
- tokens
- training
- training data
- training phase
- training samples
- training set
- training text
- transcriptions
- transcripts
- transition probabilities
- transition probability
- translation lexicon
- treebank
- trigram
- vocabulary
- word
- word sequence
- word sequences
- words