ACL RD-TEC 1.0 Summarization of P06-2100
Paper Title:
MORPHOLOGICAL RICHNESS OFFSETS RESOURCE DEMAND – EXPERIENCES IN CONSTRUCTING A POS TAGGER FOR HINDI
Authors: Smriti Singh and Kuhoo Gupta and Manish Shrivastava and Pushpak Bhattacharyya
Primarily assigned technology terms:
- algorithm
- analyzer
- chunker
- computational linguistics
- conditional random fields
- cross validation
- decision tree
- decision trees
- disambiguation
- error-driven learning
- language processing
- learning
- learning algorithm
- learning methods
- learning technique
- learning techniques
- lexicon lookup
- linguistic analysis
- machine learning
- machine learning methods
- machine learning techniques
- markov model
- maximum entropy
- morphological analysis
- morphology
- morphology-based disambiguation
- named-entity detection
- natural language processing
- neural networks
- parser
- parsing
- pos tagger
- pos tagging
- processing
- ranking
- rule learning
- spelling
- statistical approaches
- statistical learning
- stemmer
- stochastic tagger
- tagger
- taggers
- tagging
- transformation-based error-driven learning
- validation
Other assigned terms:
- adjective
- adverb
- ambiguity
- ambiguous word
- ambiguous words
- annotated corpora
- approach
- association for computational linguistics
- auxiliary verbs
- bias
- case
- case information
- category label
- context information
- context window
- contextual information
- copula verb
- corpora
- derivational morphology
- disambiguation task
- distribution
- english sentence
- entropy
- fact
- feature
- feature information
- foreign words
- free-word order
- hindi
- implementation
- japanese language
- knowledge
- lexical categories
- lexicon
- linguistic
- linguistics
- main verb
- method
- methodology
- modality
- morpheme
- morphemes
- morphological information
- morphological structure
- named-entity
- natural language
- negation
- nouns
- paninian
- parameter values
- part-of-speech
- particles
- parts of speech
- pos category
- pos tag
- process
- pronoun
- rule type
- semantic
- semantic information
- sentence
- sentences
- statistical approach
- stem
- syntactic rules
- tagging task
- tags
- technique
- terms
- test corpora
- testing set
- tokens
- training
- training corpora
- training corpus
- training instance
- training set
- tree
- trees
- verb
- verb form
- verb group
- verb groups
- vowel
- window size
- word
- word level
- wordnet
- words