ACL RD-TEC 1.0 Summarization of A92-1018
Paper Title:
A PRACTICAL PART-OF-SPEECH TAGGER
A PRACTICAL PART-OF-SPEECH TAGGER
Authors: Doug Cutting and Julian Kupiec and Jan Pedersen and Penelope Sibun
Primarily assigned technology terms:
- algorithm
- automaton
- baum-welch algorithm
- bootstrap
- case marking
- computing
- database
- deterministic finite state automaton
- disambiguation
- disambiguation algorithm
- dynamic programming
- finite state
- finite state automaton
- forward-backward algorithm
- grammatical function assignment
- hidden markov
- hidden markov model
- hidden markov modeling
- homograph disambiguation
- identification
- indexing
- knowledge bases
- language processing
- language processing system
- lexical disambiguation
- markov model
- markov modeling
- maximum likelihood
- modeling
- noun homograph disambiguation
- parameter estimation
- parameter smoothing
- part-of-speech tagger
- part-of-speech tagging
- phrase recognition
- processing
- recognition
- regular expression
- rule-based approach
- search
- sense disambiguation
- shallow analysis
- smoothing
- state automaton
- statistical methods
- stochastic process
- supervised training
- tagger
- taggers
- tagging
- text access \
- text tagging
- tokenization
- tokenizer
- tuning
- viterbi
- viterbi algorithm
- viterbi algorithm \
- word sense disambiguation
Other assigned terms:
- adjective
- adverb
- ambiguity
- approach
- array
- break
- brown corpus
- case
- characters
- convergence
- corpora
- determiner
- dictionaries
- distribution
- dynamic programming procedure
- encyclopedia
- english lexicon
- english text
- estimation
- fact
- first-order model
- formalism
- french
- grammar
- grammatical function
- grammatical functions
- implementation
- index
- interpolation
- interpretation
- joint probability
- knowledge
- large corpora
- large text corpora
- lexical item
- lexicon
- likelihood
- linear complexity
- linguistic
- linguistic phenomena
- linguistic structure
- lisp
- local context
- lookahead
- mapping
- maps
- meaning
- meanings
- measure
- measures
- mechanisms
- method
- methodology
- model parameters
- modular architecture
- noun phrase
- noun phrases
- nouns
- numerical stability
- paragraph
- part of speech
- part-of-speech
- part-of-speech information
- part-of-speech tag
- parts of speech
- phrase
- phrase attachment
- prepositional phrase
- prepositional phrase attachment
- prepositional phrases
- priori
- probabilities
- probability
- probability distribution
- procedure
- process
- processing module
- pronoun
- pronouns
- punctuation
- queries
- recursion
- regular expressions
- sense distinctions
- sentence
- sentence boundaries
- sentence boundary
- sentences
- statistics
- stem
- stems
- suffix
- suffixes
- symbol
- syntactic evidence
- system architecture
- tag sequence
- tagged corpora
- tagged text
- tags
- tagset
- target word
- term
- terms
- text
- text corpora
- text corpus
- text database
- time complexity
- tokens
- training
- training corpora
- training corpus
- training data
- training set
- training text
- transition probabilities
- trees
- uniform probability
- untagged text
- verb
- verb group
- verb groups
- vocabulary
- word
- word order
- word sense
- word stem
- words