ACL RD-TEC 1.0 Summarization of P06-1061
Paper Title:
SEGMENT-BASED HIDDEN MARKOV MODELS FOR INFORMATION EXTRACTION
SEGMENT-BASED HIDDEN MARKOV MODELS FOR INFORMATION EXTRACTION
Authors: Zhenmei Gu and Nick Cercone
Primarily assigned technology terms:
- algorithm
- automaton
- computational linguistics
- conditional random fields
- cross validation
- estimator
- extraction procedure
- extractor
- final state
- finite state
- finite state automaton
- frequency estimator
- hidden markov
- hidden markov model
- hidden markov models
- hmm ie system
- hmm ie systems
- hmm system
- hmms
- ie method
- ie system
- information extraction
- language processing
- learning
- likelihood estimation
- markov model
- matching
- maximum entropy
- maximum likelihood
- maximum likelihood estimation
- modelling
- naive bayes
- name finding
- natural language processing
- nlp
- parameter setting
- post-processing
- probabilistic learning
- probability estimation
- processing
- recognition
- retrieval method
- retrieval system
- segment retrieval
- segment-based hmm ie
- segmentation
- sentence segmentation
- sequence matching
- sgt smoothing
- smoothing
- smoothing method
- speech recognition
- state automaton
- template filling
- template-filling
- text segmentation
- two-step extraction
- validation
Other assigned terms:
- alphabet
- approach
- association for computational linguistics
- case
- concept
- context models
- context size
- data set
- data sparseness
- data sparseness problem
- distribution
- document
- entropy
- estimation
- events
- exact match
- experimental results
- extraction evaluation
- extraction process
- fact
- frequency distribution
- ie evaluation
- ie problem
- ie task
- interpolation
- labeling
- likelihood
- linguistics
- markov models
- measure
- measures
- method
- model parameter
- model parameters
- natural language
- noise
- performance comparison
- performance evaluation
- precision
- probabilities
- probability
- probability distribution
- procedure
- process
- punctuation
- punctuation marks
- retrieval performance
- retrieval precision
- segment boundary
- segments
- sentence
- sentence boundaries
- sentences
- slot
- sparseness problem
- state labeling
- statistical models
- style
- symbol
- symbols
- tags
- technique
- term
- term distribution
- terms
- text
- text segment
- text segments
- tokens
- topology
- training
- training data
- training document
- training documents
- training examples
- transition probabilities
- transition probability
- vocabulary
- word
- words