ACL RD-TEC 1.0 Summarization of P05-1046
Paper Title:
UNSUPERVISED LEARNING OF FIELD SEGMENTATION MODELS FOR INFORMATION EXTRACTION
Authors: Trond Grenager and Dan Klein and Christopher Manning
Primarily assigned technology terms:
- abstracting
- acoustic modeling
- algorithm
- computer science
- computing
- conditional random field
- database
- document summarization
- em algorithm
- expectation-maximization
- extraction systems
- final state
- forward-backward algorithm
- grammar learning
- greedy mapping
- hidden markov
- hidden markov models
- hill-climbing
- hmms
- induction
- information extraction
- information extraction systems
- information ordering
- internet
- language processing
- language processing technology
- learning
- learning method
- learning methods
- likelihood estimation
- loss function
- maximum entropy
- maximum likelihood
- maximum likelihood estimation
- modeling
- morphology
- natural language processing
- normalization
- parameter tuning
- parsers
- parsing
- part-of-speech tagging
- processing
- processing technology
- reestimation
- sampling
- search
- segmentation
- semi-supervised learning
- smoothing
- smoothing techniques
- summarization
- supervised learning
- supervised system
- supervised training
- taggers
- tagging
- tokenization
- tuning
- unsupervised learning
- unsupervised segmentation
- unsupervised training
- viterbi
- viterbi algorithm
Other assigned terms:
- abbreviation
- annotated training set
- annotation
- approach
- break
- case
- citation
- cluster
- computer science research
- content words
- convergence
- correlations
- data set
- development set
- discourse
- distribution
- document
- emission model
- english text
- entropy
- estimation
- evaluation method
- evaluations
- events
- experimental results
- fact
- function words
- generative model
- grammar
- heuristic
- human reader
- identity uncertainty
- knowledge
- labeling
- likelihood
- linguistic
- linguistic phenomena
- linguistic structure
- mapping
- markov models
- method
- model parameters
- model structure
- named entity
- natural language
- noise
- part-of-speech
- part-of-speech tagging task
- parts-of-speech
- penn treebank
- probabilistic model
- probability
- probability distributions
- procedure
- process
- punctuation
- search procedure
- search space
- segments
- simultaneity
- statistics
- structured text
- style
- syntax
- system development
- tagging task
- technique
- technology
- term
- test data
- test set
- text
- tokens
- topics
- training
- training data
- training documents
- training examples
- training set
- transition information
- transition matrix
- treebank
- unlabeled examples
- user
- word
- word distribution
- words