ACL RD-TEC 1.0 Summarization of P99-1057
Paper Title:
LEARNING TO RECOGNIZE TABLES IN FREE TEXT
LEARNING TO RECOGNIZE TABLES IN FREE TEXT
Authors: Hwee Tou Ng and Chung Yong Lim and Jessica Li Teng Koo
Primarily assigned technology terms:
- algorithm
- boundary recognition
- cd-rom
- classification
- classifier
- classifiers
- column recognition
- computational linguistics
- connectionist learning
- decision tree
- decision tree induction
- deterministic table recognition
- encoding
- extraction system
- extraction systems
- feature extraction
- graphical user interface
- induction
- induction algorithm
- information extraction
- information extraction system
- information extraction systems
- learning
- learning algorithm
- learning algorithms
- learning approach
- learning method
- machine learning
- machine learning algorithms
- one learning
- processing
- recognition
- recognition algorithm
- row recognition
- table recognition
- table row recognition
- text processing
- tree induction
- tree induction algorithm
- user interface
Other assigned terms:
- annotator
- annotators
- approach
- blank space
- case
- character type
- characters
- classification problem
- community
- connectionist
- document
- domain-specific knowledge
- empirical results
- extraction process
- f measure
- feature
- feature description
- html document
- human annotator
- human annotators
- input text
- knowledge
- learning rate
- linguistics
- markup
- measure
- method
- patent
- pre-determined threshold
- precision
- process
- punctuation
- segments
- sentences
- statistical data
- symbols
- table boundary
- technique
- term
- text
- training
- training example
- training examples
- tree
- user
- word
- words