ACL RD-TEC 1.0 Summarization of W03-1204
Paper Title:
EVALUATION OF FEATURES FOR SENTENCE EXTRACTION ON DIFFERENT TYPES OF CORPORA
Authors: Chikashi Nobata and Satoshi Sekine and Hitoshi Isahara
Primarily assigned technology terms:
- automatic summarization
- categorization
- computer science
- decision tree
- document understanding
- extraction system
- extraction systems
- information extraction
- information retrieval
- learning
- learning methods
- machine learning
- machine learning methods
- multidocument summarization
- pattern discovery
- ranking
- scoring
- scoring function
- scoring method
- segmentation
- sentence extraction
- sentence extraction system
- sentence segmentation
- single-document summarization
- subjective evaluation
- summarization
- summarization system
- summarization systems
- text summarization
- transcription
- word segmentation
Other assigned terms:
- annotator
- annotators
- approach
- compression ratio
- corpora
- correlation
- correlations
- data sets
- distribution
- document
- document frequency
- document set
- f-measure
- feature
- grammaticality
- hypothesis
- interpolation
- measure
- method
- named entities
- nist
- nouns
- parameter values
- precision
- rank correlation
- sentence
- sentence boundaries
- sentence boundary
- sentence position
- sentences
- similarity measure
- speech corpus
- standard deviation
- syntactic information
- term
- term frequency
- test data
- text
- tf * idf
- training
- training data
- transcriptions
- tree
- understanding
- word
- words
- written corpora