ACL RD-TEC 1.0 Summarization of X98-1019
Paper Title:
IMPROVING ENGLISH AND CHINESE AD-HOC RETRIEVAL: TIPSTER TEXT PHASE 3 FINAL REPORT
Primarily assigned technology terms:
- ad-hoc retrieval
- algorithm
- automatic retrieval
- bootstrapping
- bracketing
- character indexing
- character representation
- chinese retrieval
- clustering
- clustering algorithm
- data reduction
- document clustering
- document representation
- document retrieval
- document retrieval system
- encoding
- hardware
- indexing
- information retrieval
- iterative clustering
- learning
- learning procedure
- linguistic processing
- machine translation
- matching
- multilingual retrieval
- parsing
- pircs retrieval
- pircs retrieval system
- pos tagger
- pos tagging
- processing
- query expansion
- query term weighting
- ranking
- re-ranking
- recognition
- retrieval engine
- retrieval system
- retrieval systems
- retrieving
- searching
- segmentation
- segmentation method
- segmenter
- sentence parsing
- software program
- stopword removal
- tagger
- tagging
- term weighting
- text detection
- text retrieval
- text searching
- two-stage retrieval
- web searching
- weighting
- word segmentation
Other assigned terms:
- 2-stage pseudo-relevance feedback
- acronym
- ambiguity
- ambiguity problem
- approach
- bigram
- case
- characters
- chinese characters
- chinese sentence
- chinese words
- cluster
- clusters
- concept
- document
- document content
- document frequency
- evaluations
- function words
- index
- inverse document frequency
- language usage
- lexicon
- lexicon entries
- linguistic
- measure
- measures
- method
- mutual information
- natural language
- noun phrase
- noun phrases
- occurrence frequency
- opinion
- paragraph
- parameter settings
- phrase
- phrase level
- precision
- probability
- procedure
- process
- punctuation
- queries
- query
- query length
- query term
- representations
- retrieval model
- seed
- segments
- sentence
- sentences
- statistics
- stems
- stopword list
- synonyms
- technique
- term
- term frequency
- terms
- text
- topics
- trained model
- training
- unigram
- unigram model
- user
- word
- words