ACL RD-TEC 1.0 Summarization of P99-1046
Paper Title:
STATISTICAL MODELS FOR TOPIC SEGMENTATION
Primarily assigned technology terms:
- algorithm
- automated essay grading
- capitalization
- computing
- content analysis
- coreference resolution
- database
- detection and tracking
- disambiguation
- discourse segmentation
- document retrieval
- entity recognition
- entity recognition system
- entity recognizer
- entropy modelling
- essay grading
- good-turing smoothing
- indexing
- information retrieval
- ir system
- language modeling
- lemmatizer
- linking
- manufacturing
- matching
- maximum entropy
- maximum entropy model
- message understanding
- modeling
- modelling
- named entity recognition
- named entity recognizer
- nlp
- normalization
- optimisation
- optimisation algorithm
- optimization
- pattern matching
- preprocessing
- processing
- query expansion
- random guess
- recognition
- recognition system
- recognizer
- sampling
- search
- search engine
- searching
- segmentation
- sense disambiguation
- smoothing
- speech recognition
- spoken document retrieval
- structuring
- summarization
- texttiling
- tokenization
- topic detection
- topic detection and tracking
- topic segmentation
- vector space model
- visualization
- word bigram
- word sense disambiguation
Other assigned terms:
- anchors
- annotation
- approach
- bag of words
- baseline performance
- bigram
- broadcast news
- broadcast news corpus
- broadcast news data
- clusters
- cohesion
- concept
- concepts
- conditional probabilities
- content words
- corpora
- correlation
- cue phrases
- cue words
- definite noun
- definite noun phrases
- dictionary
- discourse
- document
- document length
- domain-specific cue
- electronic form
- entropy
- essay
- evaluations
- expansion technique
- fact
- gold standard
- implementation
- index
- keyword
- knowledge
- labeling
- language models
- large corpus
- lexical cohesion
- likelihood
- machine-readable dictionary
- maps
- measure
- measures
- message
- message understanding conference
- method
- named entities
- named entity
- news corpus
- noun phrases
- paragraphs
- parameter values
- phrase
- precision
- probabilities
- probability
- process
- pronoun
- pronouns
- punctuation
- query
- recognition errors
- regular expressions
- retrieval task
- segments
- semantic
- semantic network
- sentences
- statistical model
- statistical models
- synonyms
- task performance
- technique
- television
- term
- terms
- test data
- text
- thesaurus
- topic shift
- topics
- training
- training corpus
- training data
- transcripts
- understanding
- user
- vector space
- vocabulary
- wall street journal text
- word
- word frequency
- word frequency model
- word repetition
- word sense
- word sequence
- word sequences
- word types
- wordnet
- words