ACL RD-TEC 1.0 Summarization of M98-1016
Paper Title:
DESCRIPTION OF THE KENT RIDGE DIGITAL LABS SYSTEM USED FOR MUC-7
Authors: Shihong Yu, Shuanhu Bai, and Paul Wu
Primarily assigned technology terms:
- algorithm
- analyzer
- chinese word segmentor
- database
- disambiguation
- entity recognition
- grouping
- hidden markov
- hidden markov modeling
- information extraction
- learning
- learning process
- likelihood estimation
- machine processing
- markov modeling
- matching
- maximum likelihood
- maximum likelihood estimation
- modeling
- named entity recognition
- new word detection
- nlp
- optimization
- part of speech tagging
- part-of-speech tagger
- pattern matching
- pattern-matching
- processing
- recognition
- segmentation
- segmentor
- sentence segmentor and tokenizer
- smoothing
- speech tagger
- speech tagging
- splitting
- tagger
- tagging
- text information extraction
- tokenization
- tokenizer
- viterbi
- viterbi algorithm
- word detection
- word segmentation
- word segmentor
Other assigned terms:
- ambiguous words
- brown corpus
- case
- case information
- characters
- chinese characters
- chinese language
- chinese word
- concept
- context information
- contextual information
- corpora
- dictionary
- domain knowledge
- english text
- english writing
- estimation
- fact
- feature
- generation
- hypothesis
- hypothesis generator
- implementation
- knowledge
- language model
- language models
- lexicon
- likelihood
- local context
- location name
- meaning
- named entities
- named entity
- names
- ne task
- organization names
- orthographic information
- part of speech
- part-of-speech
- prepositions
- probability
- process
- proper names
- research topic
- semantic
- semantic classes
- sentence
- sentences
- sparse data
- sparse data problem
- statistics
- tag sequence
- tag set
- tags
- technology
- terms
- text
- text corpus
- text information
- tokens
- training
- training corpus
- understanding
- word
- word boundary
- word classes
- word sequence
- word sequences
- words
- writing system