ACL RD-TEC 1.0 Summarization of P97-1041

Paper Title:
A TRAINABLE RULE-BASED ALGORITHM FOR WORD SEGMENTATION

Other assigned terms:

  • approach
  • bigram
  • case
  • character sequence
  • characters
  • chinese word
  • chinese words
  • corpora
  • debugging
  • domain knowledge
  • english corpus
  • english language
  • english sentence
  • error rate
  • f-measure
  • fact
  • gold standard
  • idiomatic expressions
  • knowledge
  • language resources
  • large corpus
  • latex
  • lexica
  • lexical resources
  • lexicon
  • measures
  • method
  • names
  • nlp tasks
  • part-of-speech
  • person names
  • phrase
  • phrase attachment
  • precision
  • prefixes and suffixes
  • prepositional phrase
  • prepositional phrase attachment
  • procedure
  • process
  • proper names
  • punctuation
  • rule sequence
  • segmentation accuracy
  • segments
  • sentence
  • sentences
  • suffixes
  • syntax
  • technique
  • test data
  • test set
  • text
  • thai language
  • thai word
  • training
  • training data
  • training set
  • transformation
  • word
  • word boundaries
  • word lists
  • word model
  • words
  • writing system

Extracted Section Types:


This page last edited on 10 May 2017.

*** ***