ACL RD-TEC 1.0 Summarization of A97-1034

Paper Title:
USING SGML AS A BASIS FOR DATA-INTENSIVE NLP

Authors: David McKelvie, Chris Brew, and Henry Thompson

Other assigned terms:

  • annotated corpora
  • annotated corpus
  • annotation
  • annotation scheme
  • approach
  • british national corpus
  • case
  • concept
  • corpora
  • data structures
  • determiners
  • disk
  • distribution
  • document
  • document structure
  • document type definition
  • fact
  • french
  • generalisation
  • generation
  • hierarchical structure
  • implementation
  • index
  • indexing scheme
  • innovation corpus
  • intention
  • interoperability
  • large corpora
  • large scale corpora
  • large text corpora
  • lexicography
  • lexicon
  • linguistic
  • linguistic annotation
  • linguistic structures
  • linguists
  • mapping
  • maptask corpus
  • markup
  • mechanisms
  • method
  • modular architecture
  • names
  • natural language
  • nlp applications
  • paragraphs
  • part of speech
  • part-of-speech
  • part-of-speech tags
  • penn treebank
  • penn treebank tagset
  • phrase
  • process
  • public domain software
  • queries
  • query
  • regular expressions
  • search results
  • sentence
  • sentence boundaries
  • sentence boundary
  • sentences
  • sgml document
  • sgml stream
  • size of the corpus
  • statistics
  • structural information
  • structured text
  • style
  • syntactic structures
  • syntax
  • system architecture
  • tags
  • tagset
  • technology
  • terms
  • text
  • text corpora
  • tipster architecture
  • transformation
  • tree
  • tree structures
  • treebank
  • user
  • word
  • word alignments
  • word level
  • words

Extracted Section Types:


This page last edited on 10 May 2017.
