ACL RD-TEC 1.0 Summarization of P04-3031
Paper Title:
NLTK: THE NATURAL LANGUAGE TOOLKIT
NLTK: THE NATURAL LANGUAGE TOOLKIT
Authors: Steven Bird and Edward Loper
Primarily assigned technology terms:
- algorithm
- analyzer
- brill tagger
- chunking
- classifiers
- decision tree
- disambiguation
- discourse representation
- encoding
- hidden markov
- hidden markov model
- language processing
- markov model
- morphological analyzer
- natural language processing
- nlp
- nltk
- parser
- parsing
- processing
- reading
- smoothing
- smoothing techniques
- statistical natural language processing
- statistical smoothing
- stemmer
- tagger
- tokenizer
- word-sense disambiguation
Other assigned terms:
- approach
- brown corpus
- chunk
- context free grammars
- corpora
- data sets
- data structure
- design and implementation
- discourse
- discourse representation theory
- distribution
- document
- grammar
- grammars
- implementation
- interoperability
- interpreter
- language processing task
- language processing tasks
- mapping
- method
- names
- natural language
- natural language texts
- open source license
- parse
- penn treebank
- penn treebank corpus
- probability
- probability distributions
- processing module
- processing tasks
- representation theory
- sentence
- sentences
- statistical natural language
- structure of a sentence
- syntactic structure
- tagged corpus
- teaching
- text
- theory
- tokens
- toolkit
- training
- tree
- treebank
- treebank corpus
- word