ACL RD-TEC 1.0 Summarization of P98-2123
Paper Title:
A FREELY AVAILABLE MORPHOLOGICAL ANALYZER, DISAMBIGUATOR AND CONTEXT SENSITIVE LEMMATIZER FOR GERMAN
A FREELY AVAILABLE MORPHOLOGICAL ANALYZER, DISAMBIGUATOR AND CONTEXT SENSITIVE LEMMATIZER FOR GERMAN
Authors: Wolfgang Lezius and Reinhard Rapp and Manfred Wettler
Primarily assigned technology terms:
- a statistical part-of-speech
- algorithm
- analyzer
- automatic indexing
- german morphology
- indexing
- learning
- lemmatization
- lemmatizer
- linking
- lookup algorithm
- machine translation
- morphological analysis
- morphological analyzer
- morphological analyzers
- morphology
- nlp
- parsing
- part-of-speech tagger
- part-of-speech tagging
- pc-kimmo
- processing
- reading
- rule-based tagging
- smoothing
- statistical part-of-speech tagger
- tagger
- taggers
- tagging
- text representation
- unsupervised learning
- unsupervised training
- world wide web
Other assigned terms:
- ambiguity
- ambiguity rate
- ambiguous word
- approach
- bigram
- brown corpus
- case
- community
- compounding
- concept
- conditional probabilities
- corpora
- derivation
- dictionary
- disk
- duden grammar
- english language
- error rate
- events
- feature
- generation
- generation system
- grammar
- grammatical categories
- grammatical category
- grammatical features
- implementation
- infixation
- inflected form
- inflected forms
- inflection
- inflectional language
- intention
- knowledge
- large corpora
- lemma
- lemmata
- lexicon
- linguistic
- linguists
- mood
- morph
- morphological information
- morphological lexicon
- nlp applications
- nouns
- part of speech
- part-of-speech
- part-of-speech tags
- parts of speech
- prefixes and suffixes
- probabilities
- probability
- procedure
- process
- pronoun
- runtime
- segments
- semantic
- sentence
- sentences
- statistical data
- stems
- suffix
- suffixes
- syntactic information
- syntax
- tag sequence
- tag set
- tagged corpora
- tagging accuracy
- tags
- terms
- test corpus
- text
- training
- training corpus
- training text
- trigram
- user
- verb
- vowel
- web site
- word
- word form
- words