ACL RD-TEC 1.0 Summarization of W05-0710
Paper Title:
CLASSIFYING AMHARIC NEWS TEXT USING SELF-ORGANIZING MAPS
CLASSIFYING AMHARIC NEWS TEXT USING SELF-ORGANIZING MAPS
Authors: Samuel Eyassu and Björn Gambäck
Primarily assigned technology terms:
- artificial neural networks
- categorization
- character recognition
- classification
- classifier
- clustering
- database
- decomposition
- digital library
- document classification
- document clustering
- document indexing
- indexing
- information access
- information exchange
- information retrieval
- language processing
- language processing systems
- latent semantic indexing
- learning
- learning process
- lexicon building
- log-entropy weighting
- matching
- morphological analyser
- neural net
- neural network
- neural networks
- pattern matching
- perceptron
- preprocessing
- processing
- processing tools
- ranking
- recognition
- reporting
- retrieval system
- retrieving
- self-organizing map
- semantic indexing
- singular value decomposition
- spelling
- statistical methods
- statistical techniques
- supervised learning
- terminology
- text categorization
- text classification
- text retrieval
- unsupervised learning
- vector space model
- weighting
Other assigned terms:
- alphabet
- analyser
- approach
- array
- case
- characters
- classification accuracy
- cluster
- clusters
- co-occurrence
- compound words
- compounds
- concept
- corpora
- cosine similarity
- cosine similarity measure
- countrywide communication
- culture
- data sets
- dictionary
- document
- document collection
- document content
- document sets
- electronic publication
- grid
- histogram
- interpretation
- large corpora
- latent semantic
- latin alphabet
- lattice
- lattice structure
- learning strategy
- lexical variation
- lexicon
- linguist
- mapping
- mappings
- maps
- matlab
- meaning
- measure
- measures
- medline
- method
- names
- phoneme
- precision
- process
- pronunciation
- punctuation
- queries
- query
- query vector
- representations
- research and development
- retrieval performance
- semantic
- semitic languages
- signal
- similarity measure
- similarity measures
- style
- symbol
- symbols
- technique
- technology
- term
- term-document matrix
- terms
- test corpus
- test data
- test set
- text
- training
- training corpus
- training set
- treebank
- user
- vector space
- vowel
- weight vector
- weighting formula
- word
- word formation
- words
- writing system