ACL RD-TEC 1.0 Summarization of W03-1315
Paper Title:
AN INVESTIGATION OF VARIOUS INFORMATION SOURCES FOR CLASSIFYING BIOLOGICAL NAMES
AN INVESTIGATION OF VARIOUS INFORMATION SOURCES FOR CLASSIFYING BIOLOGICAL NAMES
Authors: Manabu Torii and Sachin Kamboj and K. Vijay-Shanker
Primarily assigned technology terms:
- algorithm
- biomedicine
- classification
- classification method
- classifier
- coreference resolution
- cross-validation
- disambiguation
- entity extraction
- exact matching
- example-based classification
- feature selection
- identification
- identification process
- information extraction
- knearest neighbor
- learning
- matching
- matching algorithm
- name classification
- name detection
- name extraction
- name identification
- name recognition
- name recognizer
- named entity extraction
- nearest neighbors
- nlp
- partial matching
- protein name recognition
- recognition
- recognition system
- recognizer
- right-branching
- sense disambiguation
- string matching
- terminology
- tokenization
- type coercion
- validation
- voting
- word sense disambiguation
Other assigned terms:
- acronym
- annotated corpus
- approach
- bias
- bigram
- biomedical domain
- case
- characters
- classification task
- composition
- compounds
- concepts
- conditional probability
- contextual features
- contextual information
- corpora
- dictionary
- evaluation methodology
- fact
- feature
- genia
- genia corpus
- head noun
- heuristic
- identification task
- information sources
- knowledge
- large corpora
- large corpus
- meaning
- method
- methodology
- named entity
- names
- natural language
- natural language text
- noun phrase
- noun phrases
- nouns
- ontology
- phrase
- precision
- probabilities
- probability
- process
- protein names
- right-branching structure
- similarity score
- similarity scores
- sources of information
- suffix
- suffixes
- terms
- test corpus
- test set
- text
- training
- training set
- transformation
- umls
- unified medical language
- unigram
- word
- word classes
- word level
- word sense
- words