Back to Search Start Over

Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation.

Authors :
Lord PW
Stevens RD
Brass A
Goble CA
Source :
Bioinformatics (Oxford, England) [Bioinformatics] 2003 Jul 01; Vol. 19 (10), pp. 1275-83.
Publication Year :
2003

Abstract

Motivation: Many bioinformatics data resources not only hold data in the form of sequences, but also as annotation. In the majority of cases, annotation is written as scientific natural language: this is suitable for humans, but not particularly useful for machine processing. Ontologies offer a mechanism by which knowledge can be represented in a form capable of such processing. In this paper we investigate the use of ontological annotation to measure the similarities in knowledge content or 'semantic similarity' between entries in a data resource. These allow a bioinformatician to perform a similarity measure over annotation in an analogous manner to those performed over sequences. A measure of semantic similarity for the knowledge component of bioinformatics resources should afford a biologist a new tool in their repertoire of analyses.<br />Results: We present the results from experiments that investigate the validity of using semantic similarity by comparison with sequence similarity. We show a simple extension that enables a semantic search of the knowledge held within sequence databases.<br />Availability: Software available from http://www.russet.org.uk.

Details

Language :
English
ISSN :
1367-4803
Volume :
19
Issue :
10
Database :
MEDLINE
Journal :
Bioinformatics (Oxford, England)
Publication Type :
Academic Journal
Accession number :
12835272
Full Text :
https://doi.org/10.1093/bioinformatics/btg153