Back to Search
Start Over
Improving disease gene prioritization using the semantic similarity of Gene Ontology terms
- Source :
- Bioinformatics
- Publication Year :
- 2010
- Publisher :
- Oxford University Press, 2010.
-
Abstract
- Motivation: Many hereditary human diseases are polygenic, resulting from sequence alterations in multiple genes. Genomic linkage and association studies are commonly performed for identifying disease-related genes. Such studies often yield lists of up to several hundred candidate genes, which have to be prioritized and validated further. Recent studies discovered that genes involved in phenotypically similar diseases are often functionally related on the molecular level. Results: Here, we introduce MedSim, a novel approach for ranking candidate genes for a particular disease based on functional comparisons involving the Gene Ontology. MedSim uses functional annotations of known disease genes for assessing the similarity of diseases as well as the disease relevance of candidate genes. We benchmarked our approach with genes known to be involved in 99 diseases taken from the OMIM database. Using artificial quantitative trait loci, MedSim achieved excellent performance with an area under the ROC curve of up to 0.90 and a sensitivity of over 70% at 90% specificity when classifying gene products according to their disease relatedness. This performance is comparable or even superior to related methods in the field, albeit using less and thus more easily accessible information. Availability: MedSim is offered as part of our FunSimMat web service (http://www.funsimmat.de). Contact: mario.albrecht@mpi-inf.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online.
- Subjects :
- Statistics and Probability
Candidate gene
Multifactorial Inheritance
Disease
Quantitative trait locus
Biology
Biochemistry
Ranking (information retrieval)
Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
Mice
Semantic similarity
Databases, Genetic
Animals
Humans
Molecular Biology
Gene
Genetic association
Genetics
Genetic Diseases, Inborn
Computational Biology
Proteins
Original Papers
Computer Science Applications
Semantics
Computational Mathematics
Benchmarking
Computational Theory and Mathematics
Genes
Text Mining, Ontologies and Databases
Software
Subjects
Details
- Language :
- English
- ISSN :
- 13674811 and 13674803
- Volume :
- 26
- Issue :
- 18
- Database :
- OpenAIRE
- Journal :
- Bioinformatics
- Accession number :
- edsair.doi.dedup.....05b67ee9e0c7843c0f09d47d6d72201a