Back to Search Start Over

GSAn: an alternative to enrichment analysis for annotating gene sets.

Authors :
Ayllon-Benitez A
Bourqui R
Thébault P
Mougin F
Source :
NAR genomics and bioinformatics [NAR Genom Bioinform] 2020 Mar 14; Vol. 2 (2), pp. lqaa017. Date of Electronic Publication: 2020 Mar 14 (Print Publication: 2020).
Publication Year :
2020

Abstract

The revolution in new sequencing technologies is greatly leading to new understandings of the relations between genotype and phenotype. To interpret and analyze data that are grouped according to a phenotype of interest, methods based on statistical enrichment became a standard in biology. However, these methods synthesize the biological information by a priori selecting the over-represented terms and may suffer from focusing on the most studied genes that represent a limited coverage of annotated genes within a gene set. Semantic similarity measures have shown great results within the pairwise gene comparison by making advantage of the underlying structure of the Gene Ontology. We developed GSAn, a novel gene set annotation method that uses semantic similarity measures to synthesize a priori Gene Ontology annotation terms. The originality of our approach is to identify the best compromise between the number of retained annotation terms that has to be drastically reduced and the number of related genes that has to be as large as possible. Moreover, GSAn offers interactive visualization facilities dedicated to the multi-scale analysis of gene set annotations. Compared to enrichment analysis tools, GSAn has shown excellent results in terms of maximizing the gene coverage while minimizing the number of terms.<br /> (© The Author(s) 2019. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.)

Details

Language :
English
ISSN :
2631-9268
Volume :
2
Issue :
2
Database :
MEDLINE
Journal :
NAR genomics and bioinformatics
Publication Type :
Academic Journal
Accession number :
33575577
Full Text :
https://doi.org/10.1093/nargab/lqaa017