Back to Search Start Over

Correcting mistakes in predicting distributions

Authors :
Valérie Marot-Lassauzaie
Michael Bernhofer
Burkhard Rost
Source :
Bioinformatics
Publication Year :
2018
Publisher :
Oxford University Press (OUP), 2018.

Abstract

Motivation Many applications monitor predictions of a whole range of features for biological datasets, e.g. the fraction of secreted human proteins in the human proteome. Results and error estimates are typically derived from publications. Results Here, we present a simple, alternative approximation that uses performance estimates of methods to error-correct the predicted distributions. This approximation uses the confusion matrix (TP true positives, TN true negatives, FP false positives and FN false negatives) describing the performance of the prediction tool for correction. As proof-of-principle, the correction was applied to a two-class (membrane/not) and to a seven-class (localization) prediction. Availability and implementation Datasets and a simple JavaScript tool available freely for all users at http://www.rostlab.org/services/distributions. Supplementary information Supplementary data are available at Bioinformatics online.

Details

ISSN :
13674811 and 13674803
Volume :
34
Database :
OpenAIRE
Journal :
Bioinformatics
Accession number :
edsair.doi.dedup.....294e36da8192dc5ccd3abf22e1b386e0
Full Text :
https://doi.org/10.1093/bioinformatics/bty346