Relabeling metabolic pathway data with groups to improve prediction outcomes

Authors :
Steven J. Hallam
Abdur Rahman M. A. Basher
Publication Year :
2020
Publisher :
Cold Spring Harbor Laboratory, 2020.

Abstract

Metabolic pathway inference from genomic sequence information is an integral scientific problem with wide-ranging applications in the life sciences. As sequencing throughput increases, scalable and performant methods for pathway prediction at different levels of genome complexity and completion become essential. In this paper, we present reMap (relabeling metabolic pathway data with groups), a simple yet generic framework that relabels examples with a different set of labels, characterized as groups. A pathway group comprises a subset of statistically correlated pathways, and a given pathway can be distributed among multiple pathway groups. This has important implications for pathway prediction, where a learning algorithm can revisit a pathway multiple times across groups to improve sensitivity. The relabeling process in reMap is achieved through an alternating feedback process. In the first, feed-forward phase, a minimal subset of pathway groups is picked to label each example. In the second, feed-backward phase, reMap’s internal parameters are updated to increase the accuracy of mapping examples to pathway groups. The resulting pathway group dataset is then used to train a multi-label learning algorithm. reMap’s effectiveness was evaluated on metabolic pathway prediction, where the resulting performance metrics equaled or exceeded those of other prediction methods on organismal genomes.
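
The idea of labeling an example with a small set of pathway groups can be illustrated with a toy sketch. This is not reMap's actual procedure: the abstract does not specify how groups are selected or how parameters are updated, so the sketch substitutes a plain greedy set-cover heuristic for the feed-forward phase and omits the feed-backward phase entirely. The function name `relabel_with_groups`, the group names, and the pathway assignments are all hypothetical.

```python
from typing import Dict, List, Set


def relabel_with_groups(pathways: Set[str], groups: Dict[str, Set[str]]) -> List[str]:
    """Greedy set cover (an assumption, not reMap's method):
    pick a small set of groups whose union covers the example's pathways."""
    uncovered = set(pathways)
    chosen: List[str] = []
    while uncovered:
        # Take the group covering the most still-uncovered pathways.
        # Iterating in sorted order makes ties resolve alphabetically.
        best = max(sorted(groups), key=lambda g: len(groups[g] & uncovered))
        gain = groups[best] & uncovered
        if not gain:
            break  # the remaining pathways belong to no group
        chosen.append(best)
        uncovered -= gain
    return chosen


# Hypothetical toy data: "TCA cycle" is shared by two groups, mirroring the
# statement that a pathway can be distributed among multiple pathway groups.
groups = {
    "g1": {"glycolysis", "TCA cycle"},
    "g2": {"TCA cycle", "urea cycle"},
    "g3": {"urea cycle"},
}
example_pathways = {"glycolysis", "TCA cycle", "urea cycle"}
group_labels = relabel_with_groups(example_pathways, groups)
print(group_labels)
```

In this toy case the example ends up labeled with `g1` and `g2`, so a downstream multi-label learner trained on group labels would encounter "TCA cycle" through both groups.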

Details

Database :
OpenAIRE
Accession number :
edsair.doi...........53cfa2f3947b4d24e6dbb81223e77f04
Full Text :
https://doi.org/10.1101/2020.08.21.260109