Back to Search Start Over

Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset.

Authors :
Bauer, Chris
Kleinjung, Frank
Smith, Celia J.
Towers, Mark W.
Tiss, Ali
Chadt, Alexandra
Dreja, Tanja
Beule, Dieter
Al-Hasani, Hadi
Reinert, Knut
Schuchhardt, Johannes
Cramer, Rainer
Source :
BMC Bioinformatics; 2011, Vol. 12 Issue 1, p140-153, 14p
Publication Year :
2011

Abstract

Background: Diabetes like many diseases and biological processes is not mono-causal. On the one hand multifactorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics. Results: We present a comprehensive work-flow tailored for analyzing complex data including data from multifactorial studies. The developed approach aims at revealing effects caused by a distinct combination of experimental factors, in our case genotype and diet. Applying the developed work-flow to the analysis of an established polygenic mouse model for diet-induced type 2 diabetes, we found peptides with significant fold changes exclusively for the combination of a particular strain and diet. Exploitation of redundancy enables the visualization of peptide correlation and provides a natural way of feature selection for classification and prediction. Classification based on the features selected using our approach performs similar to classifications based on more complex feature selection methods. Conclusions: The combination of ANOVA and redundancy exploitation allows for identification of biomarker candidates in multi-dimensional MALDI-TOF MS profiling studies with complex experimental design. With respect to feature selection our method provides a fast and intuitive alternative to global optimization strategies with comparable performance. The method is implemented in R and the scripts are available by contacting the corresponding author. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14712105
Volume :
12
Issue :
1
Database :
Complementary Index
Journal :
BMC Bioinformatics
Publication Type :
Academic Journal
Accession number :
62571167
Full Text :
https://doi.org/10.1186/1471-2105-12-140