1. MZmine 2: Modular framework for processing, visualizing, and analyzing mass spectrometry-based molecular profile data
- Author
-
Sandra Castillo, Alejandro Villar-Briones, Matej Orešič, and Tomáš Pluskal
- Subjects
Proteomics ,Computer science ,Computational biology ,RANSAC ,lcsh:Computer applications to medicine. Medical informatics ,Mass spectrometry ,computer.software_genre ,01 natural sciences ,Biochemistry ,Mass Spectrometry ,03 medical and health sciences ,Metabolomics ,Software Design ,Structural Biology ,Biomarker discovery ,lcsh:QH301-705.5 ,Molecular Biology ,030304 developmental biology ,0303 health sciences ,business.industry ,Applied Mathematics ,010401 analytical chemistry ,Experimental data ,Modular design ,0104 chemical sciences ,Computer Science Applications ,Visualization ,lcsh:Biology (General) ,lcsh:R858-859.7 ,Data mining ,DNA microarray ,business ,computer ,Functional genomics ,Algorithms ,Software - Abstract
Background Mass spectrometry (MS) coupled with online separation methods is commonly applied for differential and quantitative profiling of biological samples in metabolomic as well as proteomic research. Such approaches are used for systems biology, functional genomics, and biomarker discovery, among others. An ongoing challenge of these molecular profiling approaches, however, is the development of better data processing methods. Here we introduce a new generation of a popular open-source data processing toolbox, MZmine 2. Results A key concept of the MZmine 2 software design is the strict separation of core functionality and data processing modules, with emphasis on easy usability and support for high-resolution spectra processing. Data processing modules take advantage of embedded visualization tools, allowing for immediate previews of parameter settings. Newly introduced functionality includes the identification of peaks using online databases, MSn data support, improved isotope pattern support, scatter plot visualization, and a new method for peak list alignment based on the random sample consensus (RANSAC) algorithm. The performance of the RANSAC alignment was evaluated using synthetic datasets as well as actual experimental data, and the results were compared to those obtained using other alignment algorithms. Conclusions MZmine 2 is freely available under a GNU GPL license and can be obtained from the project website at: http://mzmine.sourceforge.net/. The current version of MZmine 2 is suitable for processing large batches of data and has been applied to both targeted and non-targeted metabolomic analyses.
- Published
- 2010
- Full Text
- View/download PDF