1. A high performance cloud computing platform for mRNA analysis
- Author
-
Feipei Lai, Chia-Ping Shen, Yan-Yu Lam, Feng-Seng Lin, Jeng-Wei Lin, and Hsiao-Ya Sung
- Subjects
Speedup ,Time Factors ,business.industry ,Computer science ,Cancer ,Computational Biology ,Cloud computing ,Models, Theoretical ,computer.software_genre ,Machine learning ,medicine.disease ,Multiclass classification ,Support vector machine ,ComputingMethodologies_PATTERNRECOGNITION ,Benchmark (computing) ,medicine ,Humans ,The Internet ,Data mining ,Artificial intelligence ,RNA, Messenger ,User interface ,business ,computer ,Algorithms - Abstract
Multiclass classification is an important technique to many complex bioinformatics problems. However, their performance is limited by the computation power. Based on the Apache Hadoop design framework, this study proposes a two layer architecture that exploits the inherent parallelism of GA-SVM classification to speed up the work. The performance evaluations on an mRNA benchmark cancer dataset have reduced 86.55% features and raised accuracy from 97.53% to 98.03%. With a user-friendly web interface, the system provides researchers an easy way to investigate the unrevealed secrets in the fast-growing repository of bioinformatics data.
- Published
- 2013