1. A simple yet effective data integration approach to tree-based microarray data classification
- Author
-
Jiuyong Li, Lin Liu, Yi Li, Bing Liu, Liu, Lin, Li,Yi, Liu, Bing, Li, Jiuyong, and 32nd annual international conference of the IEEE engineering in medicine and biology society Buenos Aires, Argentina 31 August-4 September 2010
- Subjects
Lung Neoplasms ,Computer science ,data analysis ,Decision tree ,Information Storage and Retrieval ,Genomics ,computer.software_genre ,Pattern Recognition, Automated ,Data modeling ,pattern classification ,genomics ,Biomarkers, Tumor ,Humans ,genetics ,cellular biophysics ,Oligonucleotide Array Sequence Analysis ,decision trees ,SIMPLE (military communications protocol) ,Microarray analysis techniques ,business.industry ,bioinformatics ,Neoplasm Proteins ,Random forest ,Systems Integration ,Data set ,Statistical classification ,Gene chip analysis ,System integration ,Data mining ,business ,computer ,Algorithms ,Data integration - Abstract
Different biological labs conduct similar experiments on same diseases. It is highly desirable to have a better model based on more experimental results than that on a single result. In this paper, we propose a method for integrating microarray data from multiple sources for building classification models. To test the method, we use three real world microarray data sets from different labs with different experimental devices and environments. Although microarray data is well known for its inconsistencies across labs, we demonstrate that it is possible to build consistent models using data sets from multiple labs. We report our method, experimental results and observations in the paper. Refereed/Peer-reviewed
- Published
- 2010