1. Association of Protein Translation and Extracellular Matrix Gene Sets with Breast Cancer Metastasis: Findings Uncovered on Analysis of Multiple Publicly Available Datasets Using Individual Patient Data Approach
- Author
-
Shantanu Sapru and Nilotpal Chowdhury
- Subjects
Adult ,Multivariate analysis ,Microarray ,Science ,Breast Neoplasms ,Computational biology ,Biology ,Bioinformatics ,Metastasis ,Young Adult ,Breast cancer ,Databases, Genetic ,medicine ,Humans ,Neoplasm Metastasis ,Aged ,Oligonucleotide Array Sequence Analysis ,Proportional Hazards Models ,Aged, 80 and over ,Extracellular Matrix Proteins ,Multidisciplinary ,Microarray analysis techniques ,Gene Expression Profiling ,Middle Aged ,medicine.disease ,Prognosis ,Extracellular Matrix ,Gene expression profiling ,Protein Biosynthesis ,Multivariate Analysis ,Medicine ,Female ,DNA microarray ,Proteinaceous extracellular matrix ,Research Article - Abstract
IntroductionMicroarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis.AimThe aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets.MethodsFour microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate - adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA).ResultsGene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed.ConclusionTo the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. This methodology seems to yield new and interesting results and may be used as a tool to guide new research.
- Published
- 2015