Back to Search Start Over

Abstract PR02: Multiple Pathway Learning accurately predicts gene essentiality in the Cancer Cell Line Encyclopedia

Authors :
Kyle Ellrott
Yulia Newton
Vladislav Uzunangelov
Artem Sokolov
Adrian Bivol
Joshua M. Stuart
Evan O. Paull
Daniel E. Carlin
Sahil Chopra
Kiley Graim
Sam Ng
Source :
Cancer Research. 75:PR02-PR02
Publication Year :
2015
Publisher :
American Association for Cancer Research (AACR), 2015.

Abstract

We applied biologically-motivated feature transformations coupled with established machine learning methods to predict gene essentiality in CCLE cell line models. By leveraging additional large datasets, such as The Cancer Genome Atlas PanCancer12 data and MSigDB pathway definitions, we improved the robustness and biological interpretability of our models. We developed a multi-pathway learning (MPL) approach that associates a genetic pathway from MSigDB with a distinct kernel for use in a multiple kernel learning setting. We evaluated the performance of MPL compared to several other regression methods including random forests, kernel ridge regression, and elastic net linear models, We combined multiple approaches using an ensemble technique on the diverse set of predictors. We found that the best performing method was an ensemble combining MPL and random forest predictions. Both models utilized features derived from both gene expression and copy number data, the latter of which were filtered to those predicted as driver events in prior pan-cancer studies. The ensemble method was a joint winner in the recent DREAM 9 gene essentiality prediction challenge. MPL also demonstrated merit as a feature selector when used with other downstream methods. The ensemble performed best at predicting the essentiality of genes involved in cell cycle control (cyclins and cyclin-dependent kinases), protein degradation (proteasome complex), cell proliferation signaling (sonic hedgehog, Aurora-B, RAC1), apoptosis (RB1,TP53) and hypoxia response (VEGF, VHL). Many of the key genes in those pathways are known to be drivers of cancer progression, suggesting our method's utility as a biomarker for detecting key tumorigenic events. The advantage of MPL is that mechanistically coherent gene sets are automatically selected as high scoring pathway kernels (HSPKs). We investigated whether the HSPKs identify cellular processes relevant to the loss of key genes. To do this, we inspected the HSPKs for a few of the most abundantly mutated genes in cancer. The MPL predictor for TP53 included the targets of this transcription factor as well as HSPKs involved in apoptosis, a cellular process regulated by TP53. The retinoblastoma gene (RB1) MPL predictor included RB1 targets as well as HSPKs involved in the regulation of histone deacetylase (HDAC) that interacts with RB1 to suppress DNA synthesis. These findings suggest trends in the MPL results could reveal a pathway-level view of the synthetic lethal architecture of cells. Such a map, that links patterns of pathway expression to potential genetic vulnerabilities, could provide an invaluable tool for exploring new avenues to target cancer cells. Citation Format: Vladislav Uzunangelov, Evan Paull, Sahil Chopra, Daniel Carlin, Adrian Bivol, Kyle Ellrott, Kiley Graim, Yulia Newton, Sam Ng, Artem Sokolov, Joshua Stuart. Multiple Pathway Learning accurately predicts gene essentiality in the Cancer Cell Line Encyclopedia. [abstract]. In: Proceedings of the AACR Special Conference on Computational and Systems Biology of Cancer; Feb 8-11 2015; San Francisco, CA. Philadelphia (PA): AACR; Cancer Res 2015;75(22 Suppl 2):Abstract nr PR02.

Details

ISSN :
15387445 and 00085472
Volume :
75
Database :
OpenAIRE
Journal :
Cancer Research
Accession number :
edsair.doi...........025a09648052c6b37ed07e590eceed7e
Full Text :
https://doi.org/10.1158/1538-7445.compsysbio-pr02