Prediction and interpretation of cancer survival using graph convolution neural networks
- Source :
- Methods
- Publication Year :
- 2021
- Publisher :
- Elsevier BV, 2021.
- Abstract :
- Cancer survival rates have increased significantly over the past two decades for breast, prostate, testicular, and colon cancer, while brain and pancreatic cancers have a much lower median survival rate that has not improved much over the last forty years. This poses the challenge of finding gene markers for early cancer detection and treatment strategies. Different methods, including the regression-based Cox-PH model, artificial neural networks, and more recently deep learning algorithms, have been proposed to predict the survival rate for cancers. In this work we established a novel graph convolution neural network (GCNN) approach called Surv_GCNN to predict the survival rate for 13 different cancer types using the TCGA dataset. For each cancer type, 6 Surv_GCNN models, with graphs generated by correlation analysis, the GeneMania database, and correlation + GeneMania, were trained with and without clinical data to predict the risk score (RS). The performance of the 6 Surv_GCNN models was compared with two existing models, Cox-PH and Cox-nnet. The results showed that Cox-PH had the worst performance among the 8 tested models across the 13 cancer types, while Surv_GCNN models with clinical data reported the best overall performance, outperforming the competing models in 7 out of 13 cancer types: BLCA, BRCA, COAD, LUSC, SARC, STAD, and UCEC. A novel network-based interpretation of Surv_GCNN was also proposed to identify potential gene markers for breast cancer. The signatures learned by the nodes in the hidden layer of Surv_GCNN were identified and linked to potential gene markers by network modularization. The identified gene markers for breast cancer were compared to a total of 213 gene markers from three widely cited lists for breast cancer survival analysis. About 57% of the gene markers obtained by Surv_GCNN with the correlation + GeneMania graph either overlap or directly interact with the 213 genes, supporting the effectiveness of the markers identified by Surv_GCNN.
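To make two ideas from the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It shows (1) one way a gene graph could be derived by correlation analysis, and (2) the Cox negative partial log-likelihood that Cox-PH-style models minimize when they output a risk score. The function names, the 0.5 correlation threshold, and the Breslow-style handling of tied times are all assumptions made for illustration. This record does not describe how Surv_GCNN builds its graphs or defines its loss.

```python
import numpy as np


def correlation_graph(expr, threshold=0.5):
    """Build a gene-gene adjacency matrix from expression data.

    expr: array of shape (n_samples, n_genes); columns are assumed
    to be non-constant so that Pearson correlation is defined.
    threshold: hypothetical cutoff on |correlation| for an edge.
    """
    expr = np.asarray(expr, dtype=float)
    corr = np.corrcoef(expr, rowvar=False)          # (n_genes, n_genes)
    adj = (np.abs(corr) >= threshold).astype(float)
    np.fill_diagonal(adj, 0.0)                      # no self-loops
    return adj


def cox_neg_partial_log_likelihood(risk, time, event):
    """Average Cox negative partial log-likelihood (Breslow-style ties).

    risk:  predicted risk score per sample (higher = worse prognosis)
    time:  observed survival or censoring time per sample
    event: 1 if the event was observed, 0 if the sample was censored
    """
    risk = np.asarray(risk, dtype=float)
    time = np.asarray(time, dtype=float)
    event = np.asarray(event, dtype=float)

    # at_risk[i, j] is True when sample j is still at risk at time[i].
    at_risk = time[None, :] >= time[:, None]

    # log of the summed exp(risk) over each risk set, shifted for stability.
    shift = risk.max()
    weighted = np.exp(risk - shift)[None, :] * at_risk
    log_denom = shift + np.log(weighted.sum(axis=1))

    n_events = max(event.sum(), 1.0)
    return float(-(event * (risk - log_denom)).sum() / n_events)
```

In this sketch, a model whose risk scores rank early events higher receives a lower loss. A graph convolution model would use an adjacency matrix such as the one returned by `correlation_graph` to define which genes exchange information.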
- Subjects :
- Male
Colorectal cancer
Breast Neoplasms
Computational biology
Biology
Article
General Biochemistry, Genetics and Molecular Biology
03 medical and health sciences
Breast cancer
medicine
Humans
Molecular Biology
Survival rate
Survival analysis
030304 developmental biology
0303 health sciences
Artificial neural network
business.industry
Deep learning
030302 biochemistry & molecular biology
Cancer
medicine.disease
Regression
Survival Rate
Neural Networks, Computer
Artificial intelligence
business
Algorithms
Details
- ISSN :
- 1046-2023
- Volume :
- 192
- Database :
- OpenAIRE
- Journal :
- Methods
- Accession number :
- edsair.doi.dedup.....2be5b3b277b9d29e93560f9047654798