Back to Search
Start Over
Assessment of intratumoral heterogeneity with mutations and gene expression profiles
- Source :
- PLoS ONE, PLoS ONE, Vol 14, Iss 7, p e0219682 (2019)
- Publication Year :
- 2019
- Publisher :
- Public Library of Science (PLoS), 2019.
-
Abstract
- Intratumoral heterogeneity (ITH) refers to the presence of distinct tumor cell populations. It provides vital information for the clinical prognosis, drug responsiveness, and personalized treatment of cancer patients. As genomic ITH in various cancers affects the expression patterns of genes, the expression profile could be utilized for determining ITH level. Herein, we present a novel approach to directly detect high ITH defined as a larger number of subclones from the gene expression pattern through machine learning approaches. We examined associations between gene expression profile and ITH of 12 cancer types from The Cancer Genome Atlas (TCGA) database. Using stomach adenocarcinoma (STAD) showing high association, we evaluated the performance of our method in predicting ITH by employing three machine learning algorithms using gene expression profile data. We classified tumors into high and low heterogeneity groups using the learning model through the selection of LASSO feature. The result showed that support vector machines (SVMs) outperformed other algorithms (AUC = 0.84 in SVMs and 0.82 in Naïve Bayes) and we were able to improve predictive power by using both combined data from mutation and expression. Furthermore, we evaluated the prediction ability of each model using simulation data generated by mixing cell lines of the Cancer Cell Line Encyclopedia (CCLE), and obtained consistent results with using real dataset. Our approach could be utilized for discriminating tumors with heterogeneous cell populations to characterize ITH.
- Subjects :
- 0301 basic medicine
Support Vector Machine
Databases, Factual
Gene Expression
Machine Learning
Bayes' theorem
Mathematical and Statistical Techniques
0302 clinical medicine
Lasso (statistics)
Adenocarcinomas
Medicine and Health Sciences
Regulation of gene expression
Multidisciplinary
Stomach
Statistics
Genomics
Prognosis
Gene Expression Regulation, Neoplastic
Oncology
Area Under Curve
030220 oncology & carcinogenesis
Physical Sciences
Medicine
Adenocarcinoma
Anatomy
Algorithms
Research Article
Computer and Information Sciences
Science
Computational biology
Biology
Research and Analysis Methods
Carcinomas
Genetic Heterogeneity
03 medical and health sciences
Naive Bayes classifier
Stomach Neoplasms
Artificial Intelligence
Cell Line, Tumor
Support Vector Machines
Genetics
Cancer Genetics
medicine
Humans
Computer Simulation
Statistical Methods
Molecular Biology Techniques
Molecular Biology
Genome, Human
Genetic heterogeneity
Gene Expression Profiling
Biology and Life Sciences
Cancers and Neoplasms
Bayes Theorem
Oncogenes
medicine.disease
Gastrointestinal Tract
Gene expression profiling
030104 developmental biology
ROC Curve
Mutation
Transcriptome
Digestive System
Mathematics
Forecasting
Cloning
Subjects
Details
- ISSN :
- 19326203
- Volume :
- 14
- Database :
- OpenAIRE
- Journal :
- PLOS ONE
- Accession number :
- edsair.doi.dedup.....6dbb157df5812ff902c5fc2c76e704b9
- Full Text :
- https://doi.org/10.1371/journal.pone.0219682