Back to Search Start Over

A platform-independent AI tumor lineage and site (ATLAS) classifier.

Authors :
Rydzewski NR
Shi Y
Li C
Chrostek MR
Bakhtiar H
Helzer KT
Bootsma ML
Berg TJ
Harari PM
Floberg JM
Blitzer GC
Kosoff D
Taylor AK
Sharifi MN
Yu M
Lang JM
Patel KR
Citrin DE
Sundling KE
Zhao SG
Source :
Communications biology [Commun Biol] 2024 Mar 13; Vol. 7 (1), pp. 314. Date of Electronic Publication: 2024 Mar 13.
Publication Year :
2024

Abstract

Histopathologic diagnosis and classification of cancer plays a critical role in guiding treatment. Advances in next-generation sequencing have ushered in new complementary molecular frameworks. However, existing approaches do not independently assess both site-of-origin (e.g. prostate) and lineage (e.g. adenocarcinoma) and have minimal validation in metastatic disease, where classification is more difficult. Utilizing gradient-boosted machine learning, we developed ATLAS, a pair of separate AI Tumor Lineage and Site-of-origin models from RNA expression data on 8249 tumor samples. We assessed performance independently in 10,376 total tumor samples, including 1490 metastatic samples, achieving an accuracy of 91.4% for cancer site-of-origin and 97.1% for cancer lineage. High confidence predictions (encompassing the majority of cases) were accurate 98-99% of the time in both localized and remarkably even in metastatic samples. We also identified emergent properties of our lineage scores for tumor types on which the model was never trained (zero-shot learning). Adenocarcinoma/sarcoma lineage scores differentiated epithelioid from biphasic/sarcomatoid mesothelioma. Also, predicted lineage de-differentiation identified neuroendocrine/small cell tumors and was associated with poor outcomes across tumor types. Our platform-independent single-sample approach can be easily translated to existing RNA-seq platforms. ATLAS can complement and guide traditional histopathologic assessment in challenging situations and tumors of unknown primary.<br /> (© 2024. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.)

Details

Language :
English
ISSN :
2399-3642
Volume :
7
Issue :
1
Database :
MEDLINE
Journal :
Communications biology
Publication Type :
Academic Journal
Accession number :
38480799
Full Text :
https://doi.org/10.1038/s42003-024-05981-5