Back to Search Start Over

A BERT-GNN Approach for Metastatic Breast Cancer Prediction Using Histopathology Reports.

Authors :
Basaad, Abdullah
Basurra, Shadi
Vakaj, Edlira
Eldaly, Ahmed Karam
Abdelsamea, Mohammed M.
Source :
Diagnostics (2075-4418). Jul2024, Vol. 14 Issue 13, p1365. 20p.
Publication Year :
2024

Abstract

Metastatic breast cancer (MBC) continues to be a leading cause of cancer-related deaths among women. This work introduces an innovative non-invasive breast cancer classification model designed to improve the identification of cancer metastases. While this study marks the initial exploration into predicting MBC, additional investigations are essential to validate the occurrence of MBC. Our approach combines the strengths of large language models (LLMs), specifically the bidirectional encoder representations from transformers (BERT) model, with the powerful capabilities of graph neural networks (GNNs) to predict MBC patients based on their histopathology reports. This paper introduces a BERT-GNN approach for metastatic breast cancer prediction (BG-MBC) that integrates graph information derived from the BERT model. In this model, nodes are constructed from patient medical records, while BERT embeddings are employed to vectorise representations of the words in histopathology reports, thereby capturing semantic information crucial for classification by employing three distinct approaches (namely univariate selection, extra trees classifier for feature importance, and Shapley values to identify the features that have the most significant impact). Identifying the most crucial 30 features out of 676 generated as embeddings during model training, our model further enhances its predictive capabilities. The BG-MBC model achieves outstanding accuracy, with a detection rate of 0.98 and an area under curve (AUC) of 0.98, in identifying MBC patients. This remarkable performance is credited to the model's utilisation of attention scores generated by the LLM from histopathology reports, effectively capturing pertinent features for classification. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
20754418
Volume :
14
Issue :
13
Database :
Academic Search Index
Journal :
Diagnostics (2075-4418)
Publication Type :
Academic Journal
Accession number :
178695707
Full Text :
https://doi.org/10.3390/diagnostics14131365