Back to Search Start Over

Predicting Tumor Type and Residual Status of Suprasellar Lesions Using Indian Discharge Summaries

Authors :
Priyanka C. Nair
Deepa Gupta
Bhagavatula Indira Devi
Vani Kanjirangat
P. Deepak
Source :
IEEE Access, Vol 12, Pp 134379-134410 (2024)
Publication Year :
2024
Publisher :
IEEE, 2024.

Abstract

A suprasellar lesion is an unusual mass in the suprasellar region in the brain. Some common suprasellar lesions include Pituitary Adenoma, Craniopharyngioma and Meningioma. Patients may present with significant visual and other symptoms like headache, and hormonal imbalances. The proposed study utilizes 553 discharge summaries of suprasellar patients admitted during 2013–2019 at NIMHANS hospitals, Bangalore. Classification of discharge summary was conducted using 11 different word embedding techniques, including word2vec, FastText, Glove, and transformer-based embeddings. Tumor type is predicted using advanced ML classifiers like AdaBoost, Random Forest, and XGBoost. The highest F-score of 0.91 was reported for XGBoost when implemented along with SMOTE based data balancing and PCA based feature reduction. To enhance the classification performance of the best performing model, ClinicalBioBERT, a pre-trained BERT model that demonstrated superior results, was finetuned with domain-specific clinical data and resulted in an improvement of the F-score to 0.93. Classification of presence/absence of residual tumor post surgery is also carried out using transformer models and achieved a macro F1-score of maximum 1, after handling the class imbalance using SMOTE. Different combinations of experiments with PCA and SMOTE were carried out in both classification problems. Two Large Language Models: FlanT5 and Bloom, are also investigated in this work for both classification problems Initially, the LLM is employed with a zero-shot classification pipeline, resulting in poor performance. Consequently, fine-tuning of the LLM models are attempted using the discharge summary text, resulting in performance improvements.

Details

Language :
English
ISSN :
21693536
Volume :
12
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.ff2827e5c8854a0f99f2cb36d70f6bdc
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2024.3460976