Back to Search Start Over

CerviFusionNet: A multi-modal, hybrid CNN-transformer-GRU model for enhanced cervical lesion multi-classification

Authors :
Yuyang Sha
Qingyue Zhang
Xiaobing Zhai
Menghui Hou
Jingtao Lu
Weiyu Meng
Yuefei Wang
Kefeng Li
Jing Ma
Source :
iScience, Vol 27, Iss 12, Pp 111313- (2024)
Publication Year :
2024
Publisher :
Elsevier, 2024.

Abstract

Summary: Cervical lesions pose a significant threat to women’s health worldwide. Colposcopy is essential for screening and treating cervical lesions, but its effectiveness depends on the doctor’s experience. Artificial intelligence-based solutions via colposcopy images have shown great potential in cervical lesions screening. However, some challenges still need to be addressed, such as low algorithm performance and lack of high-quality multi-modal datasets. Here, we established a multi-modal colposcopy dataset of 2,273 HPV+ patients, comprising original colposcopy images, acetic acid reactions at 60s and 120s, iodine staining, diagnostic reports, and pathological results. Utilizing this dataset, we developed CerviFusionNet, a hybrid architecture that merges convolutional neural networks and vision transformers to learn robust representations. We designed a temporal module to capture dynamic changes in acetic acid sequences, which can boost the model performance without sacrificing inference speed. Compared with several existing methods, CerviFusionNet demonstrated excellent accuracy and efficiency.

Details

Language :
English
ISSN :
25890042
Volume :
27
Issue :
12
Database :
Directory of Open Access Journals
Journal :
iScience
Publication Type :
Academic Journal
Accession number :
edsdoj.8b479579744446a80ed893c33bc818d
Document Type :
article
Full Text :
https://doi.org/10.1016/j.isci.2024.111313