Back to Search Start Over

Comparison of Vision Transformers and Convolutional Neural Networks in Medical Image Analysis: A Systematic Review.

Authors :
Takahashi S
Sakaguchi Y
Kouno N
Takasawa K
Ishizu K
Akagi Y
Aoyama R
Teraya N
Bolatkan A
Shinkai N
Machino H
Kobayashi K
Asada K
Komatsu M
Kaneko S
Sugiyama M
Hamamoto R
Source :
Journal of medical systems [J Med Syst] 2024 Sep 12; Vol. 48 (1), pp. 84. Date of Electronic Publication: 2024 Sep 12.
Publication Year :
2024

Abstract

In the rapidly evolving field of medical image analysis utilizing artificial intelligence (AI), the selection of appropriate computational models is critical for accurate diagnosis and patient care. This literature review provides a comprehensive comparison of vision transformers (ViTs) and convolutional neural networks (CNNs), the two leading techniques in the field of deep learning in medical imaging. We conducted a survey systematically. Particular attention was given to the robustness, computational efficiency, scalability, and accuracy of these models in handling complex medical datasets. The review incorporates findings from 36 studies and indicates a collective trend that transformer-based models, particularly ViTs, exhibit significant potential in diverse medical imaging tasks, showcasing superior performance when contrasted with conventional CNN models. Additionally, it is evident that pre-training is important for transformer applications. We expect this work to help researchers and practitioners select the most appropriate model for specific medical image analysis tasks, accounting for the current state of the art and future trends in the field.<br /> (© 2024. The Author(s).)

Details

Language :
English
ISSN :
1573-689X
Volume :
48
Issue :
1
Database :
MEDLINE
Journal :
Journal of medical systems
Publication Type :
Academic Journal
Accession number :
39264388
Full Text :
https://doi.org/10.1007/s10916-024-02105-8