Back to Search
Start Over
ViT-ILD: A Vision Transformer-based Neural Network for Detection of Interstitial Lung Disease from CT Images.
- Source :
- Procedia Computer Science; 2024, Vol. 235, p779-788, 10p
- Publication Year :
- 2024
-
Abstract
- Interstitial Lung Disease (ILD) is a lung illness characterized by inflammation and scarring. Identifying and categorizing ILD patterns using chest Computed Tomography (CT) images is crucial for diagnosis and treatment planning. Deep learning and computer vision advancements offer the potential for automating medical image examination, such as the transformer model, which identifies intricate dependencies and relationships in data. Chest CT scans provide valuable information for ILD pattern classification and diagnosis. The Vision Transformer (ViT) based Multi-Head Self Attention (MHSA) architecture detects local and global spatial dependencies, focusing on relevant regions and considering contextual interactions. The ViT-based model architecture aims to categorize ILD patterns using MHSA mechanisms. The proposed ViT-ILD model improves the performance by modifying hyperparameters, attention heads, and hidden units. It also utilises techniques of residual connections, layer normalization, and positional encoding for improvement. The proposed method ViT-ILD has achieved comparable training, validation and test accuracy of 100%, 98%, and 82.75% respectively for the 4-class classification with a healthy lung, hypersensitivity pneumonitis, pulmonary fibrosis, and tuberculosis from the MedGift CT dataset. It is observed that the proposed ViT-ILD model has achieved test accuracy, recall, precision, and f1-score of 82.75%, 74.15%, 100%, and 82.35%. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 18770509
- Volume :
- 235
- Database :
- Supplemental Index
- Journal :
- Procedia Computer Science
- Publication Type :
- Academic Journal
- Accession number :
- 177603654
- Full Text :
- https://doi.org/10.1016/j.procs.2024.04.074