1. 基于多尺度特征与注意力机制的宫颈病变检测.
- Author
-
冯婷, 应捷, 杨海马, and 李芳
- Subjects
- *
CERVICAL intraepithelial neoplasia , *IMAGE recognition (Computer vision) , *FEATURE extraction , *MULTISPECTRAL imaging , *TRANSFORMER models - Abstract
CIN (Cervical Intraepithelial Neoplasm) is a precancerous lesion of the cervix with a high correlation to invasive cervical cancer. Accurate detection and classification of CIN is helpful to reduce the rate of severe cervical cancer. YOLOv5-CBTR(You Only Look Once version 5 Convolutional Block Transformer) cervical lesion detection method is proposed to address the issues of low accuracy in detection and classification of cervical lesions by combining multi scale features and multiple attention mechanisms. The backbone network employs the SECSP (SENet BottleneckCSP) with SENet (Squeeze and Excitation Networks) attention mechanism for feature extraction. The Transformer encoder module is introduced to fuse and amplify multi-feature information, and multi-head attention mechanism is used to enhance the feature extraction ability of lesion regions. Convolutional attention modules are introduced into the feature fusion layer for multiscale fusion of lesion feature information. The power transformation is introduced into the calculation of the boundary regression box, which speeds up the convergence of the models loss function and realizes the detection and classification of cervical lesions. The experimental results show that the accuracy, recall rate, mAP(mean Average Precision), and F value of YOLOv5 CBTR model for the detection and classification of RGB cervical lesion images are 93.99%, 92. 91%, 92.80%, and 93.45%, respectively. The mAP and F values of the model in multispectral cervical image detection and classification are 97.68% and 95.23%, respectively. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF