1. A novel approach for underwater fish segmentation in complex scenes based on multi-levels triangular atrous convolution.
- Author
-
Yang, Yufang, Li, Dashe, and Zhao, Siwei
- Subjects
- *
CONVOLUTIONAL neural networks , *IDENTIFICATION of fishes , *FEATURE extraction , *BIOLOGICAL monitoring , *FISHERY management , *IMAGE segmentation - Abstract
Underwater segmentation technology achieves effective monitoring of fish biological information through accurate identification of fish species and precise estimation of their quantities. It serves as an effective approach to enhance the informatization level of aquaculture and promote intelligent management in fisheries. However, the complex and diverse underwater environment, coupled with poor visibility, results in blurry and lower-quality underwater fish images. The current image segmentation methods, when applied to fish segmentation, exhibit low accuracy and inadequate generalization capabilities. This paper proposes an image segmentation model based on Atrous Spatial Pyramid Pooling (ASPP) to address these challenges. The model aims to improve fish feature extraction and enhance the segmentation precision of fish images in complex underwater environments, thereby enhancing the accuracy and generalization capabilities of existing fish segmentation models. First, a multiscale feature extraction module (triangular atrous spatial pyramid multifeature fusion) based on dilated convolutional spatial pyramid pooling, which enhances the extraction of high-level semantic features of images through the triangular combination of multilayer dilated convolutional pyramid pooling and the adaptive channel attention module, is proposed. Second, a spatial attention module based on strip pooling (atrous strip pooling) is proposed, which further expands the receptive field of the attention mechanism by combining different expansion rates, enhances the correlations between pixels, and effectively captures spatial information. Finally, a decoder module based on multilayer semantic feature fusion is proposed. Through the processing and fusion of medium-, low-, and high-level semantic features, the model understands image content and performs accurate pixel-level segmentation. The proposed model is evaluated using a VOC-compliant dataset created from underwater fish images and validated against public and specific underwater fish datasets. The results demonstrate the successful application of the feature extraction and feature fusion modules in underwater fish image segmentation, achieving an average Intersection over Union (MIoU) of 85.49% in segmentation tasks. Compared to conventional segmentation models, the proposed model shows significant improvements, with an average increase of 3.8% in MIoU and 2.5% in balanced F-score (F1-score). The accurate segmentation of underwater fish images and extraction of vital biological information by this model provide a solid foundation for intelligent monitoring, including fish length measurement, weight estimation, and analysis of growth and health status. Moreover, the model offers a scientific framework for decision-making in aquaculture, driving advancements in precision and intelligent management practices in the field. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF