Synchronous composition and semantic line detection based on cross-attention.
- Source :
- Multimedia Systems. Jun 2024, Vol. 30 Issue 3, p1-12. 12p.
- Publication Year :
- 2024
Abstract
- Composition detection and semantic line detection are important research topics in computer vision and play an important auxiliary role in the analysis of image esthetics. However, few researchers have so far studied the internal relationship between these two related tasks. To address this gap, we propose a cross-attention-based network that detects composition classes and semantic lines synchronously, so that the two tasks supervise and guide each other and each improves the other’s detection accuracy. First, a pre-trained composition detection model and a pre-trained semantic line detection model serve as two teacher models that provide composition and semantic line labels for the student model. Then, we train the student model with the help of the teacher models. The student model adopts a multi-task learning architecture that we propose, combining soft and hard parameter sharing. We also develop a cross-attention module to ensure that both tasks get the help and supervision they need from each other. Experimental results show that our method can draw semantic lines while detecting composition classes, which increases the interpretability of composition class detection. Our composition detection accuracy reaches 92.57%, and for benchmark semantic lines, our AUC_A reaches 92.00%. [ABSTRACT FROM AUTHOR]
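The abstract does not specify how the cross-attention module is built, so the following is only a minimal, dependency-free sketch of the general idea: features from one task act as queries over the features of the other task, in both directions. It uses plain scaled dot-product attention with a residual connection and no learned projections. The names `cross_attention` and `exchange`, the feature shapes, and the toy values are all assumptions made for illustration, not the authors' implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys_values):
    """Let each query vector attend over the other task's vectors.

    Hypothetical simplification: the same vectors serve as keys and values,
    and there are no learned projection matrices.
    """
    dim = len(queries[0])
    scale = 1.0 / math.sqrt(dim)
    outputs = []
    for q in queries:
        # Scaled dot-product score between this query and every key.
        scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys_values]
        weights = softmax(scores)
        # Weighted sum of the other task's vectors.
        attended = [
            sum(w * v[j] for w, v in zip(weights, keys_values)) for j in range(dim)
        ]
        # Residual connection keeps the task's own features.
        outputs.append([qi + ai for qi, ai in zip(q, attended)])
    return outputs


def exchange(composition_feats, line_feats):
    """Run cross-attention in both directions so each task sees the other."""
    new_composition = cross_attention(composition_feats, line_feats)
    new_lines = cross_attention(line_feats, composition_feats)
    return new_composition, new_lines


if __name__ == "__main__":
    # Toy features: 2 composition tokens and 3 semantic-line tokens, dimension 4.
    composition = [[0.1, 0.2, 0.3, 0.4], [0.5, 0.1, 0.0, 0.2]]
    lines = [[0.3, 0.3, 0.1, 0.0], [0.0, 0.4, 0.2, 0.1], [0.2, 0.0, 0.5, 0.3]]
    comp_out, line_out = exchange(composition, lines)
    print(len(comp_out), len(line_out))
```

Each branch keeps its own token count and feature dimension, so the outputs can be fed back into task-specific heads. A trained model would normally add learned query, key, and value projections, which are omitted here.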
Details
- Language :
- English
- ISSN :
- 09424962
- Volume :
- 30
- Issue :
- 3
- Database :
- Academic Search Index
- Journal :
- Multimedia Systems
- Publication Type :
- Academic Journal
- Accession number :
- 178307708
- Full Text :
- https://doi.org/10.1007/s00530-024-01307-x