Back to Search Start Over

Synchronous composition and semantic line detection based on cross-attention.

Authors :
Hou, Qinggang
Ke, Yongzhen
Wang, Kai
Qin, Fan
Wang, Yaoting
Source :
Multimedia Systems. Jun2024, Vol. 30 Issue 3, p1-12. 12p.
Publication Year :
2024

Abstract

Composition detection and semantic line detection are important research topics in computer vision and play an important auxiliary role in the analysis of image esthetics. However, at present, few researchers have considered the internal relationship between these two related tasks for comprehensive research. In order to solve this problem, we propose a synchronous detection network of composition class and semantic lines based on cross-attention, which can realize the mutual supervision and guidance between composition class detection and semantic line detection, to improve the accuracy of each other’s detection. First, the pre-trained composition detection model and the pre-trained semantic line detection model as two teacher models to provide data labels of composition and semantic line information for the student model. Then, we train a student model with the help of the teacher model. The student model adopts the multi-task learning architecture by combining soft and hard parameter sharing, as we propose. At the same time, we develop a cross-attention module to ensure that both tasks get the help and supervision they need from each other. Experimental results show that our method can draw semantic lines while detecting composition classes, which increases the interpretability of composition class detection. Our composition detection accuracy reaches 92.57%, and for benchmark semantic lines, the accuracy of our AUC_A metric can reach 92.00%. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09424962
Volume :
30
Issue :
3
Database :
Academic Search Index
Journal :
Multimedia Systems
Publication Type :
Academic Journal
Accession number :
178307708
Full Text :
https://doi.org/10.1007/s00530-024-01307-x