Start Over

DeepFocus: A visual focus of attention detection framework using deep learning in multi-object scenarios

Authors :: Sadia Afroze
Md. Rajib Hossain
Mohammed Moshiul Hoque
Source :: Journal of King Saud University: Computer and Information Sciences, Vol 34, Iss 10, Pp 10109-10124 (2022)
Publication Year :: 2022
Publisher :: Elsevier, 2022.
Abstract: In recent years, recognizing the visual focus of attention (VFoA) has attracted much attention among computer vision experts due to its various Human–Computer Interaction (HCI) or Human-Robot Interaction (HRI) applications. Although eye gaze is a potential cue to determine someone’s focus of attention (FOA), it is challenging to determine FOA alone when the interacting partners are far away or the camera cannot capture high-resolution images from long-distance. Therefore, the head pose can be used as an approximation to recognize the focus of someone’s attention. This paper proposes a vision-based framework to detect the FOA of humans using nine head poses consisting of four main modules: face detection and facial key-point selection (FDKPSM), head pose classification (HPCM), object localization and classification (OLCM), and focus of attention estimation (FoAEM). The FDKPSM uses the Multi-task Cascaded Neural Network (MTCNN) framework to detect head poses, and the HPCM classifies them into nine classes using the ResNet18. To estimate the FoA, the FoAEM uses a mapping Algorithm (EFoA) which integrates head poses on the focused object. Experimental results show that the proposed model outperformed other deep learning models by achieving the highest accuracy on three datasets: BIWI-M (96.97%), Pointing’04-M (96.04%) and HPoD 9 (98.99%). The visual focus of the attention model gained an accuracy of 94.12% in the multi-object scenario.

Subjects :: Human–computer interaction
Computer vision
Head pose categorization
Focus of attention
Deep learning
Electronic computers. Computer science
QA75.5-76.95

Details

Language :: English
ISSN :: 13191578
Volume :: 34
Issue :: 10
Database :: Directory of Open Access Journals
Journal :: Journal of King Saud University: Computer and Information Sciences
Publication Type :: Academic Journal
Accession number :: edsdoj.9c99b48ae70c4f7c94b251495e05f776
Document Type :: article
Full Text :: https://doi.org/10.1016/j.jksuci.2022.10.009

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

DeepFocus: A visual focus of attention detection framework using deep learning in multi-object scenarios

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

DeepFocus: A visual focus of attention detection framework using deep learning in multi-object scenarios

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources