Back to Search Start Over

Features to Text: A Comprehensive Survey of Deep Learning on Semantic Segmentation and Image Captioning

Authors :
Ariyo Oluwasammi
Muhammad Umar Aftab
Zhiguang Qin
Son Tung Ngo
Thang Van Doan
Son Ba Nguyen
Son Hoang Nguyen
Giang Hoang Nguyen
Source :
Complexity, Vol 2021 (2021)
Publication Year :
2021
Publisher :
Hindawi-Wiley, 2021.

Abstract

With the emergence of deep learning, computer vision has witnessed extensive advancement and has seen immense applications in multiple domains. Specifically, image captioning has become an attractive focal direction for most machine learning experts, which includes the prerequisite of object identification, location, and semantic understanding. In this paper, semantic segmentation and image captioning are comprehensively investigated based on traditional and state-of-the-art methodologies. In this survey, we deliberate on the use of deep learning techniques on the segmentation analysis of both 2D and 3D images using a fully convolutional network and other high-level hierarchical feature extraction methods. First, each domain’s preliminaries and concept are described, and then semantic segmentation is discussed alongside its relevant features, available datasets, and evaluation criteria. Also, the semantic information capturing of objects and their attributes is presented in relation to their annotation generation. Finally, analysis of the existing methods, their contributions, and relevance are highlighted, informing the importance of these methods and illuminating a possible research continuation for the application of semantic image segmentation and image captioning approaches.

Details

Language :
English
ISSN :
10762787 and 10990526
Volume :
2021
Database :
Directory of Open Access Journals
Journal :
Complexity
Publication Type :
Academic Journal
Accession number :
edsdoj.bbf0293988b0454181d97d3a1c9fae6e
Document Type :
article
Full Text :
https://doi.org/10.1155/2021/5538927