Start Over

Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates.

Authors :: Moratelli, Nicholas
Barraco, Manuele
Morelli, Davide
Cornia, Marcella
Baraldi, Lorenzo
Cucchiara, Rita
Source :: Sensors (14248220). Feb2023, Vol. 23 Issue 3, p1286. 16p.
Publication Year :: 2023
Abstract: Research related to fashion and e-commerce domains is gaining attention in computer vision and multimedia communities. Following this trend, this article tackles the task of generating fine-grained and accurate natural language descriptions of fashion items, a recently-proposed and under-explored challenge that is still far from being solved. To overcome the limitations of previous approaches, a transformer-based captioning model was designed with the integration of external textual memory that could be accessed through k-nearest neighbor (kNN) searches. From an architectural point of view, the proposed transformer model can read and retrieve items from the external memory through cross-attention operations, and tune the flow of information coming from the external memory thanks to a novel fully attentive gate. Experimental analyses were carried out on the fashion captioning dataset (FACAD) for fashion image captioning, which contains more than 130k fine-grained descriptions, validating the effectiveness of the proposed approach and the proposed architectural strategies in comparison with carefully designed baselines and state-of-the-art approaches. The presented method constantly outperforms all compared approaches, demonstrating its effectiveness for fashion image captioning. [ABSTRACT FROM AUTHOR]

Subjects :: *K-nearest neighbor classification
*COMPUTER vision
*FASHION
*COMMUNITIES
*PRIVATE communities
*NATURAL languages

Details

Language :: English
ISSN :: 14248220
Volume :: 23
Issue :: 3
Database :: Academic Search Index
Journal :: Sensors (14248220)
Publication Type :: Academic Journal
Accession number :: 161874183
Full Text :: https://doi.org/10.3390/s23031286

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources