Back to Search Start Over

Image captioning for Brazilian Portuguese using GRIT model

Authors :
de Alencar, Rafael Silva
CastaƱeda, William Alberto Cruz
Amadeus, Marcellus
Publication Year :
2024

Abstract

This work presents the early development of a model of image captioning for the Brazilian Portuguese language. We used the GRIT (Grid - and Region-based Image captioning Transformer) model to accomplish this work. GRIT is a Transformer-only neural architecture that effectively utilizes two visual features to generate better captions. The GRIT method emerged as a proposal to be a more efficient way to generate image captioning. In this work, we adapt the GRIT model to be trained in a Brazilian Portuguese dataset to have an image captioning method for the Brazilian Portuguese Language.<br />Comment: arXiv admin note: text overlap with arXiv:2207.09666 by other authors

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2402.05106
Document Type :
Working Paper