Back to Search Start Over

EaSe: A Diagnostic Tool for VQA Based on Answer Diversity

Authors :
Jolly, S.
Pezzelle, S.
Nabi, M.
Toutanova, K.
Rumshisky, A.
Zettlemoyer, L.
Hakkani-Tur, D.
Beltagy, I.
Bethard, S.
Cotterell, R.
Chakraborty, T.
Zhou, Y.
Language and Computation (ILLC, FNWI/FGw)
ILLC (FNWI)
Source :
The 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: NAACL-HLT 2021 : proceedings of the conference : June 6-11, 2021, 2407-2414, STARTPAGE=2407;ENDPAGE=2414;TITLE=The 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT
Publication Year :
2021

Abstract

We propose EASE, a simple diagnostic tool for Visual Question Answering (VQA) which quantifies the difficulty of an image, question sample. EASE is based on the pattern of answers provided by multiple annotators to a given question. In particular, it considers two aspects of the answers: (i) their Entropy; (ii) their Semantic content. First, we prove the validity of our diagnostic to identify samples that are easy/hard for state-of-art VQA models. Second, we show that EASE can be successfully used to select the most-informative samples for training/fine-tuning. Crucially, only information that is readily available in any VQA dataset is used to compute its scores.

Details

Language :
English
Database :
OpenAIRE
Journal :
The 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: NAACL-HLT 2021 : proceedings of the conference : June 6-11, 2021, 2407-2414, STARTPAGE=2407;ENDPAGE=2414;TITLE=The 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT
Accession number :
edsair.doi.dedup.....74ee792406ed3d7c28443ddd95adc13a
Full Text :
https://doi.org/10.18653/v1/2021.naacl-main.192