Back to Search Start Over

Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation

Authors :
Hau Nguyen, Duc
Mallart, Cyrielle
Gravier, Guillaume
Sébillot, Pascale
Creating and exploiting explicit links between multimedia fragments (LinkMedia)
Inria Rennes – Bretagne Atlantique
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-SIGNAL, IMAGE ET LANGAGE (IRISA-D6)
Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA)
Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes)
Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-Institut National de Recherche en Informatique et en Automatique (Inria)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique)
Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Université de Rennes (UR)-Institut National des Sciences Appliquées - Rennes (INSA Rennes)
Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA)
Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA)-Université de Bretagne Sud (UBS)-École normale supérieure - Rennes (ENS Rennes)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique (IMT Atlantique)
Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)
A Symbolic and Human-centric view of dAta MANagement (SHAMAN)
GESTION DES DONNÉES ET DE LA CONNAISSANCE (IRISA-D7)
ANR-19-CE38-0011,ARCHIVAL,Valorisation d'archives multimédia : Compréhension automatique multimodale du langage pour de nouvelles interfaces intelligentes de médiation et de transmission des savoirs(2019)
Source :
Proceedings of 28th International Conference on Natural Language and Information Systems, Lecture Notes in Computer Science, Vol. 13913, NLDB 2023-28th International Conference on Natural Language and Information Systems, NLDB 2023-28th International Conference on Natural Language and Information Systems, Jun 2023, Derby, United Kingdom. pp.1-14
Publication Year :
2023
Publisher :
HAL CCSD, 2023.

Abstract

International audience; Attention mechanism is contributing to the majority of recent advances in machine learning for natural language processing. Additionally, it results in an attention map that shows the proportional influence of each input in its decision. Empirical studies postulate that attention maps can be provided as an explanation for model output. However, it is still questionable to ask whether this explanation helps regular people to understand and accept the model output (the plausibility of the explanation). Recent studies show that attention weights in RNN encoders are hardly plausible because they spread on input tokens. We thus propose three additional constraints to the learning objective function to improve the plausibility of the attention map: regularization to increase the attention weight sparsity, semi-supervision to supervise the map by a heuristic and supervision by human annotation. Results show that all techniques can improve the attention map plausibility at some level. We also observe that specific instructions for human annotation might have a negative effect on classification performance. Beyond the attention map, results on text classification tasks also show that the contextualization layer plays a crucial role in finding the right space for finding plausible tokens, no matter how constraints bring the gain.

Details

Language :
English
Database :
OpenAIRE
Journal :
Proceedings of 28th International Conference on Natural Language and Information Systems, Lecture Notes in Computer Science, Vol. 13913, NLDB 2023-28th International Conference on Natural Language and Information Systems, NLDB 2023-28th International Conference on Natural Language and Information Systems, Jun 2023, Derby, United Kingdom. pp.1-14
Accession number :
edsair.dedup.wf.001..d548fb85dc5589b01212354d2cfb33f7