Back to Search Start Over

CAM : a Combined Attention Model for natural language inference

Authors :
Noura Al Moubayed
Steven Bradley
Sardar Jaf
A. Stephen McGough
Amit Gajbhiye
Abe, Naoki
Liu, Huan
Pu, Calton
Hu, Xiaohua
Ahmed, Nesreen
Qiao, Mu
Song, Yang
Kossmann, Donald
Liu, Bing
Lee, Kisung
Tang, Jiliang
He, Jingrui
Saltz, Jeffrey
Source :
Abe, Naoki & Liu, Huan & Pu, Calton & Hu, Xiaohua & Ahmed, Nesreen & Qiao, Mu & Song, Yang & Kossmann, Donald & Liu, Bing & Lee, Kisung & Tang, Jiliang & He, Jingrui & Saltz, Jeffrey (Eds.). (2018). 2018 IEEE International Conference on Big Data (Big Data) ; proceedings. Piscataway, N.J.: IEEE, pp. 1009-1014, IEEE BigData
Publication Year :
2018
Publisher :
IEEE, 2018.

Abstract

Natural Language Inference (NLI) is a fundamental step towards natural language understanding. The task aims to detect whether a premise entails or contradicts a given hypothesis. NLI contributes to a wide range of natural language understanding applications such as question answering, text summarization and information extraction. Recently, the public availability of big datasets such as Stanford Natural Language Inference (SNLI) and SciTail, has made it feasible to train complex neural NLI models. Particularly, Bidirectional Long Short-Term Memory networks (BiLSTMs) with attention mechanisms have shown promising performance for NLI. In this paper, we propose a Combined Attention Model (CAM) for NLI. CAM combines the two attention mechanisms: intra-attention and inter-attention. The model first captures the semantics of the individual input premise and hypothesis with intra-attention and then aligns the premise and hypothesis with inter-sentence attention. We evaluate CAM on two benchmark datasets: Stanford Natural Language Inference (SNLI) and SciTail, achieving 86.14% accuracy on SNLI and 77.23% on SciTail. Further, to investigate the effectiveness of individual attention mechanism and in combination with each other, we present an analysis showing that the intra- and inter-attention mechanisms achieve higher accuracy when they are combined together than when they are independently used.

Details

Database :
OpenAIRE
Journal :
Abe, Naoki & Liu, Huan & Pu, Calton & Hu, Xiaohua & Ahmed, Nesreen & Qiao, Mu & Song, Yang & Kossmann, Donald & Liu, Bing & Lee, Kisung & Tang, Jiliang & He, Jingrui & Saltz, Jeffrey (Eds.). (2018). 2018 IEEE International Conference on Big Data (Big Data) ; proceedings. Piscataway, N.J.: IEEE, pp. 1009-1014, IEEE BigData
Accession number :
edsair.doi.dedup.....4c6065f1df5751b0f46d1fb9e809ef46