Back to Search Start Over

Hierarchical Transformer Encoder With Structured Representation for Abstract Reasoning

Authors :
Jinwon An
Sungzoon Cho
Source :
IEEE Access, Vol 8, Pp 200229-200236 (2020)
Publication Year :
2020
Publisher :
IEEE, 2020.

Abstract

Abstract reasoning is one of the defining characteristics of human intelligence and can be estimated by visual IQ tests such as Raven's Progressive Matrices. In this paper, we propose using a hierarchical Transformer encoder with structured representation that employs a novel neural network architecture to improve both perception and reasoning in a visual IQ test. For perception, we used object detection models to extract the structured features. For reasoning, we used the Transformer encoder in a hierarchical manner that fits the structure of Raven's Progressive Matrices. Experimental results on the RAVEN dataset, which is one of the major large-scale datasets on Raven's Progressive Matrices, showed that our proposed architecture achieved an overall accuracy of 99.62%, which is an improvement of more than 8% points over CoPINet, the present-day, state-of-the-art neural network model.

Details

Language :
English
ISSN :
21693536
Volume :
8
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.4182c06df9494fdd879f9a2a2f54959f
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2020.3035463