
Representation Learning of Logic Words by an RNN: From Word Sequences to Robot Actions.

Authors :
Yamada T
Murata S
Arie H
Ogata T
Source :
Frontiers in neurorobotics [Front Neurorobot] 2017 Dec 22; Vol. 11, pp. 70. Date of Electronic Publication: 2017 Dec 22 (Print Publication: 2017).
Publication Year :
2017

Abstract

An important characteristic of human language is compositionality. We can efficiently express a wide variety of real-world situations, events, and behaviors by compositionally constructing the meaning of a complex expression from a finite number of elements. Previous studies have analyzed how machine-learning models, particularly neural networks, can learn from experience to represent compositional relationships between language and robot actions, with the aim of understanding the symbol grounding structure and achieving intelligent communicative agents. Such studies have mainly dealt with words (nouns, adjectives, and verbs) that directly refer to real-world matters. In addition to these words, the current study simultaneously deals with logic words such as "not," "and," and "or." These words do not refer directly to the real world; rather, they are logical operators that contribute to the construction of meaning in sentences. In human-robot communication, these words are likely to be used often. The current study builds a recurrent neural network model with long short-term memory units and trains it to translate sentences that include logic words into robot actions. We investigate what kind of compositional representations, mediating between sentences and robot actions, emerge as the network's internal states through the learning process. Analysis after learning shows that referential words are merged with visual information and the robot's own current state, whereas the logic words are represented by the model in accordance with their functions as logical operators. Words such as "true," "false," and "not" work as non-linear transformations that encode orthogonal phrases into the same area of the memory-cell state space. The word "and," which required the robot to lift both its hands, worked as if it were a universal quantifier. The word "or," which required apparently random action generation, was represented as an unstable region of the network's dynamical system.
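The abstract describes an LSTM-based model that maps a word sequence, fused with the robot's visual and proprioceptive state, to robot actions, and analyzes the resulting memory-cell states. The following is a minimal sketch of such a sequence-to-action LSTM, not the authors' implementation; the vocabulary size, sensory feature dimensionality, hidden size, and joint-command dimensionality are illustrative assumptions.

```python
# Minimal sketch (illustrative, not the paper's code): an LSTM that reads a
# tokenized instruction together with a visual/proprioceptive feature vector
# at each step and outputs a sequence of robot joint commands.
import torch
import torch.nn as nn


class Seq2ActionLSTM(nn.Module):
    def __init__(self, vocab_size=30, embed_dim=16, state_dim=10,
                 hidden_dim=64, action_dim=8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        # Each time step sees the current word embedding concatenated with
        # the robot's current sensory state (vision + proprioception).
        self.lstm = nn.LSTM(embed_dim + state_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, action_dim)

    def forward(self, word_ids, robot_state):
        # word_ids:    (batch, seq_len) token indices
        # robot_state: (batch, seq_len, state_dim) sensory features per step
        words = self.embed(word_ids)
        inputs = torch.cat([words, robot_state], dim=-1)
        hidden, (h_n, c_n) = self.lstm(inputs)
        # c_n is the memory-cell state; in the paper, the internal states of
        # the trained network are analyzed to see how logic words are encoded.
        actions = self.head(hidden)  # predicted joint targets at each step
        return actions, c_n


# Toy usage: one instruction of length 5 with a 10-dim sensory vector per step.
model = Seq2ActionLSTM()
tokens = torch.randint(0, 30, (1, 5))
sensors = torch.randn(1, 5, 10)
actions, cell_state = model(tokens, sensors)
print(actions.shape, cell_state.shape)  # (1, 5, 8) and (1, 1, 64)
```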

Details

Language :
English
ISSN :
1662-5218
Volume :
11
Database :
MEDLINE
Journal :
Frontiers in neurorobotics
Publication Type :
Academic Journal
Accession number :
29311891
Full Text :
https://doi.org/10.3389/fnbot.2017.00070