Start Over

The Hierarchical Continuous Pursuit Learning Automation for Large Numbers of Actions

Authors :: B. John Oommen
Xuan Zhang
Anis Yazidi
Lei Jiao
Oslo and Akershus University College of Applied Sciences [Oslo] (HiOA)
University of Agder (UIA)
Carleton University
Lazaros Iliadis
Ilias Maglogiannis
Vassilis Plagianakos
TC 12
WG 12.5
Source :: IFIP Advances in Information and Communication Technology ISBN: 9783319920061, AIAI, IFIP Advances in Information and Communication Technology, 14th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), 14th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), May 2018, Rhodes, Greece. pp.451-461, ⟨10.1007/978-3-319-92007-8_38⟩
Publication Year :: 2018
Publisher :: Springer International Publishing, 2018.
Abstract: Part 10: Learning - Intelligence; International audience; Although the field of Learning Automata (LA) has made significant progress in the last four decades, the LA-based methods to tackle problems involving environments with a large number of actions are, in reality, relatively unresolved. The extension of the traditional LA (fixed structure, variable structure, discretized, and pursuit) to problems within this domain cannot be easily established when the number of actions is very large. This is because the dimensionality of the action probability vector is correspondingly large, and consequently, most components of the vector will, after a relatively short time, have values that are smaller than the machine accuracy permits, implying that they will never be chosen. This paper pioneers a solution that extends the continuous pursuit paradigm to such large-actioned problem domains. The beauty of the solution is that it is hierarchical, where all the actions offered by the environment reside as leaves of the hierarchy. Further, at every level, we merely require a two-action LA which automatically resolves the problem of dealing with arbitrarily small action probabilities. Additionally, since all the LA invoke the pursuit paradigm, the best action at every level trickles up towards the root. Thus, by invoking the property of the “max” operator, in which, the maximum of numerous maxima is the overall maximum, the hierarchy of LA converges to the optimal action. Apart from reporting the theoretical properties of the scheme, the paper contains extensive experimental results which demonstrate the power of the scheme and its computational advantages. As far as we know, there are no comparable results in the field of LA.

Details

ISBN :: 978-3-319-92006-1
ISBNs :: 9783319920061
Database :: OpenAIRE
Journal :: IFIP Advances in Information and Communication Technology ISBN: 9783319920061, AIAI, IFIP Advances in Information and Communication Technology, 14th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), 14th IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAI), May 2018, Rhodes, Greece. pp.451-461, ⟨10.1007/978-3-319-92007-8_38⟩
Accession number :: edsair.doi.dedup.....9a65b593e78722a378da2528ef1f52f5
Full Text :: https://doi.org/10.1007/978-3-319-92007-8_38

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

The Hierarchical Continuous Pursuit Learning Automation for Large Numbers of Actions

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

The Hierarchical Continuous Pursuit Learning Automation for Large Numbers of Actions

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources