
Performance analysis of a hybrid agent for quantum-accessible reinforcement learning

Authors :
Arne Hamann
Sabine Wölk
Source :
New Journal of Physics, Vol 24, Iss 3, p 033044 (2022)
Publication Year :
2022
Publisher :
IOP Publishing, 2022.

Abstract

In the last decade, quantum machine learning has provided fascinating and fundamental improvements to supervised, unsupervised and reinforcement learning (RL). In RL, a so-called agent is challenged to solve a task given by some environment. The agent learns to solve the task by exploring the environment and exploiting the rewards it gets from the environment. For some classical task environments, an analogue quantum environment can be constructed which allows rewards to be found quadratically faster by applying quantum algorithms. In this paper, we analytically analyze the behavior of a hybrid agent which combines this quadratic speedup in exploration with the policy update of a classical agent. This leads to faster learning of the hybrid agent compared to the classical agent. We demonstrate that if the classical agent needs on average $\langle J\rangle$ rewards and $\langle T\rangle_{\mathrm{cl}}$ epochs to learn how to solve the task, the hybrid agent will take $\langle T\rangle_{\mathrm{q}} \leqslant \alpha_{s}\alpha_{o}\sqrt{\langle T\rangle_{\mathrm{cl}}\langle J\rangle}$ epochs on average. Here, $\alpha_{s}$ and $\alpha_{o}$ denote constants that depend on details of the quantum search and are independent of the problem size. Additionally, we prove that if the environment allows for at most $\alpha_{o}k_{\mathrm{max}}$ sequential coherent interactions, e.g. due to noise effects, an improvement given by $\langle T\rangle_{\mathrm{q}} \approx \alpha_{o}\langle T\rangle_{\mathrm{cl}}/(4k_{\mathrm{max}})$ is still possible.
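To give a feel for the two scaling relations stated in the abstract, the following minimal Python sketch evaluates them numerically. The function names and the example inputs are hypothetical. The abstract gives no values for the constants, so setting both to 1.0 is purely a placeholder assumption and not a value taken from the paper.

```python
import math


def hybrid_epoch_bound(t_cl, j, alpha_s, alpha_o):
    """Upper bound on the hybrid agent's average epochs:
    <T>_q <= alpha_s * alpha_o * sqrt(<T>_cl * <J>)."""
    return alpha_s * alpha_o * math.sqrt(t_cl * j)


def limited_coherence_estimate(t_cl, k_max, alpha_o):
    """Approximate average epochs when at most alpha_o * k_max sequential
    coherent interactions are possible:
    <T>_q ~ alpha_o * <T>_cl / (4 * k_max)."""
    return alpha_o * t_cl / (4 * k_max)


# Illustrative inputs only (assumptions, not results from the paper).
T_CL = 10_000   # assumed classical average number of epochs <T>_cl
J = 4           # assumed average number of rewards <J>
ALPHA_S = 1.0   # placeholder; the abstract gives no value for alpha_s
ALPHA_O = 1.0   # placeholder; the abstract gives no value for alpha_o
K_MAX = 25      # assumed coherence-limited parameter k_max

bound = hybrid_epoch_bound(T_CL, J, ALPHA_S, ALPHA_O)
limited = limited_coherence_estimate(T_CL, K_MAX, ALPHA_O)
print(f"unrestricted bound on <T>_q: {bound}")
print(f"coherence-limited estimate of <T>_q: {limited}")
```

With these placeholder constants, the first relation scales as the square root of the classical epoch count, while the coherence-limited relation stays linear in it but is reduced by a factor of order $k_{\mathrm{max}}$.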

Details

Language :
English
ISSN :
1367-2630
Volume :
24
Issue :
3
Database :
Directory of Open Access Journals
Journal :
New Journal of Physics
Publication Type :
Academic Journal
Accession number :
edsdoj.173a314c00ee46448751443bdc1a61f6
Document Type :
article
Full Text :
https://doi.org/10.1088/1367-2630/ac5b56