Cargando…

Quantum-accessible reinforcement learning beyond strictly epochal environments

In recent years, quantum-enhanced machine learning has emerged as a particularly fruitful application of quantum algorithms, covering aspects of supervised, unsupervised and reinforcement learning. Reinforcement learning offers numerous options of how quantum theory can be applied, and is arguably t...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hamann, A., Dunjko, V., Wölk, S.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer International Publishing 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8550166/ https://www.ncbi.nlm.nih.gov/pubmed/34723097 http://dx.doi.org/10.1007/s42484-021-00049-7

_version_	1784590905017106432
author	Hamann, A. Dunjko, V. Wölk, S.
author_facet	Hamann, A. Dunjko, V. Wölk, S.
author_sort	Hamann, A.
collection	PubMed
description	In recent years, quantum-enhanced machine learning has emerged as a particularly fruitful application of quantum algorithms, covering aspects of supervised, unsupervised and reinforcement learning. Reinforcement learning offers numerous options of how quantum theory can be applied, and is arguably the least explored, from a quantum perspective. Here, an agent explores an environment and tries to find a behavior optimizing some figure of merit. Some of the first approaches investigated settings where this exploration can be sped-up, by considering quantum analogs of classical environments, which can then be queried in superposition. If the environments have a strict periodic structure in time (i.e. are strictly episodic), such environments can be effectively converted to conventional oracles encountered in quantum information. However, in general environments, we obtain scenarios that generalize standard oracle tasks. In this work, we consider one such generalization, where the environment is not strictly episodic, which is mapped to an oracle identification setting with a changing oracle. We analyze this case and show that standard amplitude-amplification techniques can, with minor modifications, still be applied to achieve quadratic speed-ups. In addition, we prove that an algorithm based on Grover iterations is optimal for oracle identification even if the oracle changes over time in a way that the “rewarded space” is monotonically increasing. This result constitutes one of the first generalizations of quantum-accessible reinforcement learning.
format	Online Article Text
id	pubmed-8550166
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Springer International Publishing
record_format	MEDLINE/PubMed
spelling	pubmed-85501662021-10-29 Quantum-accessible reinforcement learning beyond strictly epochal environments Hamann, A. Dunjko, V. Wölk, S. Quantum Mach Intell Research Article In recent years, quantum-enhanced machine learning has emerged as a particularly fruitful application of quantum algorithms, covering aspects of supervised, unsupervised and reinforcement learning. Reinforcement learning offers numerous options of how quantum theory can be applied, and is arguably the least explored, from a quantum perspective. Here, an agent explores an environment and tries to find a behavior optimizing some figure of merit. Some of the first approaches investigated settings where this exploration can be sped-up, by considering quantum analogs of classical environments, which can then be queried in superposition. If the environments have a strict periodic structure in time (i.e. are strictly episodic), such environments can be effectively converted to conventional oracles encountered in quantum information. However, in general environments, we obtain scenarios that generalize standard oracle tasks. In this work, we consider one such generalization, where the environment is not strictly episodic, which is mapped to an oracle identification setting with a changing oracle. We analyze this case and show that standard amplitude-amplification techniques can, with minor modifications, still be applied to achieve quadratic speed-ups. In addition, we prove that an algorithm based on Grover iterations is optimal for oracle identification even if the oracle changes over time in a way that the “rewarded space” is monotonically increasing. This result constitutes one of the first generalizations of quantum-accessible reinforcement learning. Springer International Publishing 2021-08-02 2021 /pmc/articles/PMC8550166/ /pubmed/34723097 http://dx.doi.org/10.1007/s42484-021-00049-7 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Research Article Hamann, A. Dunjko, V. Wölk, S. Quantum-accessible reinforcement learning beyond strictly epochal environments
title	Quantum-accessible reinforcement learning beyond strictly epochal environments
title_full	Quantum-accessible reinforcement learning beyond strictly epochal environments
title_fullStr	Quantum-accessible reinforcement learning beyond strictly epochal environments
title_full_unstemmed	Quantum-accessible reinforcement learning beyond strictly epochal environments
title_short	Quantum-accessible reinforcement learning beyond strictly epochal environments
title_sort	quantum-accessible reinforcement learning beyond strictly epochal environments
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8550166/ https://www.ncbi.nlm.nih.gov/pubmed/34723097 http://dx.doi.org/10.1007/s42484-021-00049-7
work_keys_str_mv	AT hamanna quantumaccessiblereinforcementlearningbeyondstrictlyepochalenvironments AT dunjkov quantumaccessiblereinforcementlearningbeyondstrictlyepochalenvironments AT wolks quantumaccessiblereinforcementlearningbeyondstrictlyepochalenvironments

Quantum-accessible reinforcement learning beyond strictly epochal environments

Ejemplares similares