Cargando…

Experimental quantum speed-up in reinforcement learning agents

As the field of artificial intelligence advances, the demand for algorithms that can learn quickly and efficiently increases. An important paradigm within artificial intelligence is reinforcement learning [1], where decision-making entities called agents interact with environments and learn by updat...

Descripción completa

Detalles Bibliográficos
Autores principales: Saggio, V., Asenbeck, B. E., Hamann, A., Strömberg, T., Schiansky, P., Dunjko, V., Friis, N., Harris, N. C., Hochberg, M., Englund, D., Wölk, S., Briegel, H. J., Walther, P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7612051/
https://www.ncbi.nlm.nih.gov/pubmed/33692560
http://dx.doi.org/10.1038/s41586-021-03242-7
_version_ 1783605328110157824
author Saggio, V.
Asenbeck, B. E.
Hamann, A.
Strömberg, T.
Schiansky, P.
Dunjko, V.
Friis, N.
Harris, N. C.
Hochberg, M.
Englund, D.
Wölk, S.
Briegel, H. J.
Walther, P.
author_facet Saggio, V.
Asenbeck, B. E.
Hamann, A.
Strömberg, T.
Schiansky, P.
Dunjko, V.
Friis, N.
Harris, N. C.
Hochberg, M.
Englund, D.
Wölk, S.
Briegel, H. J.
Walther, P.
author_sort Saggio, V.
collection PubMed
description As the field of artificial intelligence advances, the demand for algorithms that can learn quickly and efficiently increases. An important paradigm within artificial intelligence is reinforcement learning [1], where decision-making entities called agents interact with environments and learn by updating their behaviour based on obtained feedback. The crucial question for practical applications is how fast agents learn [2]. While various works have made use of quantum mechanics to speed up the agent’s decision-making process [3, 4], a reduction in learning time has not been demonstrated yet. Here, we present a reinforcement learning experiment where the learning process of an agent is sped up by utilizing a quantum communication channel with the environment. We further show that combining this scenario with classical communication enables the evaluation of such an improvement, and additionally allows for optimal control of the learning progress. We implement this learning protocol on a compact and fully tuneable integrated nanophotonic processor. The device interfaces with telecom-wavelength photons and features a fast active feedback mechanism, allowing us to demonstrate the agent’s systematic quantum ad-vantage in a setup that could be readily integrated within future large-scale quantum communication networks.
format Online
Article
Text
id pubmed-7612051
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-76120512021-11-29 Experimental quantum speed-up in reinforcement learning agents Saggio, V. Asenbeck, B. E. Hamann, A. Strömberg, T. Schiansky, P. Dunjko, V. Friis, N. Harris, N. C. Hochberg, M. Englund, D. Wölk, S. Briegel, H. J. Walther, P. Nature Article As the field of artificial intelligence advances, the demand for algorithms that can learn quickly and efficiently increases. An important paradigm within artificial intelligence is reinforcement learning [1], where decision-making entities called agents interact with environments and learn by updating their behaviour based on obtained feedback. The crucial question for practical applications is how fast agents learn [2]. While various works have made use of quantum mechanics to speed up the agent’s decision-making process [3, 4], a reduction in learning time has not been demonstrated yet. Here, we present a reinforcement learning experiment where the learning process of an agent is sped up by utilizing a quantum communication channel with the environment. We further show that combining this scenario with classical communication enables the evaluation of such an improvement, and additionally allows for optimal control of the learning progress. We implement this learning protocol on a compact and fully tuneable integrated nanophotonic processor. The device interfaces with telecom-wavelength photons and features a fast active feedback mechanism, allowing us to demonstrate the agent’s systematic quantum ad-vantage in a setup that could be readily integrated within future large-scale quantum communication networks. 2021-03-01 2021-03-10 /pmc/articles/PMC7612051/ /pubmed/33692560 http://dx.doi.org/10.1038/s41586-021-03242-7 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Saggio, V.
Asenbeck, B. E.
Hamann, A.
Strömberg, T.
Schiansky, P.
Dunjko, V.
Friis, N.
Harris, N. C.
Hochberg, M.
Englund, D.
Wölk, S.
Briegel, H. J.
Walther, P.
Experimental quantum speed-up in reinforcement learning agents
title Experimental quantum speed-up in reinforcement learning agents
title_full Experimental quantum speed-up in reinforcement learning agents
title_fullStr Experimental quantum speed-up in reinforcement learning agents
title_full_unstemmed Experimental quantum speed-up in reinforcement learning agents
title_short Experimental quantum speed-up in reinforcement learning agents
title_sort experimental quantum speed-up in reinforcement learning agents
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7612051/
https://www.ncbi.nlm.nih.gov/pubmed/33692560
http://dx.doi.org/10.1038/s41586-021-03242-7
work_keys_str_mv AT saggiov experimentalquantumspeedupinreinforcementlearningagents
AT asenbeckbe experimentalquantumspeedupinreinforcementlearningagents
AT hamanna experimentalquantumspeedupinreinforcementlearningagents
AT strombergt experimentalquantumspeedupinreinforcementlearningagents
AT schianskyp experimentalquantumspeedupinreinforcementlearningagents
AT dunjkov experimentalquantumspeedupinreinforcementlearningagents
AT friisn experimentalquantumspeedupinreinforcementlearningagents
AT harrisnc experimentalquantumspeedupinreinforcementlearningagents
AT hochbergm experimentalquantumspeedupinreinforcementlearningagents
AT englundd experimentalquantumspeedupinreinforcementlearningagents
AT wolks experimentalquantumspeedupinreinforcementlearningagents
AT briegelhj experimentalquantumspeedupinreinforcementlearningagents
AT waltherp experimentalquantumspeedupinreinforcementlearningagents