Cargando…
Experimental quantum speed-up in reinforcement learning agents
As the field of artificial intelligence advances, the demand for algorithms that can learn quickly and efficiently increases. An important paradigm within artificial intelligence is reinforcement learning [1], where decision-making entities called agents interact with environments and learn by updat...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7612051/ https://www.ncbi.nlm.nih.gov/pubmed/33692560 http://dx.doi.org/10.1038/s41586-021-03242-7 |
_version_ | 1783605328110157824 |
---|---|
author | Saggio, V. Asenbeck, B. E. Hamann, A. Strömberg, T. Schiansky, P. Dunjko, V. Friis, N. Harris, N. C. Hochberg, M. Englund, D. Wölk, S. Briegel, H. J. Walther, P. |
author_facet | Saggio, V. Asenbeck, B. E. Hamann, A. Strömberg, T. Schiansky, P. Dunjko, V. Friis, N. Harris, N. C. Hochberg, M. Englund, D. Wölk, S. Briegel, H. J. Walther, P. |
author_sort | Saggio, V. |
collection | PubMed |
description | As the field of artificial intelligence advances, the demand for algorithms that can learn quickly and efficiently increases. An important paradigm within artificial intelligence is reinforcement learning [1], where decision-making entities called agents interact with environments and learn by updating their behaviour based on obtained feedback. The crucial question for practical applications is how fast agents learn [2]. While various works have made use of quantum mechanics to speed up the agent’s decision-making process [3, 4], a reduction in learning time has not been demonstrated yet. Here, we present a reinforcement learning experiment where the learning process of an agent is sped up by utilizing a quantum communication channel with the environment. We further show that combining this scenario with classical communication enables the evaluation of such an improvement, and additionally allows for optimal control of the learning progress. We implement this learning protocol on a compact and fully tuneable integrated nanophotonic processor. The device interfaces with telecom-wavelength photons and features a fast active feedback mechanism, allowing us to demonstrate the agent’s systematic quantum ad-vantage in a setup that could be readily integrated within future large-scale quantum communication networks. |
format | Online Article Text |
id | pubmed-7612051 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
record_format | MEDLINE/PubMed |
spelling | pubmed-76120512021-11-29 Experimental quantum speed-up in reinforcement learning agents Saggio, V. Asenbeck, B. E. Hamann, A. Strömberg, T. Schiansky, P. Dunjko, V. Friis, N. Harris, N. C. Hochberg, M. Englund, D. Wölk, S. Briegel, H. J. Walther, P. Nature Article As the field of artificial intelligence advances, the demand for algorithms that can learn quickly and efficiently increases. An important paradigm within artificial intelligence is reinforcement learning [1], where decision-making entities called agents interact with environments and learn by updating their behaviour based on obtained feedback. The crucial question for practical applications is how fast agents learn [2]. While various works have made use of quantum mechanics to speed up the agent’s decision-making process [3, 4], a reduction in learning time has not been demonstrated yet. Here, we present a reinforcement learning experiment where the learning process of an agent is sped up by utilizing a quantum communication channel with the environment. We further show that combining this scenario with classical communication enables the evaluation of such an improvement, and additionally allows for optimal control of the learning progress. We implement this learning protocol on a compact and fully tuneable integrated nanophotonic processor. The device interfaces with telecom-wavelength photons and features a fast active feedback mechanism, allowing us to demonstrate the agent’s systematic quantum ad-vantage in a setup that could be readily integrated within future large-scale quantum communication networks. 2021-03-01 2021-03-10 /pmc/articles/PMC7612051/ /pubmed/33692560 http://dx.doi.org/10.1038/s41586-021-03242-7 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms |
spellingShingle | Article Saggio, V. Asenbeck, B. E. Hamann, A. Strömberg, T. Schiansky, P. Dunjko, V. Friis, N. Harris, N. C. Hochberg, M. Englund, D. Wölk, S. Briegel, H. J. Walther, P. Experimental quantum speed-up in reinforcement learning agents |
title | Experimental quantum speed-up in reinforcement learning
agents |
title_full | Experimental quantum speed-up in reinforcement learning
agents |
title_fullStr | Experimental quantum speed-up in reinforcement learning
agents |
title_full_unstemmed | Experimental quantum speed-up in reinforcement learning
agents |
title_short | Experimental quantum speed-up in reinforcement learning
agents |
title_sort | experimental quantum speed-up in reinforcement learning
agents |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7612051/ https://www.ncbi.nlm.nih.gov/pubmed/33692560 http://dx.doi.org/10.1038/s41586-021-03242-7 |
work_keys_str_mv | AT saggiov experimentalquantumspeedupinreinforcementlearningagents AT asenbeckbe experimentalquantumspeedupinreinforcementlearningagents AT hamanna experimentalquantumspeedupinreinforcementlearningagents AT strombergt experimentalquantumspeedupinreinforcementlearningagents AT schianskyp experimentalquantumspeedupinreinforcementlearningagents AT dunjkov experimentalquantumspeedupinreinforcementlearningagents AT friisn experimentalquantumspeedupinreinforcementlearningagents AT harrisnc experimentalquantumspeedupinreinforcementlearningagents AT hochbergm experimentalquantumspeedupinreinforcementlearningagents AT englundd experimentalquantumspeedupinreinforcementlearningagents AT wolks experimentalquantumspeedupinreinforcementlearningagents AT briegelhj experimentalquantumspeedupinreinforcementlearningagents AT waltherp experimentalquantumspeedupinreinforcementlearningagents |