Cargando…
Embodied intelligence via learning and evolution
The intertwined processes of learning and evolution in complex environmental niches have resulted in a remarkable diversity of morphological forms. Moreover, many aspects of animal intelligence are deeply embodied in these evolved morphologies. However, the principles governing relations between env...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8494941/ https://www.ncbi.nlm.nih.gov/pubmed/34615862 http://dx.doi.org/10.1038/s41467-021-25874-z |
_version_ | 1784579422046650368 |
---|---|
author | Gupta, Agrim Savarese, Silvio Ganguli, Surya Fei-Fei, Li |
author_facet | Gupta, Agrim Savarese, Silvio Ganguli, Surya Fei-Fei, Li |
author_sort | Gupta, Agrim |
collection | PubMed |
description | The intertwined processes of learning and evolution in complex environmental niches have resulted in a remarkable diversity of morphological forms. Moreover, many aspects of animal intelligence are deeply embodied in these evolved morphologies. However, the principles governing relations between environmental complexity, evolved morphology, and the learnability of intelligent control, remain elusive, because performing large-scale in silico experiments on evolution and learning is challenging. Here, we introduce Deep Evolutionary Reinforcement Learning (DERL): a computational framework which can evolve diverse agent morphologies to learn challenging locomotion and manipulation tasks in complex environments. Leveraging DERL we demonstrate several relations between environmental complexity, morphological intelligence and the learnability of control. First, environmental complexity fosters the evolution of morphological intelligence as quantified by the ability of a morphology to facilitate the learning of novel tasks. Second, we demonstrate a morphological Baldwin effect i.e., in our simulations evolution rapidly selects morphologies that learn faster, thereby enabling behaviors learned late in the lifetime of early ancestors to be expressed early in the descendants lifetime. Third, we suggest a mechanistic basis for the above relationships through the evolution of morphologies that are more physically stable and energy efficient, and can therefore facilitate learning and control. |
format | Online Article Text |
id | pubmed-8494941 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-84949412021-10-07 Embodied intelligence via learning and evolution Gupta, Agrim Savarese, Silvio Ganguli, Surya Fei-Fei, Li Nat Commun Article The intertwined processes of learning and evolution in complex environmental niches have resulted in a remarkable diversity of morphological forms. Moreover, many aspects of animal intelligence are deeply embodied in these evolved morphologies. However, the principles governing relations between environmental complexity, evolved morphology, and the learnability of intelligent control, remain elusive, because performing large-scale in silico experiments on evolution and learning is challenging. Here, we introduce Deep Evolutionary Reinforcement Learning (DERL): a computational framework which can evolve diverse agent morphologies to learn challenging locomotion and manipulation tasks in complex environments. Leveraging DERL we demonstrate several relations between environmental complexity, morphological intelligence and the learnability of control. First, environmental complexity fosters the evolution of morphological intelligence as quantified by the ability of a morphology to facilitate the learning of novel tasks. Second, we demonstrate a morphological Baldwin effect i.e., in our simulations evolution rapidly selects morphologies that learn faster, thereby enabling behaviors learned late in the lifetime of early ancestors to be expressed early in the descendants lifetime. Third, we suggest a mechanistic basis for the above relationships through the evolution of morphologies that are more physically stable and energy efficient, and can therefore facilitate learning and control. Nature Publishing Group UK 2021-10-06 /pmc/articles/PMC8494941/ /pubmed/34615862 http://dx.doi.org/10.1038/s41467-021-25874-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Gupta, Agrim Savarese, Silvio Ganguli, Surya Fei-Fei, Li Embodied intelligence via learning and evolution |
title | Embodied intelligence via learning and evolution |
title_full | Embodied intelligence via learning and evolution |
title_fullStr | Embodied intelligence via learning and evolution |
title_full_unstemmed | Embodied intelligence via learning and evolution |
title_short | Embodied intelligence via learning and evolution |
title_sort | embodied intelligence via learning and evolution |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8494941/ https://www.ncbi.nlm.nih.gov/pubmed/34615862 http://dx.doi.org/10.1038/s41467-021-25874-z |
work_keys_str_mv | AT guptaagrim embodiedintelligencevialearningandevolution AT savaresesilvio embodiedintelligencevialearningandevolution AT gangulisurya embodiedintelligencevialearningandevolution AT feifeili embodiedintelligencevialearningandevolution |