Cargando…
Continuous learning of emergent behavior in robotic matter
One of the main challenges in robotics is the development of systems that can adapt to their environment and achieve autonomous behavior. Current approaches typically aim to achieve this by increasing the complexity of the centralized controller by, e.g., direct modeling of their behavior, or implem...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Academy of Sciences
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166149/ https://www.ncbi.nlm.nih.gov/pubmed/33972408 http://dx.doi.org/10.1073/pnas.2017015118 |
_version_ | 1783701452444663808 |
---|---|
author | Oliveri, Giorgio van Laake, Lucas C. Carissimo, Cesare Miette, Clara Overvelde, Johannes T. B. |
author_facet | Oliveri, Giorgio van Laake, Lucas C. Carissimo, Cesare Miette, Clara Overvelde, Johannes T. B. |
author_sort | Oliveri, Giorgio |
collection | PubMed |
description | One of the main challenges in robotics is the development of systems that can adapt to their environment and achieve autonomous behavior. Current approaches typically aim to achieve this by increasing the complexity of the centralized controller by, e.g., direct modeling of their behavior, or implementing machine learning. In contrast, we simplify the controller using a decentralized and modular approach, with the aim of finding specific requirements needed for a robust and scalable learning strategy in robots. To achieve this, we conducted experiments and simulations on a specific robotic platform assembled from identical autonomous units that continuously sense their environment and react to it. By letting each unit adapt its behavior independently using a basic Monte Carlo scheme, the assembled system is able to learn and maintain optimal behavior in a dynamic environment as long as its memory is representative of the current environment, even when incurring damage. We show that the physical connection between the units is enough to achieve learning, and no additional communication or centralized information is required. As a result, such a distributed learning approach can be easily scaled to larger assemblies, blurring the boundaries between materials and robots, paving the way for a new class of modular “robotic matter” that can autonomously learn to thrive in dynamic or unfamiliar situations, for example, encountered by soft robots or self-assembled (micro)robots in various environments spanning from the medical realm to space explorations. |
format | Online Article Text |
id | pubmed-8166149 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | National Academy of Sciences |
record_format | MEDLINE/PubMed |
spelling | pubmed-81661492021-06-10 Continuous learning of emergent behavior in robotic matter Oliveri, Giorgio van Laake, Lucas C. Carissimo, Cesare Miette, Clara Overvelde, Johannes T. B. Proc Natl Acad Sci U S A Physical Sciences One of the main challenges in robotics is the development of systems that can adapt to their environment and achieve autonomous behavior. Current approaches typically aim to achieve this by increasing the complexity of the centralized controller by, e.g., direct modeling of their behavior, or implementing machine learning. In contrast, we simplify the controller using a decentralized and modular approach, with the aim of finding specific requirements needed for a robust and scalable learning strategy in robots. To achieve this, we conducted experiments and simulations on a specific robotic platform assembled from identical autonomous units that continuously sense their environment and react to it. By letting each unit adapt its behavior independently using a basic Monte Carlo scheme, the assembled system is able to learn and maintain optimal behavior in a dynamic environment as long as its memory is representative of the current environment, even when incurring damage. We show that the physical connection between the units is enough to achieve learning, and no additional communication or centralized information is required. As a result, such a distributed learning approach can be easily scaled to larger assemblies, blurring the boundaries between materials and robots, paving the way for a new class of modular “robotic matter” that can autonomously learn to thrive in dynamic or unfamiliar situations, for example, encountered by soft robots or self-assembled (micro)robots in various environments spanning from the medical realm to space explorations. National Academy of Sciences 2021-05-25 2021-05-10 /pmc/articles/PMC8166149/ /pubmed/33972408 http://dx.doi.org/10.1073/pnas.2017015118 Text en Copyright © 2021 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) . |
spellingShingle | Physical Sciences Oliveri, Giorgio van Laake, Lucas C. Carissimo, Cesare Miette, Clara Overvelde, Johannes T. B. Continuous learning of emergent behavior in robotic matter |
title | Continuous learning of emergent behavior in robotic matter |
title_full | Continuous learning of emergent behavior in robotic matter |
title_fullStr | Continuous learning of emergent behavior in robotic matter |
title_full_unstemmed | Continuous learning of emergent behavior in robotic matter |
title_short | Continuous learning of emergent behavior in robotic matter |
title_sort | continuous learning of emergent behavior in robotic matter |
topic | Physical Sciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166149/ https://www.ncbi.nlm.nih.gov/pubmed/33972408 http://dx.doi.org/10.1073/pnas.2017015118 |
work_keys_str_mv | AT oliverigiorgio continuouslearningofemergentbehaviorinroboticmatter AT vanlaakelucasc continuouslearningofemergentbehaviorinroboticmatter AT carissimocesare continuouslearningofemergentbehaviorinroboticmatter AT mietteclara continuouslearningofemergentbehaviorinroboticmatter AT overveldejohannestb continuouslearningofemergentbehaviorinroboticmatter |