Cargando…

Continuous learning of emergent behavior in robotic matter

One of the main challenges in robotics is the development of systems that can adapt to their environment and achieve autonomous behavior. Current approaches typically aim to achieve this by increasing the complexity of the centralized controller by, e.g., direct modeling of their behavior, or implem...

Descripción completa

Detalles Bibliográficos
Autores principales: Oliveri, Giorgio, van Laake, Lucas C., Carissimo, Cesare, Miette, Clara, Overvelde, Johannes T. B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166149/
https://www.ncbi.nlm.nih.gov/pubmed/33972408
http://dx.doi.org/10.1073/pnas.2017015118
_version_ 1783701452444663808
author Oliveri, Giorgio
van Laake, Lucas C.
Carissimo, Cesare
Miette, Clara
Overvelde, Johannes T. B.
author_facet Oliveri, Giorgio
van Laake, Lucas C.
Carissimo, Cesare
Miette, Clara
Overvelde, Johannes T. B.
author_sort Oliveri, Giorgio
collection PubMed
description One of the main challenges in robotics is the development of systems that can adapt to their environment and achieve autonomous behavior. Current approaches typically aim to achieve this by increasing the complexity of the centralized controller by, e.g., direct modeling of their behavior, or implementing machine learning. In contrast, we simplify the controller using a decentralized and modular approach, with the aim of finding specific requirements needed for a robust and scalable learning strategy in robots. To achieve this, we conducted experiments and simulations on a specific robotic platform assembled from identical autonomous units that continuously sense their environment and react to it. By letting each unit adapt its behavior independently using a basic Monte Carlo scheme, the assembled system is able to learn and maintain optimal behavior in a dynamic environment as long as its memory is representative of the current environment, even when incurring damage. We show that the physical connection between the units is enough to achieve learning, and no additional communication or centralized information is required. As a result, such a distributed learning approach can be easily scaled to larger assemblies, blurring the boundaries between materials and robots, paving the way for a new class of modular “robotic matter” that can autonomously learn to thrive in dynamic or unfamiliar situations, for example, encountered by soft robots or self-assembled (micro)robots in various environments spanning from the medical realm to space explorations.
format Online
Article
Text
id pubmed-8166149
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-81661492021-06-10 Continuous learning of emergent behavior in robotic matter Oliveri, Giorgio van Laake, Lucas C. Carissimo, Cesare Miette, Clara Overvelde, Johannes T. B. Proc Natl Acad Sci U S A Physical Sciences One of the main challenges in robotics is the development of systems that can adapt to their environment and achieve autonomous behavior. Current approaches typically aim to achieve this by increasing the complexity of the centralized controller by, e.g., direct modeling of their behavior, or implementing machine learning. In contrast, we simplify the controller using a decentralized and modular approach, with the aim of finding specific requirements needed for a robust and scalable learning strategy in robots. To achieve this, we conducted experiments and simulations on a specific robotic platform assembled from identical autonomous units that continuously sense their environment and react to it. By letting each unit adapt its behavior independently using a basic Monte Carlo scheme, the assembled system is able to learn and maintain optimal behavior in a dynamic environment as long as its memory is representative of the current environment, even when incurring damage. We show that the physical connection between the units is enough to achieve learning, and no additional communication or centralized information is required. As a result, such a distributed learning approach can be easily scaled to larger assemblies, blurring the boundaries between materials and robots, paving the way for a new class of modular “robotic matter” that can autonomously learn to thrive in dynamic or unfamiliar situations, for example, encountered by soft robots or self-assembled (micro)robots in various environments spanning from the medical realm to space explorations. National Academy of Sciences 2021-05-25 2021-05-10 /pmc/articles/PMC8166149/ /pubmed/33972408 http://dx.doi.org/10.1073/pnas.2017015118 Text en Copyright © 2021 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Physical Sciences
Oliveri, Giorgio
van Laake, Lucas C.
Carissimo, Cesare
Miette, Clara
Overvelde, Johannes T. B.
Continuous learning of emergent behavior in robotic matter
title Continuous learning of emergent behavior in robotic matter
title_full Continuous learning of emergent behavior in robotic matter
title_fullStr Continuous learning of emergent behavior in robotic matter
title_full_unstemmed Continuous learning of emergent behavior in robotic matter
title_short Continuous learning of emergent behavior in robotic matter
title_sort continuous learning of emergent behavior in robotic matter
topic Physical Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8166149/
https://www.ncbi.nlm.nih.gov/pubmed/33972408
http://dx.doi.org/10.1073/pnas.2017015118
work_keys_str_mv AT oliverigiorgio continuouslearningofemergentbehaviorinroboticmatter
AT vanlaakelucasc continuouslearningofemergentbehaviorinroboticmatter
AT carissimocesare continuouslearningofemergentbehaviorinroboticmatter
AT mietteclara continuouslearningofemergentbehaviorinroboticmatter
AT overveldejohannestb continuouslearningofemergentbehaviorinroboticmatter