Cargando…

Neural Flocking: MPC-Based Supervised Learning of Flocking Controllers

We show how a symmetric and fully distributed flocking controller can be synthesized using Deep Learning from a centralized flocking controller. Our approach is based on Supervised Learning, with the centralized controller providing the training data, in the form of trajectories of state-action pair...

Descripción completa

Detalles Bibliográficos
Autores principales:	Mehmood, Usama, Roy, Shouvik, Grosu, Radu, Smolka, Scott A., Stoller, Scott D., Tiwari, Ashish
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7788607/ http://dx.doi.org/10.1007/978-3-030-45231-5_1

_version_	1783633062682165248
author	Mehmood, Usama Roy, Shouvik Grosu, Radu Smolka, Scott A. Stoller, Scott D. Tiwari, Ashish
author_facet	Mehmood, Usama Roy, Shouvik Grosu, Radu Smolka, Scott A. Stoller, Scott D. Tiwari, Ashish
author_sort	Mehmood, Usama
collection	PubMed
description	We show how a symmetric and fully distributed flocking controller can be synthesized using Deep Learning from a centralized flocking controller. Our approach is based on Supervised Learning, with the centralized controller providing the training data, in the form of trajectories of state-action pairs. We use Model Predictive Control (MPC) for the centralized controller, an approach that we have successfully demonstrated on flocking problems. MPC-based flocking controllers are high-performing but also computationally expensive. By learning a symmetric and distributed neural flocking controller from a centralized MPC-based one, we achieve the best of both worlds: the neural controllers have high performance (on par with the MPC controllers) and high efficiency. Our experimental results demonstrate the sophisticated nature of the distributed controllers we learn. In particular, the neural controllers are capable of achieving myriad flocking-oriented control objectives, including flocking formation, collision avoidance, obstacle avoidance, predator avoidance, and target seeking. Moreover, they generalize the behavior seen in the training data to achieve these objectives in a significantly broader range of scenarios. In terms of verification of our neural flocking controller, we use a form of statistical model checking to compute confidence intervals for its convergence rate and time to convergence.
format	Online Article Text
id	pubmed-7788607
institution	National Center for Biotechnology Information
language	English
publishDate	2020
record_format	MEDLINE/PubMed
spelling	pubmed-77886072021-01-07 Neural Flocking: MPC-Based Supervised Learning of Flocking Controllers Mehmood, Usama Roy, Shouvik Grosu, Radu Smolka, Scott A. Stoller, Scott D. Tiwari, Ashish Foundations of Software Science and Computation Structures Article We show how a symmetric and fully distributed flocking controller can be synthesized using Deep Learning from a centralized flocking controller. Our approach is based on Supervised Learning, with the centralized controller providing the training data, in the form of trajectories of state-action pairs. We use Model Predictive Control (MPC) for the centralized controller, an approach that we have successfully demonstrated on flocking problems. MPC-based flocking controllers are high-performing but also computationally expensive. By learning a symmetric and distributed neural flocking controller from a centralized MPC-based one, we achieve the best of both worlds: the neural controllers have high performance (on par with the MPC controllers) and high efficiency. Our experimental results demonstrate the sophisticated nature of the distributed controllers we learn. In particular, the neural controllers are capable of achieving myriad flocking-oriented control objectives, including flocking formation, collision avoidance, obstacle avoidance, predator avoidance, and target seeking. Moreover, they generalize the behavior seen in the training data to achieve these objectives in a significantly broader range of scenarios. In terms of verification of our neural flocking controller, we use a form of statistical model checking to compute confidence intervals for its convergence rate and time to convergence. 2020-04-17 /pmc/articles/PMC7788607/ http://dx.doi.org/10.1007/978-3-030-45231-5_1 Text en © The Author(s) 2020 Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made. The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
spellingShingle	Article Mehmood, Usama Roy, Shouvik Grosu, Radu Smolka, Scott A. Stoller, Scott D. Tiwari, Ashish Neural Flocking: MPC-Based Supervised Learning of Flocking Controllers
title	Neural Flocking: MPC-Based Supervised Learning of Flocking Controllers
title_full	Neural Flocking: MPC-Based Supervised Learning of Flocking Controllers
title_fullStr	Neural Flocking: MPC-Based Supervised Learning of Flocking Controllers
title_full_unstemmed	Neural Flocking: MPC-Based Supervised Learning of Flocking Controllers
title_short	Neural Flocking: MPC-Based Supervised Learning of Flocking Controllers
title_sort	neural flocking: mpc-based supervised learning of flocking controllers
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7788607/ http://dx.doi.org/10.1007/978-3-030-45231-5_1
work_keys_str_mv	AT mehmoodusama neuralflockingmpcbasedsupervisedlearningofflockingcontrollers AT royshouvik neuralflockingmpcbasedsupervisedlearningofflockingcontrollers AT grosuradu neuralflockingmpcbasedsupervisedlearningofflockingcontrollers AT smolkascotta neuralflockingmpcbasedsupervisedlearningofflockingcontrollers AT stollerscottd neuralflockingmpcbasedsupervisedlearningofflockingcontrollers AT tiwariashish neuralflockingmpcbasedsupervisedlearningofflockingcontrollers

Neural Flocking: MPC-Based Supervised Learning of Flocking Controllers

Ejemplares similares