Cargando…

Sortation Control Using Multi-Agent Deep Reinforcement Learning in N-Grid Sortation System

Intralogistics is a technology that optimizes, integrates, automates, and manages the logistics flow of goods within a logistics transportation and sortation center. As the demand for parcel transportation increases, many sortation systems have been developed. In general, the goal of sortation syste...

Descripción completa

Detalles Bibliográficos
Autores principales:	Kim, Ju-Bong, Choi, Ho-Bin, Hwang, Gyu-Young, Kim, Kwihoon, Hong, Yong-Geun, Han, Youn-Hee
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7349561/ https://www.ncbi.nlm.nih.gov/pubmed/32560217 http://dx.doi.org/10.3390/s20123401

_version_	1783557083344404480
author	Kim, Ju-Bong Choi, Ho-Bin Hwang, Gyu-Young Kim, Kwihoon Hong, Yong-Geun Han, Youn-Hee
author_facet	Kim, Ju-Bong Choi, Ho-Bin Hwang, Gyu-Young Kim, Kwihoon Hong, Yong-Geun Han, Youn-Hee
author_sort	Kim, Ju-Bong
collection	PubMed
description	Intralogistics is a technology that optimizes, integrates, automates, and manages the logistics flow of goods within a logistics transportation and sortation center. As the demand for parcel transportation increases, many sortation systems have been developed. In general, the goal of sortation systems is to route (or sort) parcels correctly and quickly. We design an n-grid sortation system that can be flexibly deployed and used at intralogistics warehouse and develop a collaborative multi-agent reinforcement learning (RL) algorithm to control the behavior of emitters or sorters in the system. We present two types of RL agents, emission agents and routing agents, and they are trained to achieve the given sortation goals together. For the verification of the proposed system and algorithm, we implement them in a full-fledged cyber-physical system simulator and describe the RL agents’ learning performance. From the learning results, we present that the well-trained collaborative RL agents can optimize their performance effectively. In particular, the routing agents finally learn to route the parcels through their optimal paths, while the emission agents finally learn to balance the inflow and outflow of parcels.
format	Online Article Text
id	pubmed-7349561
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-73495612020-07-14 Sortation Control Using Multi-Agent Deep Reinforcement Learning in N-Grid Sortation System Kim, Ju-Bong Choi, Ho-Bin Hwang, Gyu-Young Kim, Kwihoon Hong, Yong-Geun Han, Youn-Hee Sensors (Basel) Article Intralogistics is a technology that optimizes, integrates, automates, and manages the logistics flow of goods within a logistics transportation and sortation center. As the demand for parcel transportation increases, many sortation systems have been developed. In general, the goal of sortation systems is to route (or sort) parcels correctly and quickly. We design an n-grid sortation system that can be flexibly deployed and used at intralogistics warehouse and develop a collaborative multi-agent reinforcement learning (RL) algorithm to control the behavior of emitters or sorters in the system. We present two types of RL agents, emission agents and routing agents, and they are trained to achieve the given sortation goals together. For the verification of the proposed system and algorithm, we implement them in a full-fledged cyber-physical system simulator and describe the RL agents’ learning performance. From the learning results, we present that the well-trained collaborative RL agents can optimize their performance effectively. In particular, the routing agents finally learn to route the parcels through their optimal paths, while the emission agents finally learn to balance the inflow and outflow of parcels. MDPI 2020-06-16 /pmc/articles/PMC7349561/ /pubmed/32560217 http://dx.doi.org/10.3390/s20123401 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Kim, Ju-Bong Choi, Ho-Bin Hwang, Gyu-Young Kim, Kwihoon Hong, Yong-Geun Han, Youn-Hee Sortation Control Using Multi-Agent Deep Reinforcement Learning in N-Grid Sortation System
title	Sortation Control Using Multi-Agent Deep Reinforcement Learning in N-Grid Sortation System
title_full	Sortation Control Using Multi-Agent Deep Reinforcement Learning in N-Grid Sortation System
title_fullStr	Sortation Control Using Multi-Agent Deep Reinforcement Learning in N-Grid Sortation System
title_full_unstemmed	Sortation Control Using Multi-Agent Deep Reinforcement Learning in N-Grid Sortation System
title_short	Sortation Control Using Multi-Agent Deep Reinforcement Learning in N-Grid Sortation System
title_sort	sortation control using multi-agent deep reinforcement learning in n-grid sortation system
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7349561/ https://www.ncbi.nlm.nih.gov/pubmed/32560217 http://dx.doi.org/10.3390/s20123401
work_keys_str_mv	AT kimjubong sortationcontrolusingmultiagentdeepreinforcementlearninginngridsortationsystem AT choihobin sortationcontrolusingmultiagentdeepreinforcementlearninginngridsortationsystem AT hwanggyuyoung sortationcontrolusingmultiagentdeepreinforcementlearninginngridsortationsystem AT kimkwihoon sortationcontrolusingmultiagentdeepreinforcementlearninginngridsortationsystem AT hongyonggeun sortationcontrolusingmultiagentdeepreinforcementlearninginngridsortationsystem AT hanyounhee sortationcontrolusingmultiagentdeepreinforcementlearninginngridsortationsystem

Sortation Control Using Multi-Agent Deep Reinforcement Learning in N-Grid Sortation System

Ejemplares similares