
MSPM: A modularized and scalable multi-agent reinforcement learning-based system for financial portfolio management

Financial portfolio management (PM) is one of the most applicable problems in reinforcement learning (RL) owing to its sequential decision-making nature. However, existing RL-based approaches rarely focus on scalability or reusability to adapt to the ever-changing markets. These approaches are rigid...


Bibliographic Details
Main authors:	Huang, Zhenhan, Tanaka, Fumihide
Format:	Online Article Text
Language:	English
Published:	Public Library of Science 2022
Subjects:	Research Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8856562/ https://www.ncbi.nlm.nih.gov/pubmed/35180235 http://dx.doi.org/10.1371/journal.pone.0263689

_version_	1784653875422167040
author	Huang, Zhenhan Tanaka, Fumihide
author_facet	Huang, Zhenhan Tanaka, Fumihide
author_sort	Huang, Zhenhan
collection	PubMed
description	Financial portfolio management (PM) is one of the most applicable problems in reinforcement learning (RL) owing to its sequential decision-making nature. However, existing RL-based approaches rarely focus on scalability or reusability to adapt to the ever-changing markets. These approaches are rigid and unscalable to accommodate the varying number of assets of portfolios and increasing need for heterogeneous data input. Also, RL agents in the existing systems are ad-hoc trained and hardly reusable for different portfolios. To confront the above problems, a modular design is desired for the systems to be compatible with reusable asset-dedicated agents. In this paper, we propose a multi-agent RL-based system for PM (MSPM). MSPM involves two types of asynchronously-updated modules: Evolving Agent Module (EAM) and Strategic Agent Module (SAM). An EAM is an information-generating module with a Deep Q-network (DQN) agent, and it receives heterogeneous data and generates signal-comprised information for a particular asset. An SAM is a decision-making module with a Proximal Policy Optimization (PPO) agent for portfolio optimization, and it connects to multiple EAMs to reallocate the corresponding assets in a financial portfolio. Once trained, EAMs can be connected to any SAM at will, like assembling LEGO blocks. With its modularized architecture, the multi-step condensation of volatile market information, and the reusable design of EAM, MSPM simultaneously addresses the two challenges in RL-based PM: scalability and reusability. Experiments on 8-year U.S. stock market data prove the effectiveness of MSPM in profit accumulation by its outperformance over five different baselines in terms of accumulated rate of return (ARR), daily rate of return (DRR), and Sortino ratio (SR). MSPM improves ARR by at least 186.5% compared to constant rebalanced portfolio (CRP), a widely-used PM strategy. To validate the indispensability of EAM, we back-test and compare MSPMs on four different portfolios. EAM-enabled MSPMs improve ARR by at least 1341.8% compared to EAM-disabled MSPMs.
format	Online Article Text
id	pubmed-8856562
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-8856562 2022-02-19 MSPM: A modularized and scalable multi-agent reinforcement learning-based system for financial portfolio management Huang, Zhenhan Tanaka, Fumihide PLoS One Research Article Public Library of Science 2022-02-18 /pmc/articles/PMC8856562/ /pubmed/35180235 http://dx.doi.org/10.1371/journal.pone.0263689 Text en © 2022 Huang, Tanaka https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title	MSPM: A modularized and scalable multi-agent reinforcement learning-based system for financial portfolio management
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8856562/ https://www.ncbi.nlm.nih.gov/pubmed/35180235 http://dx.doi.org/10.1371/journal.pone.0263689


Similar items