Cargando…

Action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning

Despite active research on trading systems based on reinforcement learning, the development and performance of research methods require improvements. This study proposes a new action-specialized expert ensemble method consisting of action-specialized expert models designed specifically for each rein...

Descripción completa

Detalles Bibliográficos
Autores principales: Leem, JoonBum, Kim, Ha Young
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7384672/
https://www.ncbi.nlm.nih.gov/pubmed/32716945
http://dx.doi.org/10.1371/journal.pone.0236178
_version_ 1783563646259953664
author Leem, JoonBum
Kim, Ha Young
author_facet Leem, JoonBum
Kim, Ha Young
author_sort Leem, JoonBum
collection PubMed
description Despite active research on trading systems based on reinforcement learning, the development and performance of research methods require improvements. This study proposes a new action-specialized expert ensemble method consisting of action-specialized expert models designed specifically for each reinforcement learning action: buy, hold, and sell. Models are constructed by examining and defining different reward values that correlate with each action under specific conditions, and investment behavior is reflected with each expert model. To verify the performance of this technique, profits of the proposed system are compared to those of single trading and common ensemble systems. To verify robustness and account for the extension of discrete action space, we compared and analyzed changes in profits of the three actions to our model’s results. Furthermore, we checked for sensitivity with three different reward functions: profit, Sharpe ratio, and Sortino ratio. All experiments were conducted with S&P500, Hang Seng Index, and Eurostoxx50 data. The model was 39.1% and 21.6% more efficient than single and common ensemble models, respectively. Considering the extended discrete action space, the 3-action space was extended to 11- and 21-action spaces, and the cumulative returns increased by 427.2% and 856.7%, respectively. Results on reward functions indicated that our models are well trained; results of the Sharpe and Sortino ratios were better than the implementation of profit only, as in the single-model cases. The Sortino ratio was slightly better than the Sharpe ratio.
format Online
Article
Text
id pubmed-7384672
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-73846722020-08-05 Action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning Leem, JoonBum Kim, Ha Young PLoS One Research Article Despite active research on trading systems based on reinforcement learning, the development and performance of research methods require improvements. This study proposes a new action-specialized expert ensemble method consisting of action-specialized expert models designed specifically for each reinforcement learning action: buy, hold, and sell. Models are constructed by examining and defining different reward values that correlate with each action under specific conditions, and investment behavior is reflected with each expert model. To verify the performance of this technique, profits of the proposed system are compared to those of single trading and common ensemble systems. To verify robustness and account for the extension of discrete action space, we compared and analyzed changes in profits of the three actions to our model’s results. Furthermore, we checked for sensitivity with three different reward functions: profit, Sharpe ratio, and Sortino ratio. All experiments were conducted with S&P500, Hang Seng Index, and Eurostoxx50 data. The model was 39.1% and 21.6% more efficient than single and common ensemble models, respectively. Considering the extended discrete action space, the 3-action space was extended to 11- and 21-action spaces, and the cumulative returns increased by 427.2% and 856.7%, respectively. Results on reward functions indicated that our models are well trained; results of the Sharpe and Sortino ratios were better than the implementation of profit only, as in the single-model cases. The Sortino ratio was slightly better than the Sharpe ratio. Public Library of Science 2020-07-27 /pmc/articles/PMC7384672/ /pubmed/32716945 http://dx.doi.org/10.1371/journal.pone.0236178 Text en © 2020 Leem, Kim http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Leem, JoonBum
Kim, Ha Young
Action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning
title Action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning
title_full Action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning
title_fullStr Action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning
title_full_unstemmed Action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning
title_short Action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning
title_sort action-specialized expert ensemble trading system with extended discrete action space using deep reinforcement learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7384672/
https://www.ncbi.nlm.nih.gov/pubmed/32716945
http://dx.doi.org/10.1371/journal.pone.0236178
work_keys_str_mv AT leemjoonbum actionspecializedexpertensembletradingsystemwithextendeddiscreteactionspaceusingdeepreinforcementlearning
AT kimhayoung actionspecializedexpertensembletradingsystemwithextendeddiscreteactionspaceusingdeepreinforcementlearning