Cargando…
Risk-aware multi-armed bandit problem with application to portfolio selection
Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential dec...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Royal Society Publishing
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5717697/ https://www.ncbi.nlm.nih.gov/pubmed/29291122 http://dx.doi.org/10.1098/rsos.171377 |
_version_ | 1783284195952427008 |
---|---|
author | Huo, Xiaoguang Fu, Feng |
author_facet | Huo, Xiaoguang Fu, Feng |
author_sort | Huo, Xiaoguang |
collection | PubMed |
description | Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential decision-making under uncertainty, namely the exploration versus exploitation dilemma, and therefore provides a natural connection to portfolio selection. In this paper, we incorporate risk awareness into the classic multi-armed bandit setting and introduce an algorithm to construct portfolio. Through filtering assets based on the topological structure of the financial market and combining the optimal multi-armed bandit policy with the minimization of a coherent risk measure, we achieve a balance between risk and return. |
format | Online Article Text |
id | pubmed-5717697 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | The Royal Society Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-57176972017-12-29 Risk-aware multi-armed bandit problem with application to portfolio selection Huo, Xiaoguang Fu, Feng R Soc Open Sci Physics Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential decision-making under uncertainty, namely the exploration versus exploitation dilemma, and therefore provides a natural connection to portfolio selection. In this paper, we incorporate risk awareness into the classic multi-armed bandit setting and introduce an algorithm to construct portfolio. Through filtering assets based on the topological structure of the financial market and combining the optimal multi-armed bandit policy with the minimization of a coherent risk measure, we achieve a balance between risk and return. The Royal Society Publishing 2017-11-15 /pmc/articles/PMC5717697/ /pubmed/29291122 http://dx.doi.org/10.1098/rsos.171377 Text en © 2017 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited. |
spellingShingle | Physics Huo, Xiaoguang Fu, Feng Risk-aware multi-armed bandit problem with application to portfolio selection |
title | Risk-aware multi-armed bandit problem with application to portfolio selection |
title_full | Risk-aware multi-armed bandit problem with application to portfolio selection |
title_fullStr | Risk-aware multi-armed bandit problem with application to portfolio selection |
title_full_unstemmed | Risk-aware multi-armed bandit problem with application to portfolio selection |
title_short | Risk-aware multi-armed bandit problem with application to portfolio selection |
title_sort | risk-aware multi-armed bandit problem with application to portfolio selection |
topic | Physics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5717697/ https://www.ncbi.nlm.nih.gov/pubmed/29291122 http://dx.doi.org/10.1098/rsos.171377 |
work_keys_str_mv | AT huoxiaoguang riskawaremultiarmedbanditproblemwithapplicationtoportfolioselection AT fufeng riskawaremultiarmedbanditproblemwithapplicationtoportfolioselection |