Cargando…

Risk-aware multi-armed bandit problem with application to portfolio selection

Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential dec...

Descripción completa

Detalles Bibliográficos
Autores principales: Huo, Xiaoguang, Fu, Feng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Royal Society Publishing 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5717697/
https://www.ncbi.nlm.nih.gov/pubmed/29291122
http://dx.doi.org/10.1098/rsos.171377
_version_ 1783284195952427008
author Huo, Xiaoguang
Fu, Feng
author_facet Huo, Xiaoguang
Fu, Feng
author_sort Huo, Xiaoguang
collection PubMed
description Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential decision-making under uncertainty, namely the exploration versus exploitation dilemma, and therefore provides a natural connection to portfolio selection. In this paper, we incorporate risk awareness into the classic multi-armed bandit setting and introduce an algorithm to construct portfolio. Through filtering assets based on the topological structure of the financial market and combining the optimal multi-armed bandit policy with the minimization of a coherent risk measure, we achieve a balance between risk and return.
format Online
Article
Text
id pubmed-5717697
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher The Royal Society Publishing
record_format MEDLINE/PubMed
spelling pubmed-57176972017-12-29 Risk-aware multi-armed bandit problem with application to portfolio selection Huo, Xiaoguang Fu, Feng R Soc Open Sci Physics Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential decision-making under uncertainty, namely the exploration versus exploitation dilemma, and therefore provides a natural connection to portfolio selection. In this paper, we incorporate risk awareness into the classic multi-armed bandit setting and introduce an algorithm to construct portfolio. Through filtering assets based on the topological structure of the financial market and combining the optimal multi-armed bandit policy with the minimization of a coherent risk measure, we achieve a balance between risk and return. The Royal Society Publishing 2017-11-15 /pmc/articles/PMC5717697/ /pubmed/29291122 http://dx.doi.org/10.1098/rsos.171377 Text en © 2017 The Authors. http://creativecommons.org/licenses/by/4.0/ Published by the Royal Society under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0/, which permits unrestricted use, provided the original author and source are credited.
spellingShingle Physics
Huo, Xiaoguang
Fu, Feng
Risk-aware multi-armed bandit problem with application to portfolio selection
title Risk-aware multi-armed bandit problem with application to portfolio selection
title_full Risk-aware multi-armed bandit problem with application to portfolio selection
title_fullStr Risk-aware multi-armed bandit problem with application to portfolio selection
title_full_unstemmed Risk-aware multi-armed bandit problem with application to portfolio selection
title_short Risk-aware multi-armed bandit problem with application to portfolio selection
title_sort risk-aware multi-armed bandit problem with application to portfolio selection
topic Physics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5717697/
https://www.ncbi.nlm.nih.gov/pubmed/29291122
http://dx.doi.org/10.1098/rsos.171377
work_keys_str_mv AT huoxiaoguang riskawaremultiarmedbanditproblemwithapplicationtoportfolioselection
AT fufeng riskawaremultiarmedbanditproblemwithapplicationtoportfolioselection