Cargando…

Risk-aware multi-armed bandit problem with application to portfolio selection

Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential dec...

Descripción completa

Detalles Bibliográficos
Autores principales:	Huo, Xiaoguang, Fu, Feng
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	The Royal Society Publishing 2017
Materias:	Physics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5717697/ https://www.ncbi.nlm.nih.gov/pubmed/29291122 http://dx.doi.org/10.1098/rsos.171377

Descripción
Sumario:	Sequential portfolio selection has attracted increasing interest in the machine learning and quantitative finance communities in recent years. As a mathematical framework for reinforcement learning policies, the stochastic multi-armed bandit problem addresses the primary difficulty in sequential decision-making under uncertainty, namely the exploration versus exploitation dilemma, and therefore provides a natural connection to portfolio selection. In this paper, we incorporate risk awareness into the classic multi-armed bandit setting and introduce an algorithm to construct portfolio. Through filtering assets based on the topological structure of the financial market and combining the optimal multi-armed bandit policy with the minimization of a coherent risk measure, we achieve a balance between risk and return.

Risk-aware multi-armed bandit problem with application to portfolio selection

Ejemplares similares