Cargando…

Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials

Learning optimal individualized treatment rules (ITRs) has become increasingly important in the modern era of precision medicine. Many statistical and machine learning methods for learning optimal ITRs have been developed in the literature. However, most existing methods are based on data collected...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gao, Daiqi, Liu, Yufeng, Zeng, Donglin
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10419117/ https://www.ncbi.nlm.nih.gov/pubmed/37576335

_version_	1785088434086346752
author	Gao, Daiqi Liu, Yufeng Zeng, Donglin
author_facet	Gao, Daiqi Liu, Yufeng Zeng, Donglin
author_sort	Gao, Daiqi
collection	PubMed
description	Learning optimal individualized treatment rules (ITRs) has become increasingly important in the modern era of precision medicine. Many statistical and machine learning methods for learning optimal ITRs have been developed in the literature. However, most existing methods are based on data collected from traditional randomized controlled trials and thus cannot take advantage of the accumulative evidence when patients enter the trials sequentially. It is also ethically important that future patients should have a high probability to be treated optimally based on the updated knowledge so far. In this work, we propose a new design called sequentially rule-adaptive trials to learn optimal ITRs based on the contextual bandit framework, in contrast to the response-adaptive design in traditional adaptive trials. In our design, each entering patient will be allocated with a high probability to the current best treatment for this patient, which is estimated using the past data based on some machine learning algorithm (for example, outcome weighted learning in our implementation). We explore the tradeoff between training and test values of the estimated ITR in single-stage problems by proving theoretically that for a higher probability of following the estimated ITR, the training value converges to the optimal value at a faster rate, while the test value converges at a slower rate. This problem is different from traditional decision problems in the sense that the training data are generated sequentially and are dependent. We also develop a tool that combines martingale with empirical process to tackle the problem that cannot be solved by previous techniques for i.i.d. data. We show by numerical examples that without much loss of the test value, our proposed algorithm can improve the training value significantly as compared to existing methods. Finally, we use a real data study to illustrate the performance of the proposed method.
format	Online Article Text
id	pubmed-10419117
institution	National Center for Biotechnology Information
language	English
publishDate	2022
record_format	MEDLINE/PubMed
spelling	pubmed-104191172023-08-11 Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials Gao, Daiqi Liu, Yufeng Zeng, Donglin J Mach Learn Res Article Learning optimal individualized treatment rules (ITRs) has become increasingly important in the modern era of precision medicine. Many statistical and machine learning methods for learning optimal ITRs have been developed in the literature. However, most existing methods are based on data collected from traditional randomized controlled trials and thus cannot take advantage of the accumulative evidence when patients enter the trials sequentially. It is also ethically important that future patients should have a high probability to be treated optimally based on the updated knowledge so far. In this work, we propose a new design called sequentially rule-adaptive trials to learn optimal ITRs based on the contextual bandit framework, in contrast to the response-adaptive design in traditional adaptive trials. In our design, each entering patient will be allocated with a high probability to the current best treatment for this patient, which is estimated using the past data based on some machine learning algorithm (for example, outcome weighted learning in our implementation). We explore the tradeoff between training and test values of the estimated ITR in single-stage problems by proving theoretically that for a higher probability of following the estimated ITR, the training value converges to the optimal value at a faster rate, while the test value converges at a slower rate. This problem is different from traditional decision problems in the sense that the training data are generated sequentially and are dependent. We also develop a tool that combines martingale with empirical process to tackle the problem that cannot be solved by previous techniques for i.i.d. data. We show by numerical examples that without much loss of the test value, our proposed algorithm can improve the training value significantly as compared to existing methods. Finally, we use a real data study to illustrate the performance of the proposed method. 2022 /pmc/articles/PMC10419117/ /pubmed/37576335 Text en https://creativecommons.org/licenses/by/4.0/License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v23/21-0354.html.
spellingShingle	Article Gao, Daiqi Liu, Yufeng Zeng, Donglin Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials
title	Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials
title_full	Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials
title_fullStr	Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials
title_full_unstemmed	Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials
title_short	Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials
title_sort	non-asymptotic properties of individualized treatment rules from sequentially rule-adaptive trials
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10419117/ https://www.ncbi.nlm.nih.gov/pubmed/37576335
work_keys_str_mv	AT gaodaiqi nonasymptoticpropertiesofindividualizedtreatmentrulesfromsequentiallyruleadaptivetrials AT liuyufeng nonasymptoticpropertiesofindividualizedtreatmentrulesfromsequentiallyruleadaptivetrials AT zengdonglin nonasymptoticpropertiesofindividualizedtreatmentrulesfromsequentiallyruleadaptivetrials

Non-asymptotic Properties of Individualized Treatment Rules from Sequentially Rule-Adaptive Trials

Ejemplares similares