Cargando…

Dimension reduction and shrinkage methods for high dimensional disease risk scores in historical data

BACKGROUND: Multivariable confounder adjustment in comparative studies of newly marketed drugs can be limited by small numbers of exposed patients and even fewer outcomes. Disease risk scores (DRSs) developed in historical comparator drug users before the new drug entered the market may improve adju...

Descripción completa

Detalles Bibliográficos
Autores principales: Kumamaru, Hiraku, Schneeweiss, Sebastian, Glynn, Robert J., Setoguchi, Soko, Gagne, Joshua J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4822311/
https://www.ncbi.nlm.nih.gov/pubmed/27053942
http://dx.doi.org/10.1186/s12982-016-0047-x
_version_ 1782425762482094080
author Kumamaru, Hiraku
Schneeweiss, Sebastian
Glynn, Robert J.
Setoguchi, Soko
Gagne, Joshua J.
author_facet Kumamaru, Hiraku
Schneeweiss, Sebastian
Glynn, Robert J.
Setoguchi, Soko
Gagne, Joshua J.
author_sort Kumamaru, Hiraku
collection PubMed
description BACKGROUND: Multivariable confounder adjustment in comparative studies of newly marketed drugs can be limited by small numbers of exposed patients and even fewer outcomes. Disease risk scores (DRSs) developed in historical comparator drug users before the new drug entered the market may improve adjustment. However, in a high dimensional data setting, empirical selection of hundreds of potential confounders and modeling of DRS even in the historical cohort can lead to over-fitting and reduced predictive performance in the study cohort. We propose the use of combinations of dimension reduction and shrinkage methods to overcome this problem, and compared the performances of these modeling strategies for implementing high dimensional (hd) DRSs from historical data in two empirical study examples of newly marketed drugs versus comparator drugs after the new drugs’ market entry—dabigatran versus warfarin for the outcome of major hemorrhagic events and cyclooxygenase-2 inhibitor (coxibs) versus nonselective non-steroidal anti-inflammatory drugs (nsNSAIDs) for gastrointestinal bleeds. RESULTS: Historical hdDRSs that included predefined and empirical outcome predictors with dimension reduction (principal component analysis; PCA) and shrinkage (lasso and ridge regression) approaches had higher c-statistics (0.66 for the PCA model, 0.64 for the PCA + ridge and 0.65 for the PCA + lasso models in the warfarin users) than an unreduced model (c-statistic, 0.54) in the dabigatran example. The odds ratio (OR) from PCA + lasso hdDRS-stratification [OR, 0.64; 95 % confidence interval (CI) 0.46–0.90] was closer to the benchmark estimate (0.93) from a randomized trial than the model without empirical predictors (OR, 0.58; 95 % CI 0.41–0.81). In the coxibs example, c-statistics of the hdDRSs in the nsNSAID initiators were 0.66 for the PCA model, 0.67 for the PCA + ridge model, and 0.67 for the PCA + lasso model; these were higher than for the unreduced model (c-statistic, 0.45), and comparable to the demographics + risk score model (c-statistic, 0.67). CONCLUSIONS: hdDRSs using historical data with dimension reduction and shrinkage was feasible, and improved confounding adjustment in two studies of newly marketed medications. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12982-016-0047-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4822311
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48223112016-04-07 Dimension reduction and shrinkage methods for high dimensional disease risk scores in historical data Kumamaru, Hiraku Schneeweiss, Sebastian Glynn, Robert J. Setoguchi, Soko Gagne, Joshua J. Emerg Themes Epidemiol Methodology BACKGROUND: Multivariable confounder adjustment in comparative studies of newly marketed drugs can be limited by small numbers of exposed patients and even fewer outcomes. Disease risk scores (DRSs) developed in historical comparator drug users before the new drug entered the market may improve adjustment. However, in a high dimensional data setting, empirical selection of hundreds of potential confounders and modeling of DRS even in the historical cohort can lead to over-fitting and reduced predictive performance in the study cohort. We propose the use of combinations of dimension reduction and shrinkage methods to overcome this problem, and compared the performances of these modeling strategies for implementing high dimensional (hd) DRSs from historical data in two empirical study examples of newly marketed drugs versus comparator drugs after the new drugs’ market entry—dabigatran versus warfarin for the outcome of major hemorrhagic events and cyclooxygenase-2 inhibitor (coxibs) versus nonselective non-steroidal anti-inflammatory drugs (nsNSAIDs) for gastrointestinal bleeds. RESULTS: Historical hdDRSs that included predefined and empirical outcome predictors with dimension reduction (principal component analysis; PCA) and shrinkage (lasso and ridge regression) approaches had higher c-statistics (0.66 for the PCA model, 0.64 for the PCA + ridge and 0.65 for the PCA + lasso models in the warfarin users) than an unreduced model (c-statistic, 0.54) in the dabigatran example. The odds ratio (OR) from PCA + lasso hdDRS-stratification [OR, 0.64; 95 % confidence interval (CI) 0.46–0.90] was closer to the benchmark estimate (0.93) from a randomized trial than the model without empirical predictors (OR, 0.58; 95 % CI 0.41–0.81). In the coxibs example, c-statistics of the hdDRSs in the nsNSAID initiators were 0.66 for the PCA model, 0.67 for the PCA + ridge model, and 0.67 for the PCA + lasso model; these were higher than for the unreduced model (c-statistic, 0.45), and comparable to the demographics + risk score model (c-statistic, 0.67). CONCLUSIONS: hdDRSs using historical data with dimension reduction and shrinkage was feasible, and improved confounding adjustment in two studies of newly marketed medications. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12982-016-0047-x) contains supplementary material, which is available to authorized users. BioMed Central 2016-04-05 /pmc/articles/PMC4822311/ /pubmed/27053942 http://dx.doi.org/10.1186/s12982-016-0047-x Text en © Kumamaru et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Kumamaru, Hiraku
Schneeweiss, Sebastian
Glynn, Robert J.
Setoguchi, Soko
Gagne, Joshua J.
Dimension reduction and shrinkage methods for high dimensional disease risk scores in historical data
title Dimension reduction and shrinkage methods for high dimensional disease risk scores in historical data
title_full Dimension reduction and shrinkage methods for high dimensional disease risk scores in historical data
title_fullStr Dimension reduction and shrinkage methods for high dimensional disease risk scores in historical data
title_full_unstemmed Dimension reduction and shrinkage methods for high dimensional disease risk scores in historical data
title_short Dimension reduction and shrinkage methods for high dimensional disease risk scores in historical data
title_sort dimension reduction and shrinkage methods for high dimensional disease risk scores in historical data
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4822311/
https://www.ncbi.nlm.nih.gov/pubmed/27053942
http://dx.doi.org/10.1186/s12982-016-0047-x
work_keys_str_mv AT kumamaruhiraku dimensionreductionandshrinkagemethodsforhighdimensionaldiseaseriskscoresinhistoricaldata
AT schneeweisssebastian dimensionreductionandshrinkagemethodsforhighdimensionaldiseaseriskscoresinhistoricaldata
AT glynnrobertj dimensionreductionandshrinkagemethodsforhighdimensionaldiseaseriskscoresinhistoricaldata
AT setoguchisoko dimensionreductionandshrinkagemethodsforhighdimensionaldiseaseriskscoresinhistoricaldata
AT gagnejoshuaj dimensionreductionandshrinkagemethodsforhighdimensionaldiseaseriskscoresinhistoricaldata