Cargando…

Regularized Weighted Nonparametric Likelihood Approach for High-Dimension Sparse Subdistribution Hazards Model for Competing Risk Data

Variable selection and penalized regression models in high-dimension settings have become an increasingly important topic in many disciplines. For instance, omics data are generated in biomedical researches that may be associated with survival of patients and suggest insights into disease dynamics t...

Descripción completa

Detalles Bibliográficos
Autores principales: Tapak, Leili, Kosorok, Michael R., Sadeghifar, Majid, Hamidi, Omid, Afshar, Saeid, Doosti, Hassan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8476266/
https://www.ncbi.nlm.nih.gov/pubmed/34589136
http://dx.doi.org/10.1155/2021/5169052
_version_ 1784575571451183104
author Tapak, Leili
Kosorok, Michael R.
Sadeghifar, Majid
Hamidi, Omid
Afshar, Saeid
Doosti, Hassan
author_facet Tapak, Leili
Kosorok, Michael R.
Sadeghifar, Majid
Hamidi, Omid
Afshar, Saeid
Doosti, Hassan
author_sort Tapak, Leili
collection PubMed
description Variable selection and penalized regression models in high-dimension settings have become an increasingly important topic in many disciplines. For instance, omics data are generated in biomedical researches that may be associated with survival of patients and suggest insights into disease dynamics to identify patients with worse prognosis and to improve the therapy. Analysis of high-dimensional time-to-event data in the presence of competing risks requires special modeling techniques. So far, some attempts have been made to variable selection in low- and high-dimension competing risk setting using partial likelihood-based procedures. In this paper, a weighted likelihood-based penalized approach is extended for direct variable selection under the subdistribution hazards model for high-dimensional competing risk data. The proposed method which considers a larger class of semiparametric regression models for the subdistribution allows for taking into account time-varying effects and is of particular importance, because the proportional hazards assumption may not be valid in general, especially in the high-dimension setting. Also, this model relaxes from the constraint of the ability to simultaneously model multiple cumulative incidence functions using the Fine and Gray approach. The performance/effectiveness of several penalties including minimax concave penalty (MCP); adaptive LASSO and smoothly clipped absolute deviation (SCAD) as well as their L(2) counterparts were investigated through simulation studies in terms of sensitivity/specificity. The results revealed that sensitivity of all penalties were comparable, but the MCP and MCP-L(2) penalties outperformed the other methods in term of selecting less noninformative variables. The practical use of the model was investigated through the analysis of genomic competing risk data obtained from patients with bladder cancer and six genes of CDC20, NCF2, SMARCAD1, RTN4, ETFDH, and SON were identified using all the methods and were significantly correlated with the subdistribution.
format Online
Article
Text
id pubmed-8476266
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-84762662021-09-28 Regularized Weighted Nonparametric Likelihood Approach for High-Dimension Sparse Subdistribution Hazards Model for Competing Risk Data Tapak, Leili Kosorok, Michael R. Sadeghifar, Majid Hamidi, Omid Afshar, Saeid Doosti, Hassan Comput Math Methods Med Research Article Variable selection and penalized regression models in high-dimension settings have become an increasingly important topic in many disciplines. For instance, omics data are generated in biomedical researches that may be associated with survival of patients and suggest insights into disease dynamics to identify patients with worse prognosis and to improve the therapy. Analysis of high-dimensional time-to-event data in the presence of competing risks requires special modeling techniques. So far, some attempts have been made to variable selection in low- and high-dimension competing risk setting using partial likelihood-based procedures. In this paper, a weighted likelihood-based penalized approach is extended for direct variable selection under the subdistribution hazards model for high-dimensional competing risk data. The proposed method which considers a larger class of semiparametric regression models for the subdistribution allows for taking into account time-varying effects and is of particular importance, because the proportional hazards assumption may not be valid in general, especially in the high-dimension setting. Also, this model relaxes from the constraint of the ability to simultaneously model multiple cumulative incidence functions using the Fine and Gray approach. The performance/effectiveness of several penalties including minimax concave penalty (MCP); adaptive LASSO and smoothly clipped absolute deviation (SCAD) as well as their L(2) counterparts were investigated through simulation studies in terms of sensitivity/specificity. The results revealed that sensitivity of all penalties were comparable, but the MCP and MCP-L(2) penalties outperformed the other methods in term of selecting less noninformative variables. The practical use of the model was investigated through the analysis of genomic competing risk data obtained from patients with bladder cancer and six genes of CDC20, NCF2, SMARCAD1, RTN4, ETFDH, and SON were identified using all the methods and were significantly correlated with the subdistribution. Hindawi 2021-09-19 /pmc/articles/PMC8476266/ /pubmed/34589136 http://dx.doi.org/10.1155/2021/5169052 Text en Copyright © 2021 Leili Tapak et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Tapak, Leili
Kosorok, Michael R.
Sadeghifar, Majid
Hamidi, Omid
Afshar, Saeid
Doosti, Hassan
Regularized Weighted Nonparametric Likelihood Approach for High-Dimension Sparse Subdistribution Hazards Model for Competing Risk Data
title Regularized Weighted Nonparametric Likelihood Approach for High-Dimension Sparse Subdistribution Hazards Model for Competing Risk Data
title_full Regularized Weighted Nonparametric Likelihood Approach for High-Dimension Sparse Subdistribution Hazards Model for Competing Risk Data
title_fullStr Regularized Weighted Nonparametric Likelihood Approach for High-Dimension Sparse Subdistribution Hazards Model for Competing Risk Data
title_full_unstemmed Regularized Weighted Nonparametric Likelihood Approach for High-Dimension Sparse Subdistribution Hazards Model for Competing Risk Data
title_short Regularized Weighted Nonparametric Likelihood Approach for High-Dimension Sparse Subdistribution Hazards Model for Competing Risk Data
title_sort regularized weighted nonparametric likelihood approach for high-dimension sparse subdistribution hazards model for competing risk data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8476266/
https://www.ncbi.nlm.nih.gov/pubmed/34589136
http://dx.doi.org/10.1155/2021/5169052
work_keys_str_mv AT tapakleili regularizedweightednonparametriclikelihoodapproachforhighdimensionsparsesubdistributionhazardsmodelforcompetingriskdata
AT kosorokmichaelr regularizedweightednonparametriclikelihoodapproachforhighdimensionsparsesubdistributionhazardsmodelforcompetingriskdata
AT sadeghifarmajid regularizedweightednonparametriclikelihoodapproachforhighdimensionsparsesubdistributionhazardsmodelforcompetingriskdata
AT hamidiomid regularizedweightednonparametriclikelihoodapproachforhighdimensionsparsesubdistributionhazardsmodelforcompetingriskdata
AT afsharsaeid regularizedweightednonparametriclikelihoodapproachforhighdimensionsparsesubdistributionhazardsmodelforcompetingriskdata
AT doostihassan regularizedweightednonparametriclikelihoodapproachforhighdimensionsparsesubdistributionhazardsmodelforcompetingriskdata