Cargando…
Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data
Semiparametric joint models of longitudinal and competing risk data are computationally costly, and their current implementations do not scale well to massive biobank data. This paper identifies and addresses some key computational barriers in a semiparametric joint model for longitudinal and compet...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8846996/ https://www.ncbi.nlm.nih.gov/pubmed/35178111 http://dx.doi.org/10.1155/2022/1362913 |
_version_ | 1784651955469025280 |
---|---|
author | Li, Shanpeng Li, Ning Wang, Hong Zhou, Jin Zhou, Hua Li, Gang |
author_facet | Li, Shanpeng Li, Ning Wang, Hong Zhou, Jin Zhou, Hua Li, Gang |
author_sort | Li, Shanpeng |
collection | PubMed |
description | Semiparametric joint models of longitudinal and competing risk data are computationally costly, and their current implementations do not scale well to massive biobank data. This paper identifies and addresses some key computational barriers in a semiparametric joint model for longitudinal and competing risk survival data. By developing and implementing customized linear scan algorithms, we reduce the computational complexities from O(n(2)) or O(n(3)) to O(n) in various steps including numerical integration, risk set calculation, and standard error estimation, where n is the number of subjects. Using both simulated and real-world biobank data, we demonstrate that these linear scan algorithms can speed up the existing methods by a factor of up to hundreds of thousands when n > 10(4), often reducing the runtime from days to minutes. We have developed an R package, FastJM, based on the proposed algorithms for joint modeling of longitudinal and competing risk time-to-event data and made it publicly available on the Comprehensive R Archive Network (CRAN). |
format | Online Article Text |
id | pubmed-8846996 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-88469962022-02-16 Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data Li, Shanpeng Li, Ning Wang, Hong Zhou, Jin Zhou, Hua Li, Gang Comput Math Methods Med Research Article Semiparametric joint models of longitudinal and competing risk data are computationally costly, and their current implementations do not scale well to massive biobank data. This paper identifies and addresses some key computational barriers in a semiparametric joint model for longitudinal and competing risk survival data. By developing and implementing customized linear scan algorithms, we reduce the computational complexities from O(n(2)) or O(n(3)) to O(n) in various steps including numerical integration, risk set calculation, and standard error estimation, where n is the number of subjects. Using both simulated and real-world biobank data, we demonstrate that these linear scan algorithms can speed up the existing methods by a factor of up to hundreds of thousands when n > 10(4), often reducing the runtime from days to minutes. We have developed an R package, FastJM, based on the proposed algorithms for joint modeling of longitudinal and competing risk time-to-event data and made it publicly available on the Comprehensive R Archive Network (CRAN). Hindawi 2022-02-08 /pmc/articles/PMC8846996/ /pubmed/35178111 http://dx.doi.org/10.1155/2022/1362913 Text en Copyright © 2022 Shanpeng Li et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Li, Shanpeng Li, Ning Wang, Hong Zhou, Jin Zhou, Hua Li, Gang Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data |
title | Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data |
title_full | Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data |
title_fullStr | Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data |
title_full_unstemmed | Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data |
title_short | Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data |
title_sort | efficient algorithms and implementation of a semiparametric joint model for longitudinal and competing risk data: with applications to massive biobank data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8846996/ https://www.ncbi.nlm.nih.gov/pubmed/35178111 http://dx.doi.org/10.1155/2022/1362913 |
work_keys_str_mv | AT lishanpeng efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata AT lining efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata AT wanghong efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata AT zhoujin efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata AT zhouhua efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata AT ligang efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata |