Cargando…

Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data

Semiparametric joint models of longitudinal and competing risk data are computationally costly, and their current implementations do not scale well to massive biobank data. This paper identifies and addresses some key computational barriers in a semiparametric joint model for longitudinal and compet...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Shanpeng, Li, Ning, Wang, Hong, Zhou, Jin, Zhou, Hua, Li, Gang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8846996/
https://www.ncbi.nlm.nih.gov/pubmed/35178111
http://dx.doi.org/10.1155/2022/1362913
_version_ 1784651955469025280
author Li, Shanpeng
Li, Ning
Wang, Hong
Zhou, Jin
Zhou, Hua
Li, Gang
author_facet Li, Shanpeng
Li, Ning
Wang, Hong
Zhou, Jin
Zhou, Hua
Li, Gang
author_sort Li, Shanpeng
collection PubMed
description Semiparametric joint models of longitudinal and competing risk data are computationally costly, and their current implementations do not scale well to massive biobank data. This paper identifies and addresses some key computational barriers in a semiparametric joint model for longitudinal and competing risk survival data. By developing and implementing customized linear scan algorithms, we reduce the computational complexities from O(n(2)) or O(n(3)) to O(n) in various steps including numerical integration, risk set calculation, and standard error estimation, where n is the number of subjects. Using both simulated and real-world biobank data, we demonstrate that these linear scan algorithms can speed up the existing methods by a factor of up to hundreds of thousands when n > 10(4), often reducing the runtime from days to minutes. We have developed an R package, FastJM, based on the proposed algorithms for joint modeling of longitudinal and competing risk time-to-event data and made it publicly available on the Comprehensive R Archive Network (CRAN).
format Online
Article
Text
id pubmed-8846996
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-88469962022-02-16 Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data Li, Shanpeng Li, Ning Wang, Hong Zhou, Jin Zhou, Hua Li, Gang Comput Math Methods Med Research Article Semiparametric joint models of longitudinal and competing risk data are computationally costly, and their current implementations do not scale well to massive biobank data. This paper identifies and addresses some key computational barriers in a semiparametric joint model for longitudinal and competing risk survival data. By developing and implementing customized linear scan algorithms, we reduce the computational complexities from O(n(2)) or O(n(3)) to O(n) in various steps including numerical integration, risk set calculation, and standard error estimation, where n is the number of subjects. Using both simulated and real-world biobank data, we demonstrate that these linear scan algorithms can speed up the existing methods by a factor of up to hundreds of thousands when n > 10(4), often reducing the runtime from days to minutes. We have developed an R package, FastJM, based on the proposed algorithms for joint modeling of longitudinal and competing risk time-to-event data and made it publicly available on the Comprehensive R Archive Network (CRAN). Hindawi 2022-02-08 /pmc/articles/PMC8846996/ /pubmed/35178111 http://dx.doi.org/10.1155/2022/1362913 Text en Copyright © 2022 Shanpeng Li et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Li, Shanpeng
Li, Ning
Wang, Hong
Zhou, Jin
Zhou, Hua
Li, Gang
Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data
title Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data
title_full Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data
title_fullStr Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data
title_full_unstemmed Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data
title_short Efficient Algorithms and Implementation of a Semiparametric Joint Model for Longitudinal and Competing Risk Data: With Applications to Massive Biobank Data
title_sort efficient algorithms and implementation of a semiparametric joint model for longitudinal and competing risk data: with applications to massive biobank data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8846996/
https://www.ncbi.nlm.nih.gov/pubmed/35178111
http://dx.doi.org/10.1155/2022/1362913
work_keys_str_mv AT lishanpeng efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata
AT lining efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata
AT wanghong efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata
AT zhoujin efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata
AT zhouhua efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata
AT ligang efficientalgorithmsandimplementationofasemiparametricjointmodelforlongitudinalandcompetingriskdatawithapplicationstomassivebiobankdata