
IPDfromKM: reconstruct individual patient data from published Kaplan-Meier survival curves

BACKGROUND: When applying secondary analysis on published survival data, it is critical to obtain each patient’s raw data, because the individual patient data (IPD) approach has been considered as the gold standard of data analysis. However, researchers often lack access to IPD. We aim to propose a...


Bibliographic Details
Main Authors: Liu, Na, Zhou, Yanhong, Lee, J. Jack
Format: Online Article Text
Language: English
Published: BioMed Central 2021
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8168323/
https://www.ncbi.nlm.nih.gov/pubmed/34074267
http://dx.doi.org/10.1186/s12874-021-01308-8
collection PubMed
description BACKGROUND: When applying secondary analysis on published survival data, it is critical to obtain each patient’s raw data, because the individual patient data (IPD) approach has been considered as the gold standard of data analysis. However, researchers often lack access to IPD. We aim to propose a straightforward and robust approach to obtain IPD from published survival curves with a user-friendly software platform. RESULTS: Improving upon existing methods, we propose an easy-to-use, two-stage approach to reconstruct IPD from published Kaplan-Meier (K-M) curves. Stage 1 extracts raw data coordinates and Stage 2 reconstructs IPD using the proposed method. To facilitate the use of the proposed method, we developed the R package IPDfromKM and an accompanying web-based Shiny application. Both the R package and Shiny application have an “all-in-one” feature such that users can use them to extract raw data coordinates from published K-M curves, reconstruct IPD from the extracted data coordinates, visualize the reconstructed IPD, assess the accuracy of the reconstruction, and perform secondary analysis on the basis of the reconstructed IPD. We illustrate the use of the R package and the Shiny application with K-M curves from published studies. Extensive simulations and real-world data applications demonstrate that the proposed method has high accuracy and great reliability in estimating the number of events, number of patients at risk, survival probabilities, median survival times, and hazard ratios. CONCLUSIONS: IPDfromKM has great flexibility and accuracy to reconstruct IPD from published K-M curves with different shapes. We believe that the R package and the Shiny application will greatly facilitate the potential use of quality IPD and advance the use of secondary data to facilitate informed decision making in medical research.
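The abstract describes reconstruction only at a high level, so the following is a minimal Python sketch of the general idea of inverting a Kaplan-Meier step function. It is not the IPDfromKM algorithm and does not use the R package's API. It assumes that the curve coordinates have already been read off the plot, that the total number of patients is known, and that no patient is censored before the last time point. The function names `reconstruct_ipd` and `kaplan_meier` are hypothetical and exist only for this illustration.

```python
from typing import List, Tuple

Record = Tuple[float, int]  # (time, status): status 1 = event, 0 = censored


def reconstruct_ipd(times: List[float], surv: List[float], n_total: int) -> List[Record]:
    """Rebuild patient-level records from K-M step coordinates.

    Simplifying assumption (not from the paper): nobody is censored before
    the last time point, so each drop in the curve is due to events alone.
    """
    ipd: List[Record] = []
    at_risk = n_total
    prev_s = 1.0  # survival implied by the records rebuilt so far
    for t, s in zip(times, surv):
        if at_risk <= 0:
            break
        # K-M step: S(t) = S(t-) * (1 - d / n), so d = n * (1 - S(t) / S(t-))
        d = round(at_risk * (1.0 - s / prev_s))
        d = max(0, min(d, at_risk))
        ipd.extend((t, 1) for _ in range(d))
        prev_s *= 1.0 - d / at_risk
        at_risk -= d
    # Patients still at risk are treated as censored at the last time point.
    ipd.extend((times[-1], 0) for _ in range(at_risk))
    return ipd


def kaplan_meier(ipd: List[Record]) -> List[Tuple[float, float]]:
    """Return (time, survival) at each distinct event time."""
    curve: List[Tuple[float, float]] = []
    surv = 1.0
    at_risk = len(ipd)
    for t in sorted({time for time, _ in ipd}):
        at_t = [status for time, status in ipd if time == t]
        d = sum(at_t)
        if d > 0:
            surv *= 1.0 - d / at_risk
            curve.append((t, surv))
        at_risk -= len(at_t)  # events and censored patients leave the risk set
    return curve


# Made-up coordinates for illustration only, not data from the article.
ipd = reconstruct_ipd([2.0, 5.0, 9.0], [0.9, 0.6, 0.5], n_total=10)
curve = kaplan_meier(ipd)
```

Re-estimating the curve from the rebuilt records and comparing it with the input coordinates is a simple accuracy check in the spirit of the one the abstract mentions. A realistic reconstruction would also need to account for censoring, for example by using reported numbers of patients at risk.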
format Online
Article
Text
id pubmed-8168323
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-8168323 2021-06-02 BMC Med Res Methodol Software BioMed Central 2021-06-01 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/ Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
topic Software