Cargando…

Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure

BACKGROUND: Clinical decisions are mainly driven by the ability of physicians to apply risk stratification to patients. However, this task is difficult as it requires complex integration of numerous parameters and is impacted by patient heterogeneity. We sought to evaluate the ability of transplant...

Descripción completa

Detalles Bibliográficos
Autores principales: Divard, Gillian, Raynaud, Marc, Tatapudi, Vasishta S., Abdalla, Basmah, Bailly, Elodie, Assayag, Maureen, Binois, Yannick, Cohen, Raphael, Zhang, Huanxi, Ulloa, Camillo, Linhares, Kamila, Tedesco, Helio S., Legendre, Christophe, Jouven, Xavier, Montgomery, Robert A., Lefaucheur, Carmen, Aubert, Olivier, Loupy, Alexandre
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9684574/
https://www.ncbi.nlm.nih.gov/pubmed/36418380
http://dx.doi.org/10.1038/s43856-022-00201-9
_version_ 1784835318106554368
author Divard, Gillian
Raynaud, Marc
Tatapudi, Vasishta S.
Abdalla, Basmah
Bailly, Elodie
Assayag, Maureen
Binois, Yannick
Cohen, Raphael
Zhang, Huanxi
Ulloa, Camillo
Linhares, Kamila
Tedesco, Helio S.
Legendre, Christophe
Jouven, Xavier
Montgomery, Robert A.
Lefaucheur, Carmen
Aubert, Olivier
Loupy, Alexandre
author_facet Divard, Gillian
Raynaud, Marc
Tatapudi, Vasishta S.
Abdalla, Basmah
Bailly, Elodie
Assayag, Maureen
Binois, Yannick
Cohen, Raphael
Zhang, Huanxi
Ulloa, Camillo
Linhares, Kamila
Tedesco, Helio S.
Legendre, Christophe
Jouven, Xavier
Montgomery, Robert A.
Lefaucheur, Carmen
Aubert, Olivier
Loupy, Alexandre
author_sort Divard, Gillian
collection PubMed
description BACKGROUND: Clinical decisions are mainly driven by the ability of physicians to apply risk stratification to patients. However, this task is difficult as it requires complex integration of numerous parameters and is impacted by patient heterogeneity. We sought to evaluate the ability of transplant physicians to predict the risk of long-term allograft failure and compare them to a validated artificial intelligence (AI) prediction algorithm. METHODS: We randomly selected 400 kidney transplant recipients from a qualified dataset of 4000 patients. For each patient, 44 features routinely collected during the first-year post-transplant were compiled in an electronic health record (EHR). We enrolled 9 transplant physicians at various career stages. At 1-year post-transplant, they blindly predicted the long-term graft survival with probabilities for each patient. Their predictions were compared with those of a validated prediction system (iBox). We assessed the determinants of each physician’s prediction using a random forest survival model. RESULTS: Among the 400 patients included, 84 graft failures occurred at 7 years post-evaluation. The iBox system demonstrates the best predictive performance with a discrimination of 0.79 and a median calibration error of 5.79%, while physicians tend to overestimate the risk of graft failure. Physicians’ risk predictions show wide heterogeneity with a moderate intraclass correlation of 0.58. The determinants of physicians’ prediction are disparate, with poor agreement regardless of their clinical experience. CONCLUSIONS: This study shows the overall limited performance and consistency of physicians to predict the risk of long-term graft failure, demonstrated by the superior performances of the iBox. This study supports the use of a companion tool to help physicians in their prognostic judgement and decision-making in clinical care.
format Online
Article
Text
id pubmed-9684574
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96845742022-11-25 Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure Divard, Gillian Raynaud, Marc Tatapudi, Vasishta S. Abdalla, Basmah Bailly, Elodie Assayag, Maureen Binois, Yannick Cohen, Raphael Zhang, Huanxi Ulloa, Camillo Linhares, Kamila Tedesco, Helio S. Legendre, Christophe Jouven, Xavier Montgomery, Robert A. Lefaucheur, Carmen Aubert, Olivier Loupy, Alexandre Commun Med (Lond) Article BACKGROUND: Clinical decisions are mainly driven by the ability of physicians to apply risk stratification to patients. However, this task is difficult as it requires complex integration of numerous parameters and is impacted by patient heterogeneity. We sought to evaluate the ability of transplant physicians to predict the risk of long-term allograft failure and compare them to a validated artificial intelligence (AI) prediction algorithm. METHODS: We randomly selected 400 kidney transplant recipients from a qualified dataset of 4000 patients. For each patient, 44 features routinely collected during the first-year post-transplant were compiled in an electronic health record (EHR). We enrolled 9 transplant physicians at various career stages. At 1-year post-transplant, they blindly predicted the long-term graft survival with probabilities for each patient. Their predictions were compared with those of a validated prediction system (iBox). We assessed the determinants of each physician’s prediction using a random forest survival model. RESULTS: Among the 400 patients included, 84 graft failures occurred at 7 years post-evaluation. The iBox system demonstrates the best predictive performance with a discrimination of 0.79 and a median calibration error of 5.79%, while physicians tend to overestimate the risk of graft failure. Physicians’ risk predictions show wide heterogeneity with a moderate intraclass correlation of 0.58. The determinants of physicians’ prediction are disparate, with poor agreement regardless of their clinical experience. CONCLUSIONS: This study shows the overall limited performance and consistency of physicians to predict the risk of long-term graft failure, demonstrated by the superior performances of the iBox. This study supports the use of a companion tool to help physicians in their prognostic judgement and decision-making in clinical care. Nature Publishing Group UK 2022-11-23 /pmc/articles/PMC9684574/ /pubmed/36418380 http://dx.doi.org/10.1038/s43856-022-00201-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Divard, Gillian
Raynaud, Marc
Tatapudi, Vasishta S.
Abdalla, Basmah
Bailly, Elodie
Assayag, Maureen
Binois, Yannick
Cohen, Raphael
Zhang, Huanxi
Ulloa, Camillo
Linhares, Kamila
Tedesco, Helio S.
Legendre, Christophe
Jouven, Xavier
Montgomery, Robert A.
Lefaucheur, Carmen
Aubert, Olivier
Loupy, Alexandre
Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure
title Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure
title_full Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure
title_fullStr Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure
title_full_unstemmed Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure
title_short Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure
title_sort comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9684574/
https://www.ncbi.nlm.nih.gov/pubmed/36418380
http://dx.doi.org/10.1038/s43856-022-00201-9
work_keys_str_mv AT divardgillian comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT raynaudmarc comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT tatapudivasishtas comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT abdallabasmah comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT baillyelodie comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT assayagmaureen comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT binoisyannick comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT cohenraphael comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT zhanghuanxi comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT ulloacamillo comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT linhareskamila comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT tedescohelios comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT legendrechristophe comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT jouvenxavier comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT montgomeryroberta comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT lefaucheurcarmen comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT aubertolivier comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure
AT loupyalexandre comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure