Cargando…
Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure
BACKGROUND: Clinical decisions are mainly driven by the ability of physicians to apply risk stratification to patients. However, this task is difficult as it requires complex integration of numerous parameters and is impacted by patient heterogeneity. We sought to evaluate the ability of transplant...
Autores principales: | , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9684574/ https://www.ncbi.nlm.nih.gov/pubmed/36418380 http://dx.doi.org/10.1038/s43856-022-00201-9 |
_version_ | 1784835318106554368 |
---|---|
author | Divard, Gillian Raynaud, Marc Tatapudi, Vasishta S. Abdalla, Basmah Bailly, Elodie Assayag, Maureen Binois, Yannick Cohen, Raphael Zhang, Huanxi Ulloa, Camillo Linhares, Kamila Tedesco, Helio S. Legendre, Christophe Jouven, Xavier Montgomery, Robert A. Lefaucheur, Carmen Aubert, Olivier Loupy, Alexandre |
author_facet | Divard, Gillian Raynaud, Marc Tatapudi, Vasishta S. Abdalla, Basmah Bailly, Elodie Assayag, Maureen Binois, Yannick Cohen, Raphael Zhang, Huanxi Ulloa, Camillo Linhares, Kamila Tedesco, Helio S. Legendre, Christophe Jouven, Xavier Montgomery, Robert A. Lefaucheur, Carmen Aubert, Olivier Loupy, Alexandre |
author_sort | Divard, Gillian |
collection | PubMed |
description | BACKGROUND: Clinical decisions are mainly driven by the ability of physicians to apply risk stratification to patients. However, this task is difficult as it requires complex integration of numerous parameters and is impacted by patient heterogeneity. We sought to evaluate the ability of transplant physicians to predict the risk of long-term allograft failure and compare them to a validated artificial intelligence (AI) prediction algorithm. METHODS: We randomly selected 400 kidney transplant recipients from a qualified dataset of 4000 patients. For each patient, 44 features routinely collected during the first-year post-transplant were compiled in an electronic health record (EHR). We enrolled 9 transplant physicians at various career stages. At 1-year post-transplant, they blindly predicted the long-term graft survival with probabilities for each patient. Their predictions were compared with those of a validated prediction system (iBox). We assessed the determinants of each physician’s prediction using a random forest survival model. RESULTS: Among the 400 patients included, 84 graft failures occurred at 7 years post-evaluation. The iBox system demonstrates the best predictive performance with a discrimination of 0.79 and a median calibration error of 5.79%, while physicians tend to overestimate the risk of graft failure. Physicians’ risk predictions show wide heterogeneity with a moderate intraclass correlation of 0.58. The determinants of physicians’ prediction are disparate, with poor agreement regardless of their clinical experience. CONCLUSIONS: This study shows the overall limited performance and consistency of physicians to predict the risk of long-term graft failure, demonstrated by the superior performances of the iBox. This study supports the use of a companion tool to help physicians in their prognostic judgement and decision-making in clinical care. |
format | Online Article Text |
id | pubmed-9684574 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-96845742022-11-25 Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure Divard, Gillian Raynaud, Marc Tatapudi, Vasishta S. Abdalla, Basmah Bailly, Elodie Assayag, Maureen Binois, Yannick Cohen, Raphael Zhang, Huanxi Ulloa, Camillo Linhares, Kamila Tedesco, Helio S. Legendre, Christophe Jouven, Xavier Montgomery, Robert A. Lefaucheur, Carmen Aubert, Olivier Loupy, Alexandre Commun Med (Lond) Article BACKGROUND: Clinical decisions are mainly driven by the ability of physicians to apply risk stratification to patients. However, this task is difficult as it requires complex integration of numerous parameters and is impacted by patient heterogeneity. We sought to evaluate the ability of transplant physicians to predict the risk of long-term allograft failure and compare them to a validated artificial intelligence (AI) prediction algorithm. METHODS: We randomly selected 400 kidney transplant recipients from a qualified dataset of 4000 patients. For each patient, 44 features routinely collected during the first-year post-transplant were compiled in an electronic health record (EHR). We enrolled 9 transplant physicians at various career stages. At 1-year post-transplant, they blindly predicted the long-term graft survival with probabilities for each patient. Their predictions were compared with those of a validated prediction system (iBox). We assessed the determinants of each physician’s prediction using a random forest survival model. RESULTS: Among the 400 patients included, 84 graft failures occurred at 7 years post-evaluation. The iBox system demonstrates the best predictive performance with a discrimination of 0.79 and a median calibration error of 5.79%, while physicians tend to overestimate the risk of graft failure. Physicians’ risk predictions show wide heterogeneity with a moderate intraclass correlation of 0.58. The determinants of physicians’ prediction are disparate, with poor agreement regardless of their clinical experience. CONCLUSIONS: This study shows the overall limited performance and consistency of physicians to predict the risk of long-term graft failure, demonstrated by the superior performances of the iBox. This study supports the use of a companion tool to help physicians in their prognostic judgement and decision-making in clinical care. Nature Publishing Group UK 2022-11-23 /pmc/articles/PMC9684574/ /pubmed/36418380 http://dx.doi.org/10.1038/s43856-022-00201-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Divard, Gillian Raynaud, Marc Tatapudi, Vasishta S. Abdalla, Basmah Bailly, Elodie Assayag, Maureen Binois, Yannick Cohen, Raphael Zhang, Huanxi Ulloa, Camillo Linhares, Kamila Tedesco, Helio S. Legendre, Christophe Jouven, Xavier Montgomery, Robert A. Lefaucheur, Carmen Aubert, Olivier Loupy, Alexandre Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure |
title | Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure |
title_full | Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure |
title_fullStr | Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure |
title_full_unstemmed | Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure |
title_short | Comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure |
title_sort | comparison of artificial intelligence and human-based prediction and stratification of the risk of long-term kidney allograft failure |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9684574/ https://www.ncbi.nlm.nih.gov/pubmed/36418380 http://dx.doi.org/10.1038/s43856-022-00201-9 |
work_keys_str_mv | AT divardgillian comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT raynaudmarc comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT tatapudivasishtas comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT abdallabasmah comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT baillyelodie comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT assayagmaureen comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT binoisyannick comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT cohenraphael comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT zhanghuanxi comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT ulloacamillo comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT linhareskamila comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT tedescohelios comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT legendrechristophe comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT jouvenxavier comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT montgomeryroberta comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT lefaucheurcarmen comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT aubertolivier comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure AT loupyalexandre comparisonofartificialintelligenceandhumanbasedpredictionandstratificationoftheriskoflongtermkidneyallograftfailure |