Cargando…

Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning

PURPOSE: This article aims to assess how differences in maternal age distributions between IVF clinics affect the performance of an artificial intelligence model for embryo viability prediction and proposes a method to account for such differences. METHODS: Using retrospectively collected data from...

Descripción completa

Detalles Bibliográficos
Autores principales: Johansen, Martin N., Parner, Erik T., Kragh, Mikkel F., Kato, Keiichi, Ueno, Satoshi, Palm, Stefan, Kernbach, Manuel, Balaban, Başak, Keleş, İpek, Gabrielsen, Anette V., Iversen, Lea H., Berntsen, Jørgen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10440335/
https://www.ncbi.nlm.nih.gov/pubmed/37423932
http://dx.doi.org/10.1007/s10815-023-02871-3
_version_ 1785093129337044992
author Johansen, Martin N.
Parner, Erik T.
Kragh, Mikkel F.
Kato, Keiichi
Ueno, Satoshi
Palm, Stefan
Kernbach, Manuel
Balaban, Başak
Keleş, İpek
Gabrielsen, Anette V.
Iversen, Lea H.
Berntsen, Jørgen
author_facet Johansen, Martin N.
Parner, Erik T.
Kragh, Mikkel F.
Kato, Keiichi
Ueno, Satoshi
Palm, Stefan
Kernbach, Manuel
Balaban, Başak
Keleş, İpek
Gabrielsen, Anette V.
Iversen, Lea H.
Berntsen, Jørgen
author_sort Johansen, Martin N.
collection PubMed
description PURPOSE: This article aims to assess how differences in maternal age distributions between IVF clinics affect the performance of an artificial intelligence model for embryo viability prediction and proposes a method to account for such differences. METHODS: Using retrospectively collected data from 4805 fresh and frozen single blastocyst transfers of embryos incubated for 5 to 6 days, the discriminative performance was assessed based on fetal heartbeat outcomes. The data was collected from 4 clinics, and the discrimination was measured in terms of the area under ROC curves (AUC) for each clinic. To account for the different age distributions between clinics, a method for age-standardizing the AUCs was developed in which the clinic-specific AUCs were standardized using weights for each embryo according to the relative frequency of the maternal age in the relevant clinic compared to the age distribution in a common reference population. RESULTS: There was substantial variation in the clinic-specific AUCs with estimates ranging from 0.58 to 0.69 before standardization. The age-standardization of the AUCs reduced the between-clinic variance by 16%. Most notably, three of the clinics had quite similar AUCs after standardization, while the last clinic had a markedly lower AUC both with and without standardization. CONCLUSION: The method of using age-standardization of the AUCs that is proposed in this article mitigates some of the variability between clinics. This enables a comparison of clinic-specific AUCs where the difference in age distributions is accounted for.
format Online
Article
Text
id pubmed-10440335
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-104403352023-08-22 Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning Johansen, Martin N. Parner, Erik T. Kragh, Mikkel F. Kato, Keiichi Ueno, Satoshi Palm, Stefan Kernbach, Manuel Balaban, Başak Keleş, İpek Gabrielsen, Anette V. Iversen, Lea H. Berntsen, Jørgen J Assist Reprod Genet Assisted Reproduction Technologies PURPOSE: This article aims to assess how differences in maternal age distributions between IVF clinics affect the performance of an artificial intelligence model for embryo viability prediction and proposes a method to account for such differences. METHODS: Using retrospectively collected data from 4805 fresh and frozen single blastocyst transfers of embryos incubated for 5 to 6 days, the discriminative performance was assessed based on fetal heartbeat outcomes. The data was collected from 4 clinics, and the discrimination was measured in terms of the area under ROC curves (AUC) for each clinic. To account for the different age distributions between clinics, a method for age-standardizing the AUCs was developed in which the clinic-specific AUCs were standardized using weights for each embryo according to the relative frequency of the maternal age in the relevant clinic compared to the age distribution in a common reference population. RESULTS: There was substantial variation in the clinic-specific AUCs with estimates ranging from 0.58 to 0.69 before standardization. The age-standardization of the AUCs reduced the between-clinic variance by 16%. Most notably, three of the clinics had quite similar AUCs after standardization, while the last clinic had a markedly lower AUC both with and without standardization. CONCLUSION: The method of using age-standardization of the AUCs that is proposed in this article mitigates some of the variability between clinics. This enables a comparison of clinic-specific AUCs where the difference in age distributions is accounted for. Springer US 2023-07-10 2023-09 /pmc/articles/PMC10440335/ /pubmed/37423932 http://dx.doi.org/10.1007/s10815-023-02871-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Assisted Reproduction Technologies
Johansen, Martin N.
Parner, Erik T.
Kragh, Mikkel F.
Kato, Keiichi
Ueno, Satoshi
Palm, Stefan
Kernbach, Manuel
Balaban, Başak
Keleş, İpek
Gabrielsen, Anette V.
Iversen, Lea H.
Berntsen, Jørgen
Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning
title Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning
title_full Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning
title_fullStr Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning
title_full_unstemmed Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning
title_short Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning
title_sort comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning
topic Assisted Reproduction Technologies
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10440335/
https://www.ncbi.nlm.nih.gov/pubmed/37423932
http://dx.doi.org/10.1007/s10815-023-02871-3
work_keys_str_mv AT johansenmartinn comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT parnererikt comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT kraghmikkelf comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT katokeiichi comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT uenosatoshi comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT palmstefan comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT kernbachmanuel comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT balabanbasak comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT kelesipek comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT gabrielsenanettev comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT iversenleah comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning
AT berntsenjørgen comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning