Cargando…
Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning
PURPOSE: This article aims to assess how differences in maternal age distributions between IVF clinics affect the performance of an artificial intelligence model for embryo viability prediction and proposes a method to account for such differences. METHODS: Using retrospectively collected data from...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10440335/ https://www.ncbi.nlm.nih.gov/pubmed/37423932 http://dx.doi.org/10.1007/s10815-023-02871-3 |
_version_ | 1785093129337044992 |
---|---|
author | Johansen, Martin N. Parner, Erik T. Kragh, Mikkel F. Kato, Keiichi Ueno, Satoshi Palm, Stefan Kernbach, Manuel Balaban, Başak Keleş, İpek Gabrielsen, Anette V. Iversen, Lea H. Berntsen, Jørgen |
author_facet | Johansen, Martin N. Parner, Erik T. Kragh, Mikkel F. Kato, Keiichi Ueno, Satoshi Palm, Stefan Kernbach, Manuel Balaban, Başak Keleş, İpek Gabrielsen, Anette V. Iversen, Lea H. Berntsen, Jørgen |
author_sort | Johansen, Martin N. |
collection | PubMed |
description | PURPOSE: This article aims to assess how differences in maternal age distributions between IVF clinics affect the performance of an artificial intelligence model for embryo viability prediction and proposes a method to account for such differences. METHODS: Using retrospectively collected data from 4805 fresh and frozen single blastocyst transfers of embryos incubated for 5 to 6 days, the discriminative performance was assessed based on fetal heartbeat outcomes. The data was collected from 4 clinics, and the discrimination was measured in terms of the area under ROC curves (AUC) for each clinic. To account for the different age distributions between clinics, a method for age-standardizing the AUCs was developed in which the clinic-specific AUCs were standardized using weights for each embryo according to the relative frequency of the maternal age in the relevant clinic compared to the age distribution in a common reference population. RESULTS: There was substantial variation in the clinic-specific AUCs with estimates ranging from 0.58 to 0.69 before standardization. The age-standardization of the AUCs reduced the between-clinic variance by 16%. Most notably, three of the clinics had quite similar AUCs after standardization, while the last clinic had a markedly lower AUC both with and without standardization. CONCLUSION: The method of using age-standardization of the AUCs that is proposed in this article mitigates some of the variability between clinics. This enables a comparison of clinic-specific AUCs where the difference in age distributions is accounted for. |
format | Online Article Text |
id | pubmed-10440335 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-104403352023-08-22 Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning Johansen, Martin N. Parner, Erik T. Kragh, Mikkel F. Kato, Keiichi Ueno, Satoshi Palm, Stefan Kernbach, Manuel Balaban, Başak Keleş, İpek Gabrielsen, Anette V. Iversen, Lea H. Berntsen, Jørgen J Assist Reprod Genet Assisted Reproduction Technologies PURPOSE: This article aims to assess how differences in maternal age distributions between IVF clinics affect the performance of an artificial intelligence model for embryo viability prediction and proposes a method to account for such differences. METHODS: Using retrospectively collected data from 4805 fresh and frozen single blastocyst transfers of embryos incubated for 5 to 6 days, the discriminative performance was assessed based on fetal heartbeat outcomes. The data was collected from 4 clinics, and the discrimination was measured in terms of the area under ROC curves (AUC) for each clinic. To account for the different age distributions between clinics, a method for age-standardizing the AUCs was developed in which the clinic-specific AUCs were standardized using weights for each embryo according to the relative frequency of the maternal age in the relevant clinic compared to the age distribution in a common reference population. RESULTS: There was substantial variation in the clinic-specific AUCs with estimates ranging from 0.58 to 0.69 before standardization. The age-standardization of the AUCs reduced the between-clinic variance by 16%. Most notably, three of the clinics had quite similar AUCs after standardization, while the last clinic had a markedly lower AUC both with and without standardization. CONCLUSION: The method of using age-standardization of the AUCs that is proposed in this article mitigates some of the variability between clinics. This enables a comparison of clinic-specific AUCs where the difference in age distributions is accounted for. Springer US 2023-07-10 2023-09 /pmc/articles/PMC10440335/ /pubmed/37423932 http://dx.doi.org/10.1007/s10815-023-02871-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Assisted Reproduction Technologies Johansen, Martin N. Parner, Erik T. Kragh, Mikkel F. Kato, Keiichi Ueno, Satoshi Palm, Stefan Kernbach, Manuel Balaban, Başak Keleş, İpek Gabrielsen, Anette V. Iversen, Lea H. Berntsen, Jørgen Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning |
title | Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning |
title_full | Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning |
title_fullStr | Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning |
title_full_unstemmed | Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning |
title_short | Comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning |
title_sort | comparing performance between clinics of an embryo evaluation algorithm based on time-lapse images and machine learning |
topic | Assisted Reproduction Technologies |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10440335/ https://www.ncbi.nlm.nih.gov/pubmed/37423932 http://dx.doi.org/10.1007/s10815-023-02871-3 |
work_keys_str_mv | AT johansenmartinn comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT parnererikt comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT kraghmikkelf comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT katokeiichi comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT uenosatoshi comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT palmstefan comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT kernbachmanuel comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT balabanbasak comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT kelesipek comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT gabrielsenanettev comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT iversenleah comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning AT berntsenjørgen comparingperformancebetweenclinicsofanembryoevaluationalgorithmbasedontimelapseimagesandmachinelearning |