Cargando…
Assessment and quantification of ovarian reserve on the basis of machine learning models
BACKGROUND: Early detection of ovarian aging is of huge importance, although no ideal marker or acknowledged evaluation system exists. The purpose of this study was to develop a better prediction model to assess and quantify ovarian reserve using machine learning methods. METHODS: This is a multicen...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10050589/ https://www.ncbi.nlm.nih.gov/pubmed/37008906 http://dx.doi.org/10.3389/fendo.2023.1087429 |
_version_ | 1785014673544839168 |
---|---|
author | Ding, Ting Ren, Wu Wang, Tian Han, Yun Ma, Wenqing Wang, Man Fu, Fangfang Li, Yan Wang, Shixuan |
author_facet | Ding, Ting Ren, Wu Wang, Tian Han, Yun Ma, Wenqing Wang, Man Fu, Fangfang Li, Yan Wang, Shixuan |
author_sort | Ding, Ting |
collection | PubMed |
description | BACKGROUND: Early detection of ovarian aging is of huge importance, although no ideal marker or acknowledged evaluation system exists. The purpose of this study was to develop a better prediction model to assess and quantify ovarian reserve using machine learning methods. METHODS: This is a multicenter, nationwide population-based study including a total of 1,020 healthy women. For these healthy women, their ovarian reserve was quantified in the form of ovarian age, which was assumed equal to their chronological age, and least absolute shrinkage and selection operator (LASSO) regression was used to select features to construct models. Seven machine learning methods, namely artificial neural network (ANN), support vector machine (SVM), generalized linear model (GLM), K-nearest neighbors regression (KNN), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and light gradient boosting machine (LightGBM) were applied to construct prediction models separately. Pearson’s correlation coefficient (PCC), mean absolute error (MAE), and mean squared error (MSE) were used to compare the efficiency and stability of these models. RESULTS: Anti-Müllerian hormone (AMH) and antral follicle count (AFC) were detected to have the highest absolute PCC values of 0.45 and 0.43 with age and held similar age distribution curves. The LightGBM model was thought to be the most suitable model for ovarian age after ranking analysis, combining PCC, MAE, and MSE values. The LightGBM model obtained PCC values of 0.82, 0.56, and 0.70 for the training set, the test set, and the entire dataset, respectively. The LightGBM method still held the lowest MAE and cross-validated MSE values. Further, in two different age groups (20–35 and >35 years), the LightGBM model also obtained the lowest MAE value of 2.88 for women between the ages of 20 and 35 years and the second lowest MAE value of 5.12 for women over the age of 35 years. CONCLUSION: Machine learning methods combining multi-features were reliable in assessing and quantifying ovarian reserve, and the LightGBM method turned out to be the approach with the best result, especially in the child-bearing age group of 20 to 35 years. |
format | Online Article Text |
id | pubmed-10050589 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-100505892023-03-30 Assessment and quantification of ovarian reserve on the basis of machine learning models Ding, Ting Ren, Wu Wang, Tian Han, Yun Ma, Wenqing Wang, Man Fu, Fangfang Li, Yan Wang, Shixuan Front Endocrinol (Lausanne) Endocrinology BACKGROUND: Early detection of ovarian aging is of huge importance, although no ideal marker or acknowledged evaluation system exists. The purpose of this study was to develop a better prediction model to assess and quantify ovarian reserve using machine learning methods. METHODS: This is a multicenter, nationwide population-based study including a total of 1,020 healthy women. For these healthy women, their ovarian reserve was quantified in the form of ovarian age, which was assumed equal to their chronological age, and least absolute shrinkage and selection operator (LASSO) regression was used to select features to construct models. Seven machine learning methods, namely artificial neural network (ANN), support vector machine (SVM), generalized linear model (GLM), K-nearest neighbors regression (KNN), gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), and light gradient boosting machine (LightGBM) were applied to construct prediction models separately. Pearson’s correlation coefficient (PCC), mean absolute error (MAE), and mean squared error (MSE) were used to compare the efficiency and stability of these models. RESULTS: Anti-Müllerian hormone (AMH) and antral follicle count (AFC) were detected to have the highest absolute PCC values of 0.45 and 0.43 with age and held similar age distribution curves. The LightGBM model was thought to be the most suitable model for ovarian age after ranking analysis, combining PCC, MAE, and MSE values. The LightGBM model obtained PCC values of 0.82, 0.56, and 0.70 for the training set, the test set, and the entire dataset, respectively. The LightGBM method still held the lowest MAE and cross-validated MSE values. Further, in two different age groups (20–35 and >35 years), the LightGBM model also obtained the lowest MAE value of 2.88 for women between the ages of 20 and 35 years and the second lowest MAE value of 5.12 for women over the age of 35 years. CONCLUSION: Machine learning methods combining multi-features were reliable in assessing and quantifying ovarian reserve, and the LightGBM method turned out to be the approach with the best result, especially in the child-bearing age group of 20 to 35 years. Frontiers Media S.A. 2023-03-15 /pmc/articles/PMC10050589/ /pubmed/37008906 http://dx.doi.org/10.3389/fendo.2023.1087429 Text en Copyright © 2023 Ding, Ren, Wang, Han, Ma, Wang, Fu, Li and Wang https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Endocrinology Ding, Ting Ren, Wu Wang, Tian Han, Yun Ma, Wenqing Wang, Man Fu, Fangfang Li, Yan Wang, Shixuan Assessment and quantification of ovarian reserve on the basis of machine learning models |
title | Assessment and quantification of ovarian reserve on the basis of machine learning models |
title_full | Assessment and quantification of ovarian reserve on the basis of machine learning models |
title_fullStr | Assessment and quantification of ovarian reserve on the basis of machine learning models |
title_full_unstemmed | Assessment and quantification of ovarian reserve on the basis of machine learning models |
title_short | Assessment and quantification of ovarian reserve on the basis of machine learning models |
title_sort | assessment and quantification of ovarian reserve on the basis of machine learning models |
topic | Endocrinology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10050589/ https://www.ncbi.nlm.nih.gov/pubmed/37008906 http://dx.doi.org/10.3389/fendo.2023.1087429 |
work_keys_str_mv | AT dingting assessmentandquantificationofovarianreserveonthebasisofmachinelearningmodels AT renwu assessmentandquantificationofovarianreserveonthebasisofmachinelearningmodels AT wangtian assessmentandquantificationofovarianreserveonthebasisofmachinelearningmodels AT hanyun assessmentandquantificationofovarianreserveonthebasisofmachinelearningmodels AT mawenqing assessmentandquantificationofovarianreserveonthebasisofmachinelearningmodels AT wangman assessmentandquantificationofovarianreserveonthebasisofmachinelearningmodels AT fufangfang assessmentandquantificationofovarianreserveonthebasisofmachinelearningmodels AT liyan assessmentandquantificationofovarianreserveonthebasisofmachinelearningmodels AT wangshixuan assessmentandquantificationofovarianreserveonthebasisofmachinelearningmodels |