Cargando…
Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm
INTRODUCTION: Semen quality has decreased gradually in recent years, and lifestyle changes are among the primary causes for this issue. Thus far, the specific lifestyle factors affecting semen quality remain to be elucidated. MATERIALS AND METHODS: In this study, data on the following factors were c...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9514383/ https://www.ncbi.nlm.nih.gov/pubmed/36177329 http://dx.doi.org/10.3389/fmed.2022.811890 |
_version_ | 1784798262584147968 |
---|---|
author | Zhou, Mingjuan Yao, Tianci Li, Jian Hui, Hui Fan, Weimin Guan, Yunfeng Zhang, Aijun Xu, Bufang |
author_facet | Zhou, Mingjuan Yao, Tianci Li, Jian Hui, Hui Fan, Weimin Guan, Yunfeng Zhang, Aijun Xu, Bufang |
author_sort | Zhou, Mingjuan |
collection | PubMed |
description | INTRODUCTION: Semen quality has decreased gradually in recent years, and lifestyle changes are among the primary causes for this issue. Thus far, the specific lifestyle factors affecting semen quality remain to be elucidated. MATERIALS AND METHODS: In this study, data on the following factors were collected from 5,109 men examined at our reproductive medicine center: 10 lifestyle factors that potentially affect semen quality (smoking status, alcohol consumption, staying up late, sleeplessness, consumption of pungent food, intensity of sports activity, sedentary lifestyle, working in hot conditions, sauna use in the last 3 months, and exposure to radioactivity); general factors including age, abstinence period, and season of semen examination; and comprehensive semen parameters [semen volume, sperm concentration, progressive and total sperm motility, sperm morphology, and DNA fragmentation index (DFI)]. Then, machine learning with the XGBoost algorithm was applied to establish a primary prediction model by using the collected data. Furthermore, the accuracy of the model was verified via multiple logistic regression following k-fold cross-validation analyses. RESULTS: The results indicated that for semen volume, sperm concentration, progressive and total sperm motility, and DFI, the area under the curve (AUC) values ranged from 0.648 to 0.697, while the AUC for sperm morphology was only 0.506. Among the 13 factors, smoking status was the major factor affecting semen volume, sperm concentration, and progressive and total sperm motility. Age was the most important factor affecting DFI. Logistic combined with cross-validation analysis revealed similar results. Furthermore, it showed that heavy smoking (>20 cigarettes/day) had an overall negative effect on semen volume and sperm concentration and progressive and total sperm motility (OR = 4.69, 6.97, 11.16, and 10.35, respectively), while age of >35 years was associated with increased DFI (OR = 5.47). CONCLUSION: The preliminary lifestyle-based model developed for semen quality prediction by using the XGBoost algorithm showed potential for clinical application and further optimization with larger training datasets. |
format | Online Article Text |
id | pubmed-9514383 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95143832022-09-28 Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm Zhou, Mingjuan Yao, Tianci Li, Jian Hui, Hui Fan, Weimin Guan, Yunfeng Zhang, Aijun Xu, Bufang Front Med (Lausanne) Medicine INTRODUCTION: Semen quality has decreased gradually in recent years, and lifestyle changes are among the primary causes for this issue. Thus far, the specific lifestyle factors affecting semen quality remain to be elucidated. MATERIALS AND METHODS: In this study, data on the following factors were collected from 5,109 men examined at our reproductive medicine center: 10 lifestyle factors that potentially affect semen quality (smoking status, alcohol consumption, staying up late, sleeplessness, consumption of pungent food, intensity of sports activity, sedentary lifestyle, working in hot conditions, sauna use in the last 3 months, and exposure to radioactivity); general factors including age, abstinence period, and season of semen examination; and comprehensive semen parameters [semen volume, sperm concentration, progressive and total sperm motility, sperm morphology, and DNA fragmentation index (DFI)]. Then, machine learning with the XGBoost algorithm was applied to establish a primary prediction model by using the collected data. Furthermore, the accuracy of the model was verified via multiple logistic regression following k-fold cross-validation analyses. RESULTS: The results indicated that for semen volume, sperm concentration, progressive and total sperm motility, and DFI, the area under the curve (AUC) values ranged from 0.648 to 0.697, while the AUC for sperm morphology was only 0.506. Among the 13 factors, smoking status was the major factor affecting semen volume, sperm concentration, and progressive and total sperm motility. Age was the most important factor affecting DFI. Logistic combined with cross-validation analysis revealed similar results. Furthermore, it showed that heavy smoking (>20 cigarettes/day) had an overall negative effect on semen volume and sperm concentration and progressive and total sperm motility (OR = 4.69, 6.97, 11.16, and 10.35, respectively), while age of >35 years was associated with increased DFI (OR = 5.47). CONCLUSION: The preliminary lifestyle-based model developed for semen quality prediction by using the XGBoost algorithm showed potential for clinical application and further optimization with larger training datasets. Frontiers Media S.A. 2022-09-13 /pmc/articles/PMC9514383/ /pubmed/36177329 http://dx.doi.org/10.3389/fmed.2022.811890 Text en Copyright © 2022 Zhou, Yao, Li, Hui, Fan, Guan, Zhang and Xu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Medicine Zhou, Mingjuan Yao, Tianci Li, Jian Hui, Hui Fan, Weimin Guan, Yunfeng Zhang, Aijun Xu, Bufang Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm |
title | Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm |
title_full | Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm |
title_fullStr | Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm |
title_full_unstemmed | Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm |
title_short | Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm |
title_sort | preliminary prediction of semen quality based on modifiable lifestyle factors by using the xgboost algorithm |
topic | Medicine |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9514383/ https://www.ncbi.nlm.nih.gov/pubmed/36177329 http://dx.doi.org/10.3389/fmed.2022.811890 |
work_keys_str_mv | AT zhoumingjuan preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm AT yaotianci preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm AT lijian preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm AT huihui preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm AT fanweimin preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm AT guanyunfeng preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm AT zhangaijun preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm AT xubufang preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm |