Cargando…

Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm

INTRODUCTION: Semen quality has decreased gradually in recent years, and lifestyle changes are among the primary causes for this issue. Thus far, the specific lifestyle factors affecting semen quality remain to be elucidated. MATERIALS AND METHODS: In this study, data on the following factors were c...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Mingjuan, Yao, Tianci, Li, Jian, Hui, Hui, Fan, Weimin, Guan, Yunfeng, Zhang, Aijun, Xu, Bufang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9514383/
https://www.ncbi.nlm.nih.gov/pubmed/36177329
http://dx.doi.org/10.3389/fmed.2022.811890
_version_ 1784798262584147968
author Zhou, Mingjuan
Yao, Tianci
Li, Jian
Hui, Hui
Fan, Weimin
Guan, Yunfeng
Zhang, Aijun
Xu, Bufang
author_facet Zhou, Mingjuan
Yao, Tianci
Li, Jian
Hui, Hui
Fan, Weimin
Guan, Yunfeng
Zhang, Aijun
Xu, Bufang
author_sort Zhou, Mingjuan
collection PubMed
description INTRODUCTION: Semen quality has decreased gradually in recent years, and lifestyle changes are among the primary causes for this issue. Thus far, the specific lifestyle factors affecting semen quality remain to be elucidated. MATERIALS AND METHODS: In this study, data on the following factors were collected from 5,109 men examined at our reproductive medicine center: 10 lifestyle factors that potentially affect semen quality (smoking status, alcohol consumption, staying up late, sleeplessness, consumption of pungent food, intensity of sports activity, sedentary lifestyle, working in hot conditions, sauna use in the last 3 months, and exposure to radioactivity); general factors including age, abstinence period, and season of semen examination; and comprehensive semen parameters [semen volume, sperm concentration, progressive and total sperm motility, sperm morphology, and DNA fragmentation index (DFI)]. Then, machine learning with the XGBoost algorithm was applied to establish a primary prediction model by using the collected data. Furthermore, the accuracy of the model was verified via multiple logistic regression following k-fold cross-validation analyses. RESULTS: The results indicated that for semen volume, sperm concentration, progressive and total sperm motility, and DFI, the area under the curve (AUC) values ranged from 0.648 to 0.697, while the AUC for sperm morphology was only 0.506. Among the 13 factors, smoking status was the major factor affecting semen volume, sperm concentration, and progressive and total sperm motility. Age was the most important factor affecting DFI. Logistic combined with cross-validation analysis revealed similar results. Furthermore, it showed that heavy smoking (>20 cigarettes/day) had an overall negative effect on semen volume and sperm concentration and progressive and total sperm motility (OR = 4.69, 6.97, 11.16, and 10.35, respectively), while age of >35 years was associated with increased DFI (OR = 5.47). CONCLUSION: The preliminary lifestyle-based model developed for semen quality prediction by using the XGBoost algorithm showed potential for clinical application and further optimization with larger training datasets.
format Online
Article
Text
id pubmed-9514383
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95143832022-09-28 Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm Zhou, Mingjuan Yao, Tianci Li, Jian Hui, Hui Fan, Weimin Guan, Yunfeng Zhang, Aijun Xu, Bufang Front Med (Lausanne) Medicine INTRODUCTION: Semen quality has decreased gradually in recent years, and lifestyle changes are among the primary causes for this issue. Thus far, the specific lifestyle factors affecting semen quality remain to be elucidated. MATERIALS AND METHODS: In this study, data on the following factors were collected from 5,109 men examined at our reproductive medicine center: 10 lifestyle factors that potentially affect semen quality (smoking status, alcohol consumption, staying up late, sleeplessness, consumption of pungent food, intensity of sports activity, sedentary lifestyle, working in hot conditions, sauna use in the last 3 months, and exposure to radioactivity); general factors including age, abstinence period, and season of semen examination; and comprehensive semen parameters [semen volume, sperm concentration, progressive and total sperm motility, sperm morphology, and DNA fragmentation index (DFI)]. Then, machine learning with the XGBoost algorithm was applied to establish a primary prediction model by using the collected data. Furthermore, the accuracy of the model was verified via multiple logistic regression following k-fold cross-validation analyses. RESULTS: The results indicated that for semen volume, sperm concentration, progressive and total sperm motility, and DFI, the area under the curve (AUC) values ranged from 0.648 to 0.697, while the AUC for sperm morphology was only 0.506. Among the 13 factors, smoking status was the major factor affecting semen volume, sperm concentration, and progressive and total sperm motility. Age was the most important factor affecting DFI. Logistic combined with cross-validation analysis revealed similar results. Furthermore, it showed that heavy smoking (>20 cigarettes/day) had an overall negative effect on semen volume and sperm concentration and progressive and total sperm motility (OR = 4.69, 6.97, 11.16, and 10.35, respectively), while age of >35 years was associated with increased DFI (OR = 5.47). CONCLUSION: The preliminary lifestyle-based model developed for semen quality prediction by using the XGBoost algorithm showed potential for clinical application and further optimization with larger training datasets. Frontiers Media S.A. 2022-09-13 /pmc/articles/PMC9514383/ /pubmed/36177329 http://dx.doi.org/10.3389/fmed.2022.811890 Text en Copyright © 2022 Zhou, Yao, Li, Hui, Fan, Guan, Zhang and Xu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Medicine
Zhou, Mingjuan
Yao, Tianci
Li, Jian
Hui, Hui
Fan, Weimin
Guan, Yunfeng
Zhang, Aijun
Xu, Bufang
Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm
title Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm
title_full Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm
title_fullStr Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm
title_full_unstemmed Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm
title_short Preliminary prediction of semen quality based on modifiable lifestyle factors by using the XGBoost algorithm
title_sort preliminary prediction of semen quality based on modifiable lifestyle factors by using the xgboost algorithm
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9514383/
https://www.ncbi.nlm.nih.gov/pubmed/36177329
http://dx.doi.org/10.3389/fmed.2022.811890
work_keys_str_mv AT zhoumingjuan preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm
AT yaotianci preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm
AT lijian preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm
AT huihui preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm
AT fanweimin preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm
AT guanyunfeng preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm
AT zhangaijun preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm
AT xubufang preliminarypredictionofsemenqualitybasedonmodifiablelifestylefactorsbyusingthexgboostalgorithm