Cargando…

Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators

In many countries, especially developed nations, the fertility rate and birth rate have continually declined. Taiwan’s fertility rate has paralleled this trend and reached its nadir in 2022. Therefore, the government uses many strategies to encourage more married couples to have children. However, c...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Hung-Hsiang, Hsieh, Shang-Ju, Chen, Ming-Shu, Jhou, Mao-Jhen, Liu, Tzu-Chi, Shen, Hsiang-Li, Yang, Chih-Te, Hung, Chung-Chih, Yu, Ya-Yen, Lu, Chi-Jie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9917545/
https://www.ncbi.nlm.nih.gov/pubmed/36769868
http://dx.doi.org/10.3390/jcm12031220
_version_ 1784886392780750848
author Huang, Hung-Hsiang
Hsieh, Shang-Ju
Chen, Ming-Shu
Jhou, Mao-Jhen
Liu, Tzu-Chi
Shen, Hsiang-Li
Yang, Chih-Te
Hung, Chung-Chih
Yu, Ya-Yen
Lu, Chi-Jie
author_facet Huang, Hung-Hsiang
Hsieh, Shang-Ju
Chen, Ming-Shu
Jhou, Mao-Jhen
Liu, Tzu-Chi
Shen, Hsiang-Li
Yang, Chih-Te
Hung, Chung-Chih
Yu, Ya-Yen
Lu, Chi-Jie
author_sort Huang, Hung-Hsiang
collection PubMed
description In many countries, especially developed nations, the fertility rate and birth rate have continually declined. Taiwan’s fertility rate has paralleled this trend and reached its nadir in 2022. Therefore, the government uses many strategies to encourage more married couples to have children. However, couples marrying at an older age may have declining physical status, as well as hypertension and other metabolic syndrome symptoms, in addition to possibly being overweight, which have been the focus of the studies for their influences on male and female gamete quality. Many previous studies based on infertile people are not truly representative of the general population. This study proposed a framework using five machine learning (ML) predictive algorithms—random forest, stochastic gradient boosting, least absolute shrinkage and selection operator regression, ridge regression, and extreme gradient boosting—to identify the major risk factors affecting male sperm count based on a major health screening database in Taiwan. Unlike traditional multiple linear regression, ML algorithms do not need statistical assumptions and can capture non-linear relationships or complex interactions between dependent and independent variables to generate promising performance. We analyzed annual health screening data of 1375 males from 2010 to 2017, including data on health screening indicators, sourced from the MJ Group, a major health screening center in Taiwan. The symmetric mean absolute percentage error, relative absolute error, root relative squared error, and root mean squared error were used as performance evaluation metrics. Our results show that sleep time (ST), alpha-fetoprotein (AFP), body fat (BF), systolic blood pressure (SBP), and blood urea nitrogen (BUN) are the top five risk factors associated with sperm count. ST is a known risk factor influencing reproductive hormone balance, which can affect spermatogenesis and final sperm count. BF and SBP are risk factors associated with metabolic syndrome, another known risk factor of altered male reproductive hormone systems. However, AFP has not been the focus of previous studies on male fertility or semen quality. BUN, the index for kidney function, is also identified as a risk factor by our established ML model. Our results support previous findings that metabolic syndrome has negative impacts on sperm count and semen quality. Sleep duration also has an impact on sperm generation in the testes. AFP and BUN are two novel risk factors linked to sperm counts. These findings could help healthcare personnel and law makers create strategies for creating environments to increase the country’s fertility rate. This study should also be of value to follow-up research.
format Online
Article
Text
id pubmed-9917545
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99175452023-02-11 Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators Huang, Hung-Hsiang Hsieh, Shang-Ju Chen, Ming-Shu Jhou, Mao-Jhen Liu, Tzu-Chi Shen, Hsiang-Li Yang, Chih-Te Hung, Chung-Chih Yu, Ya-Yen Lu, Chi-Jie J Clin Med Article In many countries, especially developed nations, the fertility rate and birth rate have continually declined. Taiwan’s fertility rate has paralleled this trend and reached its nadir in 2022. Therefore, the government uses many strategies to encourage more married couples to have children. However, couples marrying at an older age may have declining physical status, as well as hypertension and other metabolic syndrome symptoms, in addition to possibly being overweight, which have been the focus of the studies for their influences on male and female gamete quality. Many previous studies based on infertile people are not truly representative of the general population. This study proposed a framework using five machine learning (ML) predictive algorithms—random forest, stochastic gradient boosting, least absolute shrinkage and selection operator regression, ridge regression, and extreme gradient boosting—to identify the major risk factors affecting male sperm count based on a major health screening database in Taiwan. Unlike traditional multiple linear regression, ML algorithms do not need statistical assumptions and can capture non-linear relationships or complex interactions between dependent and independent variables to generate promising performance. We analyzed annual health screening data of 1375 males from 2010 to 2017, including data on health screening indicators, sourced from the MJ Group, a major health screening center in Taiwan. The symmetric mean absolute percentage error, relative absolute error, root relative squared error, and root mean squared error were used as performance evaluation metrics. Our results show that sleep time (ST), alpha-fetoprotein (AFP), body fat (BF), systolic blood pressure (SBP), and blood urea nitrogen (BUN) are the top five risk factors associated with sperm count. ST is a known risk factor influencing reproductive hormone balance, which can affect spermatogenesis and final sperm count. BF and SBP are risk factors associated with metabolic syndrome, another known risk factor of altered male reproductive hormone systems. However, AFP has not been the focus of previous studies on male fertility or semen quality. BUN, the index for kidney function, is also identified as a risk factor by our established ML model. Our results support previous findings that metabolic syndrome has negative impacts on sperm count and semen quality. Sleep duration also has an impact on sperm generation in the testes. AFP and BUN are two novel risk factors linked to sperm counts. These findings could help healthcare personnel and law makers create strategies for creating environments to increase the country’s fertility rate. This study should also be of value to follow-up research. MDPI 2023-02-03 /pmc/articles/PMC9917545/ /pubmed/36769868 http://dx.doi.org/10.3390/jcm12031220 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Huang, Hung-Hsiang
Hsieh, Shang-Ju
Chen, Ming-Shu
Jhou, Mao-Jhen
Liu, Tzu-Chi
Shen, Hsiang-Li
Yang, Chih-Te
Hung, Chung-Chih
Yu, Ya-Yen
Lu, Chi-Jie
Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators
title Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators
title_full Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators
title_fullStr Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators
title_full_unstemmed Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators
title_short Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators
title_sort machine learning predictive models for evaluating risk factors affecting sperm count: predictions based on health screening indicators
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9917545/
https://www.ncbi.nlm.nih.gov/pubmed/36769868
http://dx.doi.org/10.3390/jcm12031220
work_keys_str_mv AT huanghunghsiang machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators
AT hsiehshangju machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators
AT chenmingshu machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators
AT jhoumaojhen machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators
AT liutzuchi machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators
AT shenhsiangli machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators
AT yangchihte machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators
AT hungchungchih machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators
AT yuyayen machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators
AT luchijie machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators