Cargando…
Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators
In many countries, especially developed nations, the fertility rate and birth rate have continually declined. Taiwan’s fertility rate has paralleled this trend and reached its nadir in 2022. Therefore, the government uses many strategies to encourage more married couples to have children. However, c...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9917545/ https://www.ncbi.nlm.nih.gov/pubmed/36769868 http://dx.doi.org/10.3390/jcm12031220 |
_version_ | 1784886392780750848 |
---|---|
author | Huang, Hung-Hsiang Hsieh, Shang-Ju Chen, Ming-Shu Jhou, Mao-Jhen Liu, Tzu-Chi Shen, Hsiang-Li Yang, Chih-Te Hung, Chung-Chih Yu, Ya-Yen Lu, Chi-Jie |
author_facet | Huang, Hung-Hsiang Hsieh, Shang-Ju Chen, Ming-Shu Jhou, Mao-Jhen Liu, Tzu-Chi Shen, Hsiang-Li Yang, Chih-Te Hung, Chung-Chih Yu, Ya-Yen Lu, Chi-Jie |
author_sort | Huang, Hung-Hsiang |
collection | PubMed |
description | In many countries, especially developed nations, the fertility rate and birth rate have continually declined. Taiwan’s fertility rate has paralleled this trend and reached its nadir in 2022. Therefore, the government uses many strategies to encourage more married couples to have children. However, couples marrying at an older age may have declining physical status, as well as hypertension and other metabolic syndrome symptoms, in addition to possibly being overweight, which have been the focus of the studies for their influences on male and female gamete quality. Many previous studies based on infertile people are not truly representative of the general population. This study proposed a framework using five machine learning (ML) predictive algorithms—random forest, stochastic gradient boosting, least absolute shrinkage and selection operator regression, ridge regression, and extreme gradient boosting—to identify the major risk factors affecting male sperm count based on a major health screening database in Taiwan. Unlike traditional multiple linear regression, ML algorithms do not need statistical assumptions and can capture non-linear relationships or complex interactions between dependent and independent variables to generate promising performance. We analyzed annual health screening data of 1375 males from 2010 to 2017, including data on health screening indicators, sourced from the MJ Group, a major health screening center in Taiwan. The symmetric mean absolute percentage error, relative absolute error, root relative squared error, and root mean squared error were used as performance evaluation metrics. Our results show that sleep time (ST), alpha-fetoprotein (AFP), body fat (BF), systolic blood pressure (SBP), and blood urea nitrogen (BUN) are the top five risk factors associated with sperm count. ST is a known risk factor influencing reproductive hormone balance, which can affect spermatogenesis and final sperm count. BF and SBP are risk factors associated with metabolic syndrome, another known risk factor of altered male reproductive hormone systems. However, AFP has not been the focus of previous studies on male fertility or semen quality. BUN, the index for kidney function, is also identified as a risk factor by our established ML model. Our results support previous findings that metabolic syndrome has negative impacts on sperm count and semen quality. Sleep duration also has an impact on sperm generation in the testes. AFP and BUN are two novel risk factors linked to sperm counts. These findings could help healthcare personnel and law makers create strategies for creating environments to increase the country’s fertility rate. This study should also be of value to follow-up research. |
format | Online Article Text |
id | pubmed-9917545 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-99175452023-02-11 Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators Huang, Hung-Hsiang Hsieh, Shang-Ju Chen, Ming-Shu Jhou, Mao-Jhen Liu, Tzu-Chi Shen, Hsiang-Li Yang, Chih-Te Hung, Chung-Chih Yu, Ya-Yen Lu, Chi-Jie J Clin Med Article In many countries, especially developed nations, the fertility rate and birth rate have continually declined. Taiwan’s fertility rate has paralleled this trend and reached its nadir in 2022. Therefore, the government uses many strategies to encourage more married couples to have children. However, couples marrying at an older age may have declining physical status, as well as hypertension and other metabolic syndrome symptoms, in addition to possibly being overweight, which have been the focus of the studies for their influences on male and female gamete quality. Many previous studies based on infertile people are not truly representative of the general population. This study proposed a framework using five machine learning (ML) predictive algorithms—random forest, stochastic gradient boosting, least absolute shrinkage and selection operator regression, ridge regression, and extreme gradient boosting—to identify the major risk factors affecting male sperm count based on a major health screening database in Taiwan. Unlike traditional multiple linear regression, ML algorithms do not need statistical assumptions and can capture non-linear relationships or complex interactions between dependent and independent variables to generate promising performance. We analyzed annual health screening data of 1375 males from 2010 to 2017, including data on health screening indicators, sourced from the MJ Group, a major health screening center in Taiwan. The symmetric mean absolute percentage error, relative absolute error, root relative squared error, and root mean squared error were used as performance evaluation metrics. Our results show that sleep time (ST), alpha-fetoprotein (AFP), body fat (BF), systolic blood pressure (SBP), and blood urea nitrogen (BUN) are the top five risk factors associated with sperm count. ST is a known risk factor influencing reproductive hormone balance, which can affect spermatogenesis and final sperm count. BF and SBP are risk factors associated with metabolic syndrome, another known risk factor of altered male reproductive hormone systems. However, AFP has not been the focus of previous studies on male fertility or semen quality. BUN, the index for kidney function, is also identified as a risk factor by our established ML model. Our results support previous findings that metabolic syndrome has negative impacts on sperm count and semen quality. Sleep duration also has an impact on sperm generation in the testes. AFP and BUN are two novel risk factors linked to sperm counts. These findings could help healthcare personnel and law makers create strategies for creating environments to increase the country’s fertility rate. This study should also be of value to follow-up research. MDPI 2023-02-03 /pmc/articles/PMC9917545/ /pubmed/36769868 http://dx.doi.org/10.3390/jcm12031220 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Huang, Hung-Hsiang Hsieh, Shang-Ju Chen, Ming-Shu Jhou, Mao-Jhen Liu, Tzu-Chi Shen, Hsiang-Li Yang, Chih-Te Hung, Chung-Chih Yu, Ya-Yen Lu, Chi-Jie Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators |
title | Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators |
title_full | Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators |
title_fullStr | Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators |
title_full_unstemmed | Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators |
title_short | Machine Learning Predictive Models for Evaluating Risk Factors Affecting Sperm Count: Predictions Based on Health Screening Indicators |
title_sort | machine learning predictive models for evaluating risk factors affecting sperm count: predictions based on health screening indicators |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9917545/ https://www.ncbi.nlm.nih.gov/pubmed/36769868 http://dx.doi.org/10.3390/jcm12031220 |
work_keys_str_mv | AT huanghunghsiang machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators AT hsiehshangju machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators AT chenmingshu machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators AT jhoumaojhen machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators AT liutzuchi machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators AT shenhsiangli machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators AT yangchihte machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators AT hungchungchih machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators AT yuyayen machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators AT luchijie machinelearningpredictivemodelsforevaluatingriskfactorsaffectingspermcountpredictionsbasedonhealthscreeningindicators |