Cargando…

Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods

Male infertility has always been one of the important factors affecting the infertility of couples of gestational age. The reasons that affect male infertility includes living habits, hereditary factors, etc. Identifying the genetic causes of male infertility can help us understand the biology of ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Xuan, Yin, ZhiXiang, Peng, Yu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9911419/
https://www.ncbi.nlm.nih.gov/pubmed/36778885
http://dx.doi.org/10.3389/fmicb.2023.1092143
_version_ 1784884984554717184
author Jia, Xuan
Yin, ZhiXiang
Peng, Yu
author_facet Jia, Xuan
Yin, ZhiXiang
Peng, Yu
author_sort Jia, Xuan
collection PubMed
description Male infertility has always been one of the important factors affecting the infertility of couples of gestational age. The reasons that affect male infertility includes living habits, hereditary factors, etc. Identifying the genetic causes of male infertility can help us understand the biology of male infertility, as well as the diagnosis of genetic testing and the determination of clinical treatment options. While current research has made significant progress in the genes that cause sperm defects in men, genetic studies of sperm content defects are still lacking. This article is based on a dataset of gene expression data on the X chromosome in patients with azoospermia, mild and severe oligospermia. Due to the difference in the degree of disease between patients and the possible difference in genetic causes, common classical clustering methods such as k-means, hierarchical clustering, etc. cannot effectively identify samples (realize simultaneous clustering of samples and features). In this paper, we use machine learning and various statistical methods such as hypergeometric distribution, Gibbs sampling, Fisher test, etc. and genes the interaction network for cluster analysis of gene expression data of male infertility patients has certain advantages compared with existing methods. The cluster results were identified by differential co-expression analysis of gene expression data in male infertility patients, and the model recognition clusters were analyzed by multiple gene enrichment methods, showing different degrees of enrichment in various enzyme activities, cancer, virus-related, ATP and ADP production, and other pathways. At the same time, as this paper is an unsupervised analysis of genetic factors of male infertility patients, we constructed a simulated data set, in which the clustering results have been determined, which can be used to measure the effect of discriminant model recognition. Through comparison, it finds that the proposed model has a better identification effect.
format Online
Article
Text
id pubmed-9911419
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99114192023-02-11 Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods Jia, Xuan Yin, ZhiXiang Peng, Yu Front Microbiol Microbiology Male infertility has always been one of the important factors affecting the infertility of couples of gestational age. The reasons that affect male infertility includes living habits, hereditary factors, etc. Identifying the genetic causes of male infertility can help us understand the biology of male infertility, as well as the diagnosis of genetic testing and the determination of clinical treatment options. While current research has made significant progress in the genes that cause sperm defects in men, genetic studies of sperm content defects are still lacking. This article is based on a dataset of gene expression data on the X chromosome in patients with azoospermia, mild and severe oligospermia. Due to the difference in the degree of disease between patients and the possible difference in genetic causes, common classical clustering methods such as k-means, hierarchical clustering, etc. cannot effectively identify samples (realize simultaneous clustering of samples and features). In this paper, we use machine learning and various statistical methods such as hypergeometric distribution, Gibbs sampling, Fisher test, etc. and genes the interaction network for cluster analysis of gene expression data of male infertility patients has certain advantages compared with existing methods. The cluster results were identified by differential co-expression analysis of gene expression data in male infertility patients, and the model recognition clusters were analyzed by multiple gene enrichment methods, showing different degrees of enrichment in various enzyme activities, cancer, virus-related, ATP and ADP production, and other pathways. At the same time, as this paper is an unsupervised analysis of genetic factors of male infertility patients, we constructed a simulated data set, in which the clustering results have been determined, which can be used to measure the effect of discriminant model recognition. Through comparison, it finds that the proposed model has a better identification effect. Frontiers Media S.A. 2023-01-27 /pmc/articles/PMC9911419/ /pubmed/36778885 http://dx.doi.org/10.3389/fmicb.2023.1092143 Text en Copyright © 2023 Jia, Yin and Peng. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Jia, Xuan
Yin, ZhiXiang
Peng, Yu
Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods
title Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods
title_full Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods
title_fullStr Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods
title_full_unstemmed Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods
title_short Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods
title_sort gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9911419/
https://www.ncbi.nlm.nih.gov/pubmed/36778885
http://dx.doi.org/10.3389/fmicb.2023.1092143
work_keys_str_mv AT jiaxuan genedifferentialcoexpressionanalysisofmaleinfertilitypatientsbasedonstatisticalandmachinelearningmethods
AT yinzhixiang genedifferentialcoexpressionanalysisofmaleinfertilitypatientsbasedonstatisticalandmachinelearningmethods
AT pengyu genedifferentialcoexpressionanalysisofmaleinfertilitypatientsbasedonstatisticalandmachinelearningmethods