Cargando…
Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods
Male infertility has always been one of the important factors affecting the infertility of couples of gestational age. The reasons that affect male infertility includes living habits, hereditary factors, etc. Identifying the genetic causes of male infertility can help us understand the biology of ma...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9911419/ https://www.ncbi.nlm.nih.gov/pubmed/36778885 http://dx.doi.org/10.3389/fmicb.2023.1092143 |
_version_ | 1784884984554717184 |
---|---|
author | Jia, Xuan Yin, ZhiXiang Peng, Yu |
author_facet | Jia, Xuan Yin, ZhiXiang Peng, Yu |
author_sort | Jia, Xuan |
collection | PubMed |
description | Male infertility has always been one of the important factors affecting the infertility of couples of gestational age. The reasons that affect male infertility includes living habits, hereditary factors, etc. Identifying the genetic causes of male infertility can help us understand the biology of male infertility, as well as the diagnosis of genetic testing and the determination of clinical treatment options. While current research has made significant progress in the genes that cause sperm defects in men, genetic studies of sperm content defects are still lacking. This article is based on a dataset of gene expression data on the X chromosome in patients with azoospermia, mild and severe oligospermia. Due to the difference in the degree of disease between patients and the possible difference in genetic causes, common classical clustering methods such as k-means, hierarchical clustering, etc. cannot effectively identify samples (realize simultaneous clustering of samples and features). In this paper, we use machine learning and various statistical methods such as hypergeometric distribution, Gibbs sampling, Fisher test, etc. and genes the interaction network for cluster analysis of gene expression data of male infertility patients has certain advantages compared with existing methods. The cluster results were identified by differential co-expression analysis of gene expression data in male infertility patients, and the model recognition clusters were analyzed by multiple gene enrichment methods, showing different degrees of enrichment in various enzyme activities, cancer, virus-related, ATP and ADP production, and other pathways. At the same time, as this paper is an unsupervised analysis of genetic factors of male infertility patients, we constructed a simulated data set, in which the clustering results have been determined, which can be used to measure the effect of discriminant model recognition. Through comparison, it finds that the proposed model has a better identification effect. |
format | Online Article Text |
id | pubmed-9911419 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-99114192023-02-11 Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods Jia, Xuan Yin, ZhiXiang Peng, Yu Front Microbiol Microbiology Male infertility has always been one of the important factors affecting the infertility of couples of gestational age. The reasons that affect male infertility includes living habits, hereditary factors, etc. Identifying the genetic causes of male infertility can help us understand the biology of male infertility, as well as the diagnosis of genetic testing and the determination of clinical treatment options. While current research has made significant progress in the genes that cause sperm defects in men, genetic studies of sperm content defects are still lacking. This article is based on a dataset of gene expression data on the X chromosome in patients with azoospermia, mild and severe oligospermia. Due to the difference in the degree of disease between patients and the possible difference in genetic causes, common classical clustering methods such as k-means, hierarchical clustering, etc. cannot effectively identify samples (realize simultaneous clustering of samples and features). In this paper, we use machine learning and various statistical methods such as hypergeometric distribution, Gibbs sampling, Fisher test, etc. and genes the interaction network for cluster analysis of gene expression data of male infertility patients has certain advantages compared with existing methods. The cluster results were identified by differential co-expression analysis of gene expression data in male infertility patients, and the model recognition clusters were analyzed by multiple gene enrichment methods, showing different degrees of enrichment in various enzyme activities, cancer, virus-related, ATP and ADP production, and other pathways. At the same time, as this paper is an unsupervised analysis of genetic factors of male infertility patients, we constructed a simulated data set, in which the clustering results have been determined, which can be used to measure the effect of discriminant model recognition. Through comparison, it finds that the proposed model has a better identification effect. Frontiers Media S.A. 2023-01-27 /pmc/articles/PMC9911419/ /pubmed/36778885 http://dx.doi.org/10.3389/fmicb.2023.1092143 Text en Copyright © 2023 Jia, Yin and Peng. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Microbiology Jia, Xuan Yin, ZhiXiang Peng, Yu Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods |
title | Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods |
title_full | Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods |
title_fullStr | Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods |
title_full_unstemmed | Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods |
title_short | Gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods |
title_sort | gene differential co-expression analysis of male infertility patients based on statistical and machine learning methods |
topic | Microbiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9911419/ https://www.ncbi.nlm.nih.gov/pubmed/36778885 http://dx.doi.org/10.3389/fmicb.2023.1092143 |
work_keys_str_mv | AT jiaxuan genedifferentialcoexpressionanalysisofmaleinfertilitypatientsbasedonstatisticalandmachinelearningmethods AT yinzhixiang genedifferentialcoexpressionanalysisofmaleinfertilitypatientsbasedonstatisticalandmachinelearningmethods AT pengyu genedifferentialcoexpressionanalysisofmaleinfertilitypatientsbasedonstatisticalandmachinelearningmethods |