Cargando…

Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network

Polycystic ovary syndrome (PCOS) is one of the most common metabolic and reproductive endocrinopathies. However, few studies have tried to develop a diagnostic model based on gene biomarkers. In this study, we applied a computational method by combining two machine learning algorithms, including ran...

Descripción completa

Detalles Bibliográficos
Autores principales: Xie, Ning-Ning, Wang, Fang-Fang, Zhou, Jue, Liu, Chang, Qu, Fan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7455828/
https://www.ncbi.nlm.nih.gov/pubmed/32884937
http://dx.doi.org/10.1155/2020/2613091
_version_ 1783575697460035584
author Xie, Ning-Ning
Wang, Fang-Fang
Zhou, Jue
Liu, Chang
Qu, Fan
author_facet Xie, Ning-Ning
Wang, Fang-Fang
Zhou, Jue
Liu, Chang
Qu, Fan
author_sort Xie, Ning-Ning
collection PubMed
description Polycystic ovary syndrome (PCOS) is one of the most common metabolic and reproductive endocrinopathies. However, few studies have tried to develop a diagnostic model based on gene biomarkers. In this study, we applied a computational method by combining two machine learning algorithms, including random forest (RF) and artificial neural network (ANN), to identify gene biomarkers and construct diagnostic model. We collected gene expression data from Gene Expression Omnibus (GEO) database containing 76 PCOS samples and 57 normal samples; five datasets were utilized, including one dataset for screening differentially expressed genes (DEGs), two training datasets, and two validation datasets. Firstly, based on RF, 12 key genes in 264 DEGs were identified to be vital for classification of PCOS and normal samples. Moreover, the weights of these key genes were calculated using ANN with microarray and RNA-seq training dataset, respectively. Furthermore, the diagnostic models for two types of datasets were developed and named neuralPCOS. Finally, two validation datasets were used to test and compare the performance of neuralPCOS with other two set of marker genes by area under curve (AUC). Our model achieved an AUC of 0.7273 in microarray dataset, and 0.6488 in RNA-seq dataset. To conclude, we uncovered gene biomarkers and developed a novel diagnostic model of PCOS, which would be helpful for diagnosis.
format Online
Article
Text
id pubmed-7455828
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-74558282020-09-02 Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network Xie, Ning-Ning Wang, Fang-Fang Zhou, Jue Liu, Chang Qu, Fan Biomed Res Int Research Article Polycystic ovary syndrome (PCOS) is one of the most common metabolic and reproductive endocrinopathies. However, few studies have tried to develop a diagnostic model based on gene biomarkers. In this study, we applied a computational method by combining two machine learning algorithms, including random forest (RF) and artificial neural network (ANN), to identify gene biomarkers and construct diagnostic model. We collected gene expression data from Gene Expression Omnibus (GEO) database containing 76 PCOS samples and 57 normal samples; five datasets were utilized, including one dataset for screening differentially expressed genes (DEGs), two training datasets, and two validation datasets. Firstly, based on RF, 12 key genes in 264 DEGs were identified to be vital for classification of PCOS and normal samples. Moreover, the weights of these key genes were calculated using ANN with microarray and RNA-seq training dataset, respectively. Furthermore, the diagnostic models for two types of datasets were developed and named neuralPCOS. Finally, two validation datasets were used to test and compare the performance of neuralPCOS with other two set of marker genes by area under curve (AUC). Our model achieved an AUC of 0.7273 in microarray dataset, and 0.6488 in RNA-seq dataset. To conclude, we uncovered gene biomarkers and developed a novel diagnostic model of PCOS, which would be helpful for diagnosis. Hindawi 2020-08-20 /pmc/articles/PMC7455828/ /pubmed/32884937 http://dx.doi.org/10.1155/2020/2613091 Text en Copyright © 2020 Ning-Ning Xie et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xie, Ning-Ning
Wang, Fang-Fang
Zhou, Jue
Liu, Chang
Qu, Fan
Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network
title Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network
title_full Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network
title_fullStr Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network
title_full_unstemmed Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network
title_short Establishment and Analysis of a Combined Diagnostic Model of Polycystic Ovary Syndrome with Random Forest and Artificial Neural Network
title_sort establishment and analysis of a combined diagnostic model of polycystic ovary syndrome with random forest and artificial neural network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7455828/
https://www.ncbi.nlm.nih.gov/pubmed/32884937
http://dx.doi.org/10.1155/2020/2613091
work_keys_str_mv AT xieningning establishmentandanalysisofacombineddiagnosticmodelofpolycysticovarysyndromewithrandomforestandartificialneuralnetwork
AT wangfangfang establishmentandanalysisofacombineddiagnosticmodelofpolycysticovarysyndromewithrandomforestandartificialneuralnetwork
AT zhoujue establishmentandanalysisofacombineddiagnosticmodelofpolycysticovarysyndromewithrandomforestandartificialneuralnetwork
AT liuchang establishmentandanalysisofacombineddiagnosticmodelofpolycysticovarysyndromewithrandomforestandartificialneuralnetwork
AT qufan establishmentandanalysisofacombineddiagnosticmodelofpolycysticovarysyndromewithrandomforestandartificialneuralnetwork