Cargando…
Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease
Non-alcoholic fatty liver disease (NAFLD) comprises a range of chronic liver diseases that result from the accumulation of excess triglycerides in the liver, and which, in its early phases, is categorized NAFLD, or hepato-steatosis with pure fatty liver. The mortality rate of non-alcoholic steatohep...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665370/ https://www.ncbi.nlm.nih.gov/pubmed/37993474 http://dx.doi.org/10.1038/s41598-023-46711-x |
_version_ | 1785148855173513216 |
---|---|
author | Nazari, Elham Khalili-Tanha, Ghazaleh Asadnia, Alireza Pourali, Ghazaleh Maftooh, Mina Khazaei, Majid Nasiri, Mohammadreza Hassanian, Seyed Mahdi Ghayour-Mobarhan, Majid Ferns, Gordon A. Kiani, Mohammad Ali Avan, Amir |
author_facet | Nazari, Elham Khalili-Tanha, Ghazaleh Asadnia, Alireza Pourali, Ghazaleh Maftooh, Mina Khazaei, Majid Nasiri, Mohammadreza Hassanian, Seyed Mahdi Ghayour-Mobarhan, Majid Ferns, Gordon A. Kiani, Mohammad Ali Avan, Amir |
author_sort | Nazari, Elham |
collection | PubMed |
description | Non-alcoholic fatty liver disease (NAFLD) comprises a range of chronic liver diseases that result from the accumulation of excess triglycerides in the liver, and which, in its early phases, is categorized NAFLD, or hepato-steatosis with pure fatty liver. The mortality rate of non-alcoholic steatohepatitis (NASH) is more than NAFLD; therefore, diagnosing the disease in its early stages may decrease liver damage and increase the survival rate. In the current study, we screened the gene expression data of NAFLD patients and control samples from the public dataset GEO to detect DEGs. Then, the correlation betweenbetween the top selected DEGs and clinical data was evaluated. In the present study, two GEO datasets (GSE48452, GSE126848) were downloaded. The dysregulated expressed genes (DEGs) were identified by machine learning methods (Penalize regression models). Then, the shared DEGs between the two training datasets were validated using validation datasets. ROC-curve analysis was used to identify diagnostic markers. R software analyzed the interactions between DEGs, clinical data, and fatty liver. Ten novel genes, including ABCF1, SART3, APC5, NONO, KAT7, ZPR1, RABGAP1, SLC7A8, SPAG9, and KAT6A were found to have a differential expression between NAFLD and healthy individuals. Based on validation results and ROC analysis, NR4A2 and IGFBP1b were identified as diagnostic markers. These key genes may be predictive markers for the development of fatty liver. It is recommended that these key genes are assessed further as possible predictive markers during the development of fatty liver. |
format | Online Article Text |
id | pubmed-10665370 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-106653702023-11-22 Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease Nazari, Elham Khalili-Tanha, Ghazaleh Asadnia, Alireza Pourali, Ghazaleh Maftooh, Mina Khazaei, Majid Nasiri, Mohammadreza Hassanian, Seyed Mahdi Ghayour-Mobarhan, Majid Ferns, Gordon A. Kiani, Mohammad Ali Avan, Amir Sci Rep Article Non-alcoholic fatty liver disease (NAFLD) comprises a range of chronic liver diseases that result from the accumulation of excess triglycerides in the liver, and which, in its early phases, is categorized NAFLD, or hepato-steatosis with pure fatty liver. The mortality rate of non-alcoholic steatohepatitis (NASH) is more than NAFLD; therefore, diagnosing the disease in its early stages may decrease liver damage and increase the survival rate. In the current study, we screened the gene expression data of NAFLD patients and control samples from the public dataset GEO to detect DEGs. Then, the correlation betweenbetween the top selected DEGs and clinical data was evaluated. In the present study, two GEO datasets (GSE48452, GSE126848) were downloaded. The dysregulated expressed genes (DEGs) were identified by machine learning methods (Penalize regression models). Then, the shared DEGs between the two training datasets were validated using validation datasets. ROC-curve analysis was used to identify diagnostic markers. R software analyzed the interactions between DEGs, clinical data, and fatty liver. Ten novel genes, including ABCF1, SART3, APC5, NONO, KAT7, ZPR1, RABGAP1, SLC7A8, SPAG9, and KAT6A were found to have a differential expression between NAFLD and healthy individuals. Based on validation results and ROC analysis, NR4A2 and IGFBP1b were identified as diagnostic markers. These key genes may be predictive markers for the development of fatty liver. It is recommended that these key genes are assessed further as possible predictive markers during the development of fatty liver. Nature Publishing Group UK 2023-11-22 /pmc/articles/PMC10665370/ /pubmed/37993474 http://dx.doi.org/10.1038/s41598-023-46711-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Nazari, Elham Khalili-Tanha, Ghazaleh Asadnia, Alireza Pourali, Ghazaleh Maftooh, Mina Khazaei, Majid Nasiri, Mohammadreza Hassanian, Seyed Mahdi Ghayour-Mobarhan, Majid Ferns, Gordon A. Kiani, Mohammad Ali Avan, Amir Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease |
title | Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease |
title_full | Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease |
title_fullStr | Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease |
title_full_unstemmed | Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease |
title_short | Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease |
title_sort | bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665370/ https://www.ncbi.nlm.nih.gov/pubmed/37993474 http://dx.doi.org/10.1038/s41598-023-46711-x |
work_keys_str_mv | AT nazarielham bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT khalilitanhaghazaleh bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT asadniaalireza bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT pouralighazaleh bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT maftoohmina bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT khazaeimajid bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT nasirimohammadreza bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT hassanianseyedmahdi bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT ghayourmobarhanmajid bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT fernsgordona bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT kianimohammadali bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease AT avanamir bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease |