Cargando…

Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease

Non-alcoholic fatty liver disease (NAFLD) comprises a range of chronic liver diseases that result from the accumulation of excess triglycerides in the liver, and which, in its early phases, is categorized NAFLD, or hepato-steatosis with pure fatty liver. The mortality rate of non-alcoholic steatohep...

Descripción completa

Detalles Bibliográficos
Autores principales: Nazari, Elham, Khalili-Tanha, Ghazaleh, Asadnia, Alireza, Pourali, Ghazaleh, Maftooh, Mina, Khazaei, Majid, Nasiri, Mohammadreza, Hassanian, Seyed Mahdi, Ghayour-Mobarhan, Majid, Ferns, Gordon A., Kiani, Mohammad Ali, Avan, Amir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665370/
https://www.ncbi.nlm.nih.gov/pubmed/37993474
http://dx.doi.org/10.1038/s41598-023-46711-x
_version_ 1785148855173513216
author Nazari, Elham
Khalili-Tanha, Ghazaleh
Asadnia, Alireza
Pourali, Ghazaleh
Maftooh, Mina
Khazaei, Majid
Nasiri, Mohammadreza
Hassanian, Seyed Mahdi
Ghayour-Mobarhan, Majid
Ferns, Gordon A.
Kiani, Mohammad Ali
Avan, Amir
author_facet Nazari, Elham
Khalili-Tanha, Ghazaleh
Asadnia, Alireza
Pourali, Ghazaleh
Maftooh, Mina
Khazaei, Majid
Nasiri, Mohammadreza
Hassanian, Seyed Mahdi
Ghayour-Mobarhan, Majid
Ferns, Gordon A.
Kiani, Mohammad Ali
Avan, Amir
author_sort Nazari, Elham
collection PubMed
description Non-alcoholic fatty liver disease (NAFLD) comprises a range of chronic liver diseases that result from the accumulation of excess triglycerides in the liver, and which, in its early phases, is categorized NAFLD, or hepato-steatosis with pure fatty liver. The mortality rate of non-alcoholic steatohepatitis (NASH) is more than NAFLD; therefore, diagnosing the disease in its early stages may decrease liver damage and increase the survival rate. In the current study, we screened the gene expression data of NAFLD patients and control samples from the public dataset GEO to detect DEGs. Then, the correlation betweenbetween the top selected DEGs and clinical data was evaluated. In the present study, two GEO datasets (GSE48452, GSE126848) were downloaded. The dysregulated expressed genes (DEGs) were identified by machine learning methods (Penalize regression models). Then, the shared DEGs between the two training datasets were validated using validation datasets. ROC-curve analysis was used to identify diagnostic markers. R software analyzed the interactions between DEGs, clinical data, and fatty liver. Ten novel genes, including ABCF1, SART3, APC5, NONO, KAT7, ZPR1, RABGAP1, SLC7A8, SPAG9, and KAT6A were found to have a differential expression between NAFLD and healthy individuals. Based on validation results and ROC analysis, NR4A2 and IGFBP1b were identified as diagnostic markers. These key genes may be predictive markers for the development of fatty liver. It is recommended that these key genes are assessed further as possible predictive markers during the development of fatty liver.
format Online
Article
Text
id pubmed-10665370
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106653702023-11-22 Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease Nazari, Elham Khalili-Tanha, Ghazaleh Asadnia, Alireza Pourali, Ghazaleh Maftooh, Mina Khazaei, Majid Nasiri, Mohammadreza Hassanian, Seyed Mahdi Ghayour-Mobarhan, Majid Ferns, Gordon A. Kiani, Mohammad Ali Avan, Amir Sci Rep Article Non-alcoholic fatty liver disease (NAFLD) comprises a range of chronic liver diseases that result from the accumulation of excess triglycerides in the liver, and which, in its early phases, is categorized NAFLD, or hepato-steatosis with pure fatty liver. The mortality rate of non-alcoholic steatohepatitis (NASH) is more than NAFLD; therefore, diagnosing the disease in its early stages may decrease liver damage and increase the survival rate. In the current study, we screened the gene expression data of NAFLD patients and control samples from the public dataset GEO to detect DEGs. Then, the correlation betweenbetween the top selected DEGs and clinical data was evaluated. In the present study, two GEO datasets (GSE48452, GSE126848) were downloaded. The dysregulated expressed genes (DEGs) were identified by machine learning methods (Penalize regression models). Then, the shared DEGs between the two training datasets were validated using validation datasets. ROC-curve analysis was used to identify diagnostic markers. R software analyzed the interactions between DEGs, clinical data, and fatty liver. Ten novel genes, including ABCF1, SART3, APC5, NONO, KAT7, ZPR1, RABGAP1, SLC7A8, SPAG9, and KAT6A were found to have a differential expression between NAFLD and healthy individuals. Based on validation results and ROC analysis, NR4A2 and IGFBP1b were identified as diagnostic markers. These key genes may be predictive markers for the development of fatty liver. It is recommended that these key genes are assessed further as possible predictive markers during the development of fatty liver. Nature Publishing Group UK 2023-11-22 /pmc/articles/PMC10665370/ /pubmed/37993474 http://dx.doi.org/10.1038/s41598-023-46711-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Nazari, Elham
Khalili-Tanha, Ghazaleh
Asadnia, Alireza
Pourali, Ghazaleh
Maftooh, Mina
Khazaei, Majid
Nasiri, Mohammadreza
Hassanian, Seyed Mahdi
Ghayour-Mobarhan, Majid
Ferns, Gordon A.
Kiani, Mohammad Ali
Avan, Amir
Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease
title Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease
title_full Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease
title_fullStr Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease
title_full_unstemmed Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease
title_short Bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease
title_sort bioinformatics analysis and machine learning approach applied to the identification of novel key genes involved in non-alcoholic fatty liver disease
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665370/
https://www.ncbi.nlm.nih.gov/pubmed/37993474
http://dx.doi.org/10.1038/s41598-023-46711-x
work_keys_str_mv AT nazarielham bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT khalilitanhaghazaleh bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT asadniaalireza bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT pouralighazaleh bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT maftoohmina bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT khazaeimajid bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT nasirimohammadreza bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT hassanianseyedmahdi bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT ghayourmobarhanmajid bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT fernsgordona bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT kianimohammadali bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease
AT avanamir bioinformaticsanalysisandmachinelearningapproachappliedtotheidentificationofnovelkeygenesinvolvedinnonalcoholicfattyliverdisease