Cargando…

Environmental pathways affecting gene expression (E.PAGE) as an R package to predict gene–environment associations

The purpose of this study is to manually and semi-automatically curate a database and develop an R package that will act as a comprehensive resource to understand how biological processes are dysregulated due to interactions with environmental factors. The initial database search run on the Gene Exp...

Descripción completa

Detalles Bibliográficos
Autores principales: Muralidharan, Sachin, Ali, Sarah, Yang, Lilin, Badshah, Joshua, Zahir, Syeda Farah, Ali, Rubbiya A., Chandra, Janin, Frazer, Ian H., Thomas, Ranjeny, Mehdi, Ahmed M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9636158/
https://www.ncbi.nlm.nih.gov/pubmed/36333579
http://dx.doi.org/10.1038/s41598-022-21988-6
_version_ 1784824875880284160
author Muralidharan, Sachin
Ali, Sarah
Yang, Lilin
Badshah, Joshua
Zahir, Syeda Farah
Ali, Rubbiya A.
Chandra, Janin
Frazer, Ian H.
Thomas, Ranjeny
Mehdi, Ahmed M.
author_facet Muralidharan, Sachin
Ali, Sarah
Yang, Lilin
Badshah, Joshua
Zahir, Syeda Farah
Ali, Rubbiya A.
Chandra, Janin
Frazer, Ian H.
Thomas, Ranjeny
Mehdi, Ahmed M.
author_sort Muralidharan, Sachin
collection PubMed
description The purpose of this study is to manually and semi-automatically curate a database and develop an R package that will act as a comprehensive resource to understand how biological processes are dysregulated due to interactions with environmental factors. The initial database search run on the Gene Expression Omnibus and the Molecular Signature Database retrieved a total of 90,018 articles. After title and abstract screening against pre-set criteria, a total of 237 datasets were selected and 522 gene modules were manually annotated. We then curated a database containing four environmental factors, cigarette smoking, diet, infections and toxic chemicals, along with a total of 25,789 genes that had an association with one or more of gene modules. The database and statistical analysis package was then tested with the differentially expressed genes obtained from the published literature related to type 1 diabetes, rheumatoid arthritis, small cell lung cancer, COVID-19, cobalt exposure and smoking. On testing, we uncovered statistically enriched biological processes, which revealed pathways associated with environmental factors and the genes. The curated database and enrichment tool are available as R packages at https://github.com/AhmedMehdiLab/E.PATH and https://github.com/AhmedMehdiLab/E.PAGE respectively.
format Online
Article
Text
id pubmed-9636158
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96361582022-11-06 Environmental pathways affecting gene expression (E.PAGE) as an R package to predict gene–environment associations Muralidharan, Sachin Ali, Sarah Yang, Lilin Badshah, Joshua Zahir, Syeda Farah Ali, Rubbiya A. Chandra, Janin Frazer, Ian H. Thomas, Ranjeny Mehdi, Ahmed M. Sci Rep Article The purpose of this study is to manually and semi-automatically curate a database and develop an R package that will act as a comprehensive resource to understand how biological processes are dysregulated due to interactions with environmental factors. The initial database search run on the Gene Expression Omnibus and the Molecular Signature Database retrieved a total of 90,018 articles. After title and abstract screening against pre-set criteria, a total of 237 datasets were selected and 522 gene modules were manually annotated. We then curated a database containing four environmental factors, cigarette smoking, diet, infections and toxic chemicals, along with a total of 25,789 genes that had an association with one or more of gene modules. The database and statistical analysis package was then tested with the differentially expressed genes obtained from the published literature related to type 1 diabetes, rheumatoid arthritis, small cell lung cancer, COVID-19, cobalt exposure and smoking. On testing, we uncovered statistically enriched biological processes, which revealed pathways associated with environmental factors and the genes. The curated database and enrichment tool are available as R packages at https://github.com/AhmedMehdiLab/E.PATH and https://github.com/AhmedMehdiLab/E.PAGE respectively. Nature Publishing Group UK 2022-11-04 /pmc/articles/PMC9636158/ /pubmed/36333579 http://dx.doi.org/10.1038/s41598-022-21988-6 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Muralidharan, Sachin
Ali, Sarah
Yang, Lilin
Badshah, Joshua
Zahir, Syeda Farah
Ali, Rubbiya A.
Chandra, Janin
Frazer, Ian H.
Thomas, Ranjeny
Mehdi, Ahmed M.
Environmental pathways affecting gene expression (E.PAGE) as an R package to predict gene–environment associations
title Environmental pathways affecting gene expression (E.PAGE) as an R package to predict gene–environment associations
title_full Environmental pathways affecting gene expression (E.PAGE) as an R package to predict gene–environment associations
title_fullStr Environmental pathways affecting gene expression (E.PAGE) as an R package to predict gene–environment associations
title_full_unstemmed Environmental pathways affecting gene expression (E.PAGE) as an R package to predict gene–environment associations
title_short Environmental pathways affecting gene expression (E.PAGE) as an R package to predict gene–environment associations
title_sort environmental pathways affecting gene expression (e.page) as an r package to predict gene–environment associations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9636158/
https://www.ncbi.nlm.nih.gov/pubmed/36333579
http://dx.doi.org/10.1038/s41598-022-21988-6
work_keys_str_mv AT muralidharansachin environmentalpathwaysaffectinggeneexpressionepageasanrpackagetopredictgeneenvironmentassociations
AT alisarah environmentalpathwaysaffectinggeneexpressionepageasanrpackagetopredictgeneenvironmentassociations
AT yanglilin environmentalpathwaysaffectinggeneexpressionepageasanrpackagetopredictgeneenvironmentassociations
AT badshahjoshua environmentalpathwaysaffectinggeneexpressionepageasanrpackagetopredictgeneenvironmentassociations
AT zahirsyedafarah environmentalpathwaysaffectinggeneexpressionepageasanrpackagetopredictgeneenvironmentassociations
AT alirubbiyaa environmentalpathwaysaffectinggeneexpressionepageasanrpackagetopredictgeneenvironmentassociations
AT chandrajanin environmentalpathwaysaffectinggeneexpressionepageasanrpackagetopredictgeneenvironmentassociations
AT frazerianh environmentalpathwaysaffectinggeneexpressionepageasanrpackagetopredictgeneenvironmentassociations
AT thomasranjeny environmentalpathwaysaffectinggeneexpressionepageasanrpackagetopredictgeneenvironmentassociations
AT mehdiahmedm environmentalpathwaysaffectinggeneexpressionepageasanrpackagetopredictgeneenvironmentassociations