Cargando…

Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials

This article contains data related to the research article ‘Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials’ (Williams and Halappanavar, 2015) [1]. The presence of diverse types of nanomaterials (NMs)...

Descripción completa

Detalles Bibliográficos
Autores principales: Williams, Andrew, Halappanavar, Sabina
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5683856/
https://www.ncbi.nlm.nih.gov/pubmed/29159232
http://dx.doi.org/10.1016/j.dib.2017.10.060
_version_ 1783278372202217472
author Williams, Andrew
Halappanavar, Sabina
author_facet Williams, Andrew
Halappanavar, Sabina
author_sort Williams, Andrew
collection PubMed
description This article contains data related to the research article ‘Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials’ (Williams and Halappanavar, 2015) [1]. The presence of diverse types of nanomaterials (NMs) in commerce has grown significantly in the past decade and as a result, human exposure to these materials in the environment is inevitable. The traditional toxicity testing approaches that are reliant on animals are both time- and cost- intensive; employing which, it is not possible to complete the challenging task of safety assessment of NMs currently on the market in a timely manner. Thus, there is an urgent need for comprehensive understanding of the biological behavior of NMs, and efficient toxicity screening tools that will enable the development of predictive toxicology paradigms suited to rapidly assessing the human health impacts of exposure to NMs. In an effort to predict the long term health impacts of acute exposure to NMs, in Williams and Halappanavar (2015) [1], we applied bi-clustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related bi-clusters showing similar gene expression profiles were identified. The identified bi-clusters were then used to conduct a gene set enrichment analysis on lung gene expression profiles derived from mice exposed to nano-titanium dioxide, carbon black or carbon nanotubes (nano-TiO2, CB and CNTs) to determine the disease significance of these data-driven gene sets. The results of the analysis correctly identified all NMs to be inflammogenic, and only CB and CNTs as potentially fibrogenic. Here, we elaborate on the details of the statistical methods and algorithms used to derive the disease relevant gene signatures. These details will enable other investigators to use the gene signature in future Gene Set Enrichment Analysis studies involving NMs or as features for clustering and classifying NMs of diverse properties.
format Online
Article
Text
id pubmed-5683856
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-56838562017-11-20 Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials Williams, Andrew Halappanavar, Sabina Data Brief Genetics, Genomics and Molecular Biology    This article contains data related to the research article ‘Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials’ (Williams and Halappanavar, 2015) [1]. The presence of diverse types of nanomaterials (NMs) in commerce has grown significantly in the past decade and as a result, human exposure to these materials in the environment is inevitable. The traditional toxicity testing approaches that are reliant on animals are both time- and cost- intensive; employing which, it is not possible to complete the challenging task of safety assessment of NMs currently on the market in a timely manner. Thus, there is an urgent need for comprehensive understanding of the biological behavior of NMs, and efficient toxicity screening tools that will enable the development of predictive toxicology paradigms suited to rapidly assessing the human health impacts of exposure to NMs. In an effort to predict the long term health impacts of acute exposure to NMs, in Williams and Halappanavar (2015) [1], we applied bi-clustering and gene set enrichment analysis methods to derive essential features of altered lung transcriptome following exposure to NMs that are associated with lung-specific diseases. Several datasets from public microarray repositories describing pulmonary diseases in mouse models following exposure to a variety of substances were examined and functionally related bi-clusters showing similar gene expression profiles were identified. The identified bi-clusters were then used to conduct a gene set enrichment analysis on lung gene expression profiles derived from mice exposed to nano-titanium dioxide, carbon black or carbon nanotubes (nano-TiO2, CB and CNTs) to determine the disease significance of these data-driven gene sets. The results of the analysis correctly identified all NMs to be inflammogenic, and only CB and CNTs as potentially fibrogenic. Here, we elaborate on the details of the statistical methods and algorithms used to derive the disease relevant gene signatures. These details will enable other investigators to use the gene signature in future Gene Set Enrichment Analysis studies involving NMs or as features for clustering and classifying NMs of diverse properties. Elsevier 2017-10-26 /pmc/articles/PMC5683856/ /pubmed/29159232 http://dx.doi.org/10.1016/j.dib.2017.10.060 Text en Crown Copyright © 2017 Published by Elsevier Inc. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Genetics, Genomics and Molecular Biology   
Williams, Andrew
Halappanavar, Sabina
Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials
title Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials
title_full Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials
title_fullStr Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials
title_full_unstemmed Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials
title_short Application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials
title_sort application of bi-clustering of gene expression data and gene set enrichment analysis methods to identify potentially disease causing nanomaterials
topic Genetics, Genomics and Molecular Biology   
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5683856/
https://www.ncbi.nlm.nih.gov/pubmed/29159232
http://dx.doi.org/10.1016/j.dib.2017.10.060
work_keys_str_mv AT williamsandrew applicationofbiclusteringofgeneexpressiondataandgenesetenrichmentanalysismethodstoidentifypotentiallydiseasecausingnanomaterials
AT halappanavarsabina applicationofbiclusteringofgeneexpressiondataandgenesetenrichmentanalysismethodstoidentifypotentiallydiseasecausingnanomaterials