Cargando…
EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities
Gene co-expression networks (GCNs) are constructed from Gene Expression Matrices (GEMs) in a bottom up approach where all gene pairs are tested for correlation within the context of the input sample set. This approach is computationally intensive for many current GEMs and may not be scalable to mill...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6684082/ https://www.ncbi.nlm.nih.gov/pubmed/31386677 http://dx.doi.org/10.1371/journal.pone.0220279 |
_version_ | 1783442211193487360 |
---|---|
author | Husain, Benafsh Feltus, F. Alex |
author_facet | Husain, Benafsh Feltus, F. Alex |
author_sort | Husain, Benafsh |
collection | PubMed |
description | Gene co-expression networks (GCNs) are constructed from Gene Expression Matrices (GEMs) in a bottom up approach where all gene pairs are tested for correlation within the context of the input sample set. This approach is computationally intensive for many current GEMs and may not be scalable to millions of samples. Further, traditional GCNs do not detect non-linear relationships missed by correlation tests and do not place genetic relationships in a gene expression intensity context. In this report, we propose EdgeScaping, which constructs and analyzes the pairwise gene intensity network in a holistic, top down approach where no edges are filtered. EdgeScaping uses a novel technique to convert traditional pairwise gene expression data to an image based format. This conversion not only performs feature compression, making our algorithm highly scalable, but it also allows for exploring non-linear relationships between genes by leveraging deep learning image analysis algorithms. Using the learned embedded feature space we implement a fast, efficient algorithm to cluster the entire space of gene expression relationships while retaining gene expression intensity. Since EdgeScaping does not eliminate conventionally noisy edges, it extends the identification of co-expression relationships beyond classically correlated edges to facilitate the discovery of novel or unusual expression patterns within the network. We applied EdgeScaping to a human tumor GEM to identify sets of genes that exhibit conventional and non-conventional interdependent non-linear behavior associated with brain specific tumor sub-types that would be eliminated in conventional bottom-up construction of GCNs. Edgescaping source code is available at https://github.com/bhusain/EdgeScaping under the MIT license. |
format | Online Article Text |
id | pubmed-6684082 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-66840822019-08-15 EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities Husain, Benafsh Feltus, F. Alex PLoS One Research Article Gene co-expression networks (GCNs) are constructed from Gene Expression Matrices (GEMs) in a bottom up approach where all gene pairs are tested for correlation within the context of the input sample set. This approach is computationally intensive for many current GEMs and may not be scalable to millions of samples. Further, traditional GCNs do not detect non-linear relationships missed by correlation tests and do not place genetic relationships in a gene expression intensity context. In this report, we propose EdgeScaping, which constructs and analyzes the pairwise gene intensity network in a holistic, top down approach where no edges are filtered. EdgeScaping uses a novel technique to convert traditional pairwise gene expression data to an image based format. This conversion not only performs feature compression, making our algorithm highly scalable, but it also allows for exploring non-linear relationships between genes by leveraging deep learning image analysis algorithms. Using the learned embedded feature space we implement a fast, efficient algorithm to cluster the entire space of gene expression relationships while retaining gene expression intensity. Since EdgeScaping does not eliminate conventionally noisy edges, it extends the identification of co-expression relationships beyond classically correlated edges to facilitate the discovery of novel or unusual expression patterns within the network. We applied EdgeScaping to a human tumor GEM to identify sets of genes that exhibit conventional and non-conventional interdependent non-linear behavior associated with brain specific tumor sub-types that would be eliminated in conventional bottom-up construction of GCNs. Edgescaping source code is available at https://github.com/bhusain/EdgeScaping under the MIT license. Public Library of Science 2019-08-06 /pmc/articles/PMC6684082/ /pubmed/31386677 http://dx.doi.org/10.1371/journal.pone.0220279 Text en © 2019 Husain, Feltus http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Husain, Benafsh Feltus, F. Alex EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities |
title | EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities |
title_full | EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities |
title_fullStr | EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities |
title_full_unstemmed | EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities |
title_short | EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities |
title_sort | edgescaping: mapping the spatial distribution of pairwise gene expression intensities |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6684082/ https://www.ncbi.nlm.nih.gov/pubmed/31386677 http://dx.doi.org/10.1371/journal.pone.0220279 |
work_keys_str_mv | AT husainbenafsh edgescapingmappingthespatialdistributionofpairwisegeneexpressionintensities AT feltusfalex edgescapingmappingthespatialdistributionofpairwisegeneexpressionintensities |