Cargando…

EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities

Gene co-expression networks (GCNs) are constructed from Gene Expression Matrices (GEMs) in a bottom up approach where all gene pairs are tested for correlation within the context of the input sample set. This approach is computationally intensive for many current GEMs and may not be scalable to mill...

Descripción completa

Detalles Bibliográficos
Autores principales: Husain, Benafsh, Feltus, F. Alex
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6684082/
https://www.ncbi.nlm.nih.gov/pubmed/31386677
http://dx.doi.org/10.1371/journal.pone.0220279
_version_ 1783442211193487360
author Husain, Benafsh
Feltus, F. Alex
author_facet Husain, Benafsh
Feltus, F. Alex
author_sort Husain, Benafsh
collection PubMed
description Gene co-expression networks (GCNs) are constructed from Gene Expression Matrices (GEMs) in a bottom up approach where all gene pairs are tested for correlation within the context of the input sample set. This approach is computationally intensive for many current GEMs and may not be scalable to millions of samples. Further, traditional GCNs do not detect non-linear relationships missed by correlation tests and do not place genetic relationships in a gene expression intensity context. In this report, we propose EdgeScaping, which constructs and analyzes the pairwise gene intensity network in a holistic, top down approach where no edges are filtered. EdgeScaping uses a novel technique to convert traditional pairwise gene expression data to an image based format. This conversion not only performs feature compression, making our algorithm highly scalable, but it also allows for exploring non-linear relationships between genes by leveraging deep learning image analysis algorithms. Using the learned embedded feature space we implement a fast, efficient algorithm to cluster the entire space of gene expression relationships while retaining gene expression intensity. Since EdgeScaping does not eliminate conventionally noisy edges, it extends the identification of co-expression relationships beyond classically correlated edges to facilitate the discovery of novel or unusual expression patterns within the network. We applied EdgeScaping to a human tumor GEM to identify sets of genes that exhibit conventional and non-conventional interdependent non-linear behavior associated with brain specific tumor sub-types that would be eliminated in conventional bottom-up construction of GCNs. Edgescaping source code is available at https://github.com/bhusain/EdgeScaping under the MIT license.
format Online
Article
Text
id pubmed-6684082
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-66840822019-08-15 EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities Husain, Benafsh Feltus, F. Alex PLoS One Research Article Gene co-expression networks (GCNs) are constructed from Gene Expression Matrices (GEMs) in a bottom up approach where all gene pairs are tested for correlation within the context of the input sample set. This approach is computationally intensive for many current GEMs and may not be scalable to millions of samples. Further, traditional GCNs do not detect non-linear relationships missed by correlation tests and do not place genetic relationships in a gene expression intensity context. In this report, we propose EdgeScaping, which constructs and analyzes the pairwise gene intensity network in a holistic, top down approach where no edges are filtered. EdgeScaping uses a novel technique to convert traditional pairwise gene expression data to an image based format. This conversion not only performs feature compression, making our algorithm highly scalable, but it also allows for exploring non-linear relationships between genes by leveraging deep learning image analysis algorithms. Using the learned embedded feature space we implement a fast, efficient algorithm to cluster the entire space of gene expression relationships while retaining gene expression intensity. Since EdgeScaping does not eliminate conventionally noisy edges, it extends the identification of co-expression relationships beyond classically correlated edges to facilitate the discovery of novel or unusual expression patterns within the network. We applied EdgeScaping to a human tumor GEM to identify sets of genes that exhibit conventional and non-conventional interdependent non-linear behavior associated with brain specific tumor sub-types that would be eliminated in conventional bottom-up construction of GCNs. Edgescaping source code is available at https://github.com/bhusain/EdgeScaping under the MIT license. Public Library of Science 2019-08-06 /pmc/articles/PMC6684082/ /pubmed/31386677 http://dx.doi.org/10.1371/journal.pone.0220279 Text en © 2019 Husain, Feltus http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Husain, Benafsh
Feltus, F. Alex
EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities
title EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities
title_full EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities
title_fullStr EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities
title_full_unstemmed EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities
title_short EdgeScaping: Mapping the spatial distribution of pairwise gene expression intensities
title_sort edgescaping: mapping the spatial distribution of pairwise gene expression intensities
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6684082/
https://www.ncbi.nlm.nih.gov/pubmed/31386677
http://dx.doi.org/10.1371/journal.pone.0220279
work_keys_str_mv AT husainbenafsh edgescapingmappingthespatialdistributionofpairwisegeneexpressionintensities
AT feltusfalex edgescapingmappingthespatialdistributionofpairwisegeneexpressionintensities