Cargando…
Multiscale Embedded Gene Co-expression Network Analysis
Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, num...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4664553/ https://www.ncbi.nlm.nih.gov/pubmed/26618778 http://dx.doi.org/10.1371/journal.pcbi.1004574 |
_version_ | 1782403443982336000 |
---|---|
author | Song, Won-Min Zhang, Bin |
author_facet | Song, Won-Min Zhang, Bin |
author_sort | Song, Won-Min |
collection | PubMed |
description | Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|(3)), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma. |
format | Online Article Text |
id | pubmed-4664553 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-46645532015-12-10 Multiscale Embedded Gene Co-expression Network Analysis Song, Won-Min Zhang, Bin PLoS Comput Biol Research Article Gene co-expression network analysis has been shown effective in identifying functional co-expressed gene modules associated with complex human diseases. However, existing techniques to construct co-expression networks require some critical prior information such as predefined number of clusters, numerical thresholds for defining co-expression/interaction, or do not naturally reproduce the hallmarks of complex systems such as the scale-free degree distribution of small-worldness. Previously, a graph filtering technique called Planar Maximally Filtered Graph (PMFG) has been applied to many real-world data sets such as financial stock prices and gene expression to extract meaningful and relevant interactions. However, PMFG is not suitable for large-scale genomic data due to several drawbacks, such as the high computation complexity O(|V|(3)), the presence of false-positives due to the maximal planarity constraint, and the inadequacy of the clustering framework. Here, we developed a new co-expression network analysis framework called Multiscale Embedded Gene Co-expression Network Analysis (MEGENA) by: i) introducing quality control of co-expression similarities, ii) parallelizing embedded network construction, and iii) developing a novel clustering technique to identify multi-scale clustering structures in Planar Filtered Networks (PFNs). We applied MEGENA to a series of simulated data and the gene expression data in breast carcinoma and lung adenocarcinoma from The Cancer Genome Atlas (TCGA). MEGENA showed improved performance over well-established clustering methods and co-expression network construction approaches. MEGENA revealed not only meaningful multi-scale organizations of co-expressed gene clusters but also novel targets in breast carcinoma and lung adenocarcinoma. Public Library of Science 2015-11-30 /pmc/articles/PMC4664553/ /pubmed/26618778 http://dx.doi.org/10.1371/journal.pcbi.1004574 Text en © 2015 Song, Zhang http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Song, Won-Min Zhang, Bin Multiscale Embedded Gene Co-expression Network Analysis |
title | Multiscale Embedded Gene Co-expression Network Analysis |
title_full | Multiscale Embedded Gene Co-expression Network Analysis |
title_fullStr | Multiscale Embedded Gene Co-expression Network Analysis |
title_full_unstemmed | Multiscale Embedded Gene Co-expression Network Analysis |
title_short | Multiscale Embedded Gene Co-expression Network Analysis |
title_sort | multiscale embedded gene co-expression network analysis |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4664553/ https://www.ncbi.nlm.nih.gov/pubmed/26618778 http://dx.doi.org/10.1371/journal.pcbi.1004574 |
work_keys_str_mv | AT songwonmin multiscaleembeddedgenecoexpressionnetworkanalysis AT zhangbin multiscaleembeddedgenecoexpressionnetworkanalysis |