Cargando…
Network-based cancer genomic data integration for pattern discovery
BACKGROUND: Since genes involved in the same biological modules usually present correlated expression profiles, lots of computational methods have been proposed to identify gene functional modules based on the expression profiles data. Recently, Sparse Singular Value Decomposition (SSVD) method has...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662848/ https://www.ncbi.nlm.nih.gov/pubmed/34886811 http://dx.doi.org/10.1186/s12863-021-01004-y |
_version_ | 1784613522508873728 |
---|---|
author | Zhu, Fangfang Li, Jiang Liu, Juan Min, Wenwen |
author_facet | Zhu, Fangfang Li, Jiang Liu, Juan Min, Wenwen |
author_sort | Zhu, Fangfang |
collection | PubMed |
description | BACKGROUND: Since genes involved in the same biological modules usually present correlated expression profiles, lots of computational methods have been proposed to identify gene functional modules based on the expression profiles data. Recently, Sparse Singular Value Decomposition (SSVD) method has been proposed to bicluster gene expression data to identify gene modules. However, this model can only handle the gene expression data where no gene interaction information is integrated. Ignoring the prior gene interaction information may produce the identified gene modules hard to be biologically interpreted. RESULTS: In this paper, we develop a Sparse Network-regularized SVD (SNSVD) method that integrates a prior gene interaction network from a protein protein interaction network and gene expression data to identify underlying gene functional modules. The results on a set of simulated data show that SNSVD is more effective than the traditional SVD-based methods. The further experiment results on real cancer genomic data show that most co-expressed modules are not only significantly enriched on GO/KEGG pathways, but also correspond to dense sub-networks in the prior gene interaction network. Besides, we also use our method to identify ten differentially co-expressed miRNA-gene modules by integrating matched miRNA and mRNA expression data of breast cancer from The Cancer Genome Atlas (TCGA). Several important breast cancer related miRNA-gene modules are discovered. CONCLUSIONS: All the results demonstrate that SNSVD can overcome the drawbacks of SSVD and capture more biologically relevant functional modules by incorporating a prior gene interaction network. These identified functional modules may provide a new perspective to understand the diagnostics, occurrence and progression of cancer. |
format | Online Article Text |
id | pubmed-8662848 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-86628482021-12-13 Network-based cancer genomic data integration for pattern discovery Zhu, Fangfang Li, Jiang Liu, Juan Min, Wenwen BMC Genom Data Research BACKGROUND: Since genes involved in the same biological modules usually present correlated expression profiles, lots of computational methods have been proposed to identify gene functional modules based on the expression profiles data. Recently, Sparse Singular Value Decomposition (SSVD) method has been proposed to bicluster gene expression data to identify gene modules. However, this model can only handle the gene expression data where no gene interaction information is integrated. Ignoring the prior gene interaction information may produce the identified gene modules hard to be biologically interpreted. RESULTS: In this paper, we develop a Sparse Network-regularized SVD (SNSVD) method that integrates a prior gene interaction network from a protein protein interaction network and gene expression data to identify underlying gene functional modules. The results on a set of simulated data show that SNSVD is more effective than the traditional SVD-based methods. The further experiment results on real cancer genomic data show that most co-expressed modules are not only significantly enriched on GO/KEGG pathways, but also correspond to dense sub-networks in the prior gene interaction network. Besides, we also use our method to identify ten differentially co-expressed miRNA-gene modules by integrating matched miRNA and mRNA expression data of breast cancer from The Cancer Genome Atlas (TCGA). Several important breast cancer related miRNA-gene modules are discovered. CONCLUSIONS: All the results demonstrate that SNSVD can overcome the drawbacks of SSVD and capture more biologically relevant functional modules by incorporating a prior gene interaction network. These identified functional modules may provide a new perspective to understand the diagnostics, occurrence and progression of cancer. BioMed Central 2021-12-10 /pmc/articles/PMC8662848/ /pubmed/34886811 http://dx.doi.org/10.1186/s12863-021-01004-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Zhu, Fangfang Li, Jiang Liu, Juan Min, Wenwen Network-based cancer genomic data integration for pattern discovery |
title | Network-based cancer genomic data integration for pattern discovery |
title_full | Network-based cancer genomic data integration for pattern discovery |
title_fullStr | Network-based cancer genomic data integration for pattern discovery |
title_full_unstemmed | Network-based cancer genomic data integration for pattern discovery |
title_short | Network-based cancer genomic data integration for pattern discovery |
title_sort | network-based cancer genomic data integration for pattern discovery |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662848/ https://www.ncbi.nlm.nih.gov/pubmed/34886811 http://dx.doi.org/10.1186/s12863-021-01004-y |
work_keys_str_mv | AT zhufangfang networkbasedcancergenomicdataintegrationforpatterndiscovery AT lijiang networkbasedcancergenomicdataintegrationforpatterndiscovery AT liujuan networkbasedcancergenomicdataintegrationforpatterndiscovery AT minwenwen networkbasedcancergenomicdataintegrationforpatterndiscovery |