Cargando…

Network-based cancer genomic data integration for pattern discovery

BACKGROUND: Since genes involved in the same biological modules usually present correlated expression profiles, lots of computational methods have been proposed to identify gene functional modules based on the expression profiles data. Recently, Sparse Singular Value Decomposition (SSVD) method has...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Fangfang, Li, Jiang, Liu, Juan, Min, Wenwen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662848/
https://www.ncbi.nlm.nih.gov/pubmed/34886811
http://dx.doi.org/10.1186/s12863-021-01004-y
_version_ 1784613522508873728
author Zhu, Fangfang
Li, Jiang
Liu, Juan
Min, Wenwen
author_facet Zhu, Fangfang
Li, Jiang
Liu, Juan
Min, Wenwen
author_sort Zhu, Fangfang
collection PubMed
description BACKGROUND: Since genes involved in the same biological modules usually present correlated expression profiles, lots of computational methods have been proposed to identify gene functional modules based on the expression profiles data. Recently, Sparse Singular Value Decomposition (SSVD) method has been proposed to bicluster gene expression data to identify gene modules. However, this model can only handle the gene expression data where no gene interaction information is integrated. Ignoring the prior gene interaction information may produce the identified gene modules hard to be biologically interpreted. RESULTS: In this paper, we develop a Sparse Network-regularized SVD (SNSVD) method that integrates a prior gene interaction network from a protein protein interaction network and gene expression data to identify underlying gene functional modules. The results on a set of simulated data show that SNSVD is more effective than the traditional SVD-based methods. The further experiment results on real cancer genomic data show that most co-expressed modules are not only significantly enriched on GO/KEGG pathways, but also correspond to dense sub-networks in the prior gene interaction network. Besides, we also use our method to identify ten differentially co-expressed miRNA-gene modules by integrating matched miRNA and mRNA expression data of breast cancer from The Cancer Genome Atlas (TCGA). Several important breast cancer related miRNA-gene modules are discovered. CONCLUSIONS: All the results demonstrate that SNSVD can overcome the drawbacks of SSVD and capture more biologically relevant functional modules by incorporating a prior gene interaction network. These identified functional modules may provide a new perspective to understand the diagnostics, occurrence and progression of cancer.
format Online
Article
Text
id pubmed-8662848
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-86628482021-12-13 Network-based cancer genomic data integration for pattern discovery Zhu, Fangfang Li, Jiang Liu, Juan Min, Wenwen BMC Genom Data Research BACKGROUND: Since genes involved in the same biological modules usually present correlated expression profiles, lots of computational methods have been proposed to identify gene functional modules based on the expression profiles data. Recently, Sparse Singular Value Decomposition (SSVD) method has been proposed to bicluster gene expression data to identify gene modules. However, this model can only handle the gene expression data where no gene interaction information is integrated. Ignoring the prior gene interaction information may produce the identified gene modules hard to be biologically interpreted. RESULTS: In this paper, we develop a Sparse Network-regularized SVD (SNSVD) method that integrates a prior gene interaction network from a protein protein interaction network and gene expression data to identify underlying gene functional modules. The results on a set of simulated data show that SNSVD is more effective than the traditional SVD-based methods. The further experiment results on real cancer genomic data show that most co-expressed modules are not only significantly enriched on GO/KEGG pathways, but also correspond to dense sub-networks in the prior gene interaction network. Besides, we also use our method to identify ten differentially co-expressed miRNA-gene modules by integrating matched miRNA and mRNA expression data of breast cancer from The Cancer Genome Atlas (TCGA). Several important breast cancer related miRNA-gene modules are discovered. CONCLUSIONS: All the results demonstrate that SNSVD can overcome the drawbacks of SSVD and capture more biologically relevant functional modules by incorporating a prior gene interaction network. These identified functional modules may provide a new perspective to understand the diagnostics, occurrence and progression of cancer. BioMed Central 2021-12-10 /pmc/articles/PMC8662848/ /pubmed/34886811 http://dx.doi.org/10.1186/s12863-021-01004-y Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Zhu, Fangfang
Li, Jiang
Liu, Juan
Min, Wenwen
Network-based cancer genomic data integration for pattern discovery
title Network-based cancer genomic data integration for pattern discovery
title_full Network-based cancer genomic data integration for pattern discovery
title_fullStr Network-based cancer genomic data integration for pattern discovery
title_full_unstemmed Network-based cancer genomic data integration for pattern discovery
title_short Network-based cancer genomic data integration for pattern discovery
title_sort network-based cancer genomic data integration for pattern discovery
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8662848/
https://www.ncbi.nlm.nih.gov/pubmed/34886811
http://dx.doi.org/10.1186/s12863-021-01004-y
work_keys_str_mv AT zhufangfang networkbasedcancergenomicdataintegrationforpatterndiscovery
AT lijiang networkbasedcancergenomicdataintegrationforpatterndiscovery
AT liujuan networkbasedcancergenomicdataintegrationforpatterndiscovery
AT minwenwen networkbasedcancergenomicdataintegrationforpatterndiscovery