Cargando…
NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer
Pancreatic cancer (PC) is a highly fatal disease, yet its causes remain unclear. Comprehensive analysis of different types of PC genetic data plays a crucial role in understanding its pathogenic mechanisms. Currently, non-negative matrix factorization (NMF)-based methods are widely used for genetic...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8340025/ https://www.ncbi.nlm.nih.gov/pubmed/34367241 http://dx.doi.org/10.3389/fgene.2021.678642 |
_version_ | 1783733719987650560 |
---|---|
author | Ding, Qian Sun, Yan Shang, Junliang Li, Feng Zhang, Yuanyuan Liu, Jin-Xing |
author_facet | Ding, Qian Sun, Yan Shang, Junliang Li, Feng Zhang, Yuanyuan Liu, Jin-Xing |
author_sort | Ding, Qian |
collection | PubMed |
description | Pancreatic cancer (PC) is a highly fatal disease, yet its causes remain unclear. Comprehensive analysis of different types of PC genetic data plays a crucial role in understanding its pathogenic mechanisms. Currently, non-negative matrix factorization (NMF)-based methods are widely used for genetic data analysis. Nevertheless, it is a challenge for them to integrate and decompose different types of genetic data simultaneously. In this paper, a non-NMF network analysis method, NMFNA, is proposed, which introduces a graph-regularized constraint to the NMF, for identifying modules and characteristic genes from two-type PC data of methylation (ME) and copy number variation (CNV). Firstly, three PC networks, i.e., ME network, CNV network, and ME–CNV network, are constructed using the Pearson correlation coefficient (PCC). Then, modules are detected from these three PC networks effectively due to the introduced graph-regularized constraint, which is the highlight of the NMFNA. Finally, both gene ontology (GO) and pathway enrichment analyses are performed, and characteristic genes are detected by the multimeasure score, to deeply understand biological functions of PC core modules. Experimental results demonstrated that the NMFNA facilitates the integration and decomposition of two types of PC data simultaneously and can further serve as an alternative method for detecting modules and characteristic genes from multiple genetic data of complex diseases. |
format | Online Article Text |
id | pubmed-8340025 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-83400252021-08-06 NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer Ding, Qian Sun, Yan Shang, Junliang Li, Feng Zhang, Yuanyuan Liu, Jin-Xing Front Genet Genetics Pancreatic cancer (PC) is a highly fatal disease, yet its causes remain unclear. Comprehensive analysis of different types of PC genetic data plays a crucial role in understanding its pathogenic mechanisms. Currently, non-negative matrix factorization (NMF)-based methods are widely used for genetic data analysis. Nevertheless, it is a challenge for them to integrate and decompose different types of genetic data simultaneously. In this paper, a non-NMF network analysis method, NMFNA, is proposed, which introduces a graph-regularized constraint to the NMF, for identifying modules and characteristic genes from two-type PC data of methylation (ME) and copy number variation (CNV). Firstly, three PC networks, i.e., ME network, CNV network, and ME–CNV network, are constructed using the Pearson correlation coefficient (PCC). Then, modules are detected from these three PC networks effectively due to the introduced graph-regularized constraint, which is the highlight of the NMFNA. Finally, both gene ontology (GO) and pathway enrichment analyses are performed, and characteristic genes are detected by the multimeasure score, to deeply understand biological functions of PC core modules. Experimental results demonstrated that the NMFNA facilitates the integration and decomposition of two types of PC data simultaneously and can further serve as an alternative method for detecting modules and characteristic genes from multiple genetic data of complex diseases. Frontiers Media S.A. 2021-07-22 /pmc/articles/PMC8340025/ /pubmed/34367241 http://dx.doi.org/10.3389/fgene.2021.678642 Text en Copyright © 2021 Ding, Sun, Shang, Li, Zhang and Liu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Ding, Qian Sun, Yan Shang, Junliang Li, Feng Zhang, Yuanyuan Liu, Jin-Xing NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer |
title | NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer |
title_full | NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer |
title_fullStr | NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer |
title_full_unstemmed | NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer |
title_short | NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer |
title_sort | nmfna: a non-negative matrix factorization network analysis method for identifying modules and characteristic genes of pancreatic cancer |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8340025/ https://www.ncbi.nlm.nih.gov/pubmed/34367241 http://dx.doi.org/10.3389/fgene.2021.678642 |
work_keys_str_mv | AT dingqian nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer AT sunyan nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer AT shangjunliang nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer AT lifeng nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer AT zhangyuanyuan nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer AT liujinxing nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer |