Cargando…

NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer

Pancreatic cancer (PC) is a highly fatal disease, yet its causes remain unclear. Comprehensive analysis of different types of PC genetic data plays a crucial role in understanding its pathogenic mechanisms. Currently, non-negative matrix factorization (NMF)-based methods are widely used for genetic...

Descripción completa

Detalles Bibliográficos
Autores principales: Ding, Qian, Sun, Yan, Shang, Junliang, Li, Feng, Zhang, Yuanyuan, Liu, Jin-Xing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8340025/
https://www.ncbi.nlm.nih.gov/pubmed/34367241
http://dx.doi.org/10.3389/fgene.2021.678642
_version_ 1783733719987650560
author Ding, Qian
Sun, Yan
Shang, Junliang
Li, Feng
Zhang, Yuanyuan
Liu, Jin-Xing
author_facet Ding, Qian
Sun, Yan
Shang, Junliang
Li, Feng
Zhang, Yuanyuan
Liu, Jin-Xing
author_sort Ding, Qian
collection PubMed
description Pancreatic cancer (PC) is a highly fatal disease, yet its causes remain unclear. Comprehensive analysis of different types of PC genetic data plays a crucial role in understanding its pathogenic mechanisms. Currently, non-negative matrix factorization (NMF)-based methods are widely used for genetic data analysis. Nevertheless, it is a challenge for them to integrate and decompose different types of genetic data simultaneously. In this paper, a non-NMF network analysis method, NMFNA, is proposed, which introduces a graph-regularized constraint to the NMF, for identifying modules and characteristic genes from two-type PC data of methylation (ME) and copy number variation (CNV). Firstly, three PC networks, i.e., ME network, CNV network, and ME–CNV network, are constructed using the Pearson correlation coefficient (PCC). Then, modules are detected from these three PC networks effectively due to the introduced graph-regularized constraint, which is the highlight of the NMFNA. Finally, both gene ontology (GO) and pathway enrichment analyses are performed, and characteristic genes are detected by the multimeasure score, to deeply understand biological functions of PC core modules. Experimental results demonstrated that the NMFNA facilitates the integration and decomposition of two types of PC data simultaneously and can further serve as an alternative method for detecting modules and characteristic genes from multiple genetic data of complex diseases.
format Online
Article
Text
id pubmed-8340025
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-83400252021-08-06 NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer Ding, Qian Sun, Yan Shang, Junliang Li, Feng Zhang, Yuanyuan Liu, Jin-Xing Front Genet Genetics Pancreatic cancer (PC) is a highly fatal disease, yet its causes remain unclear. Comprehensive analysis of different types of PC genetic data plays a crucial role in understanding its pathogenic mechanisms. Currently, non-negative matrix factorization (NMF)-based methods are widely used for genetic data analysis. Nevertheless, it is a challenge for them to integrate and decompose different types of genetic data simultaneously. In this paper, a non-NMF network analysis method, NMFNA, is proposed, which introduces a graph-regularized constraint to the NMF, for identifying modules and characteristic genes from two-type PC data of methylation (ME) and copy number variation (CNV). Firstly, three PC networks, i.e., ME network, CNV network, and ME–CNV network, are constructed using the Pearson correlation coefficient (PCC). Then, modules are detected from these three PC networks effectively due to the introduced graph-regularized constraint, which is the highlight of the NMFNA. Finally, both gene ontology (GO) and pathway enrichment analyses are performed, and characteristic genes are detected by the multimeasure score, to deeply understand biological functions of PC core modules. Experimental results demonstrated that the NMFNA facilitates the integration and decomposition of two types of PC data simultaneously and can further serve as an alternative method for detecting modules and characteristic genes from multiple genetic data of complex diseases. Frontiers Media S.A. 2021-07-22 /pmc/articles/PMC8340025/ /pubmed/34367241 http://dx.doi.org/10.3389/fgene.2021.678642 Text en Copyright © 2021 Ding, Sun, Shang, Li, Zhang and Liu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Ding, Qian
Sun, Yan
Shang, Junliang
Li, Feng
Zhang, Yuanyuan
Liu, Jin-Xing
NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer
title NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer
title_full NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer
title_fullStr NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer
title_full_unstemmed NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer
title_short NMFNA: A Non-negative Matrix Factorization Network Analysis Method for Identifying Modules and Characteristic Genes of Pancreatic Cancer
title_sort nmfna: a non-negative matrix factorization network analysis method for identifying modules and characteristic genes of pancreatic cancer
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8340025/
https://www.ncbi.nlm.nih.gov/pubmed/34367241
http://dx.doi.org/10.3389/fgene.2021.678642
work_keys_str_mv AT dingqian nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer
AT sunyan nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer
AT shangjunliang nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer
AT lifeng nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer
AT zhangyuanyuan nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer
AT liujinxing nmfnaanonnegativematrixfactorizationnetworkanalysismethodforidentifyingmodulesandcharacteristicgenesofpancreaticcancer