Cargando…

UNMF: a unified nonnegative matrix factorization for multi-dimensional omics data

Factor analysis, ranging from principal component analysis to nonnegative matrix factorization, represents a foremost approach in analyzing multi-dimensional data to extract valuable patterns, and is increasingly being applied in the context of multi-dimensional omics datasets represented in tensor...

Descripción completa

Detalles Bibliográficos
Autores principales: Abe, Ko, Shimamura, Teppei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516365/
https://www.ncbi.nlm.nih.gov/pubmed/37478378
http://dx.doi.org/10.1093/bib/bbad253
_version_ 1785109114910670848
author Abe, Ko
Shimamura, Teppei
author_facet Abe, Ko
Shimamura, Teppei
author_sort Abe, Ko
collection PubMed
description Factor analysis, ranging from principal component analysis to nonnegative matrix factorization, represents a foremost approach in analyzing multi-dimensional data to extract valuable patterns, and is increasingly being applied in the context of multi-dimensional omics datasets represented in tensor form. However, traditional analytical methods are heavily dependent on the format and structure of the data itself, and if these change even slightly, the analyst must change their data analysis strategy and techniques and spend a considerable amount of time on data preprocessing. Additionally, many traditional methods cannot be applied as-is in the presence of missing values in the data. We present a new statistical framework, unified nonnegative matrix factorization (UNMF), for finding informative patterns in messy biological data sets. UNMF is designed for tidy data format and structure, making data analysis easier and simplifying the development of data analysis tools. UNMF can handle a wide range of data structures and formats, and works seamlessly with tensor data including missing observations and repeated measurements. The usefulness of UNMF is demonstrated through its application to several multi-dimensional omics data, offering user-friendly and unified features for analysis and integration. Its application holds great potential for the life science community. UNMF is implemented with R and is available from GitHub (https://github.com/abikoushi/moltenNMF).
format Online
Article
Text
id pubmed-10516365
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105163652023-09-23 UNMF: a unified nonnegative matrix factorization for multi-dimensional omics data Abe, Ko Shimamura, Teppei Brief Bioinform Problem Solving Protocol Factor analysis, ranging from principal component analysis to nonnegative matrix factorization, represents a foremost approach in analyzing multi-dimensional data to extract valuable patterns, and is increasingly being applied in the context of multi-dimensional omics datasets represented in tensor form. However, traditional analytical methods are heavily dependent on the format and structure of the data itself, and if these change even slightly, the analyst must change their data analysis strategy and techniques and spend a considerable amount of time on data preprocessing. Additionally, many traditional methods cannot be applied as-is in the presence of missing values in the data. We present a new statistical framework, unified nonnegative matrix factorization (UNMF), for finding informative patterns in messy biological data sets. UNMF is designed for tidy data format and structure, making data analysis easier and simplifying the development of data analysis tools. UNMF can handle a wide range of data structures and formats, and works seamlessly with tensor data including missing observations and repeated measurements. The usefulness of UNMF is demonstrated through its application to several multi-dimensional omics data, offering user-friendly and unified features for analysis and integration. Its application holds great potential for the life science community. UNMF is implemented with R and is available from GitHub (https://github.com/abikoushi/moltenNMF). Oxford University Press 2023-07-20 /pmc/articles/PMC10516365/ /pubmed/37478378 http://dx.doi.org/10.1093/bib/bbad253 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Problem Solving Protocol
Abe, Ko
Shimamura, Teppei
UNMF: a unified nonnegative matrix factorization for multi-dimensional omics data
title UNMF: a unified nonnegative matrix factorization for multi-dimensional omics data
title_full UNMF: a unified nonnegative matrix factorization for multi-dimensional omics data
title_fullStr UNMF: a unified nonnegative matrix factorization for multi-dimensional omics data
title_full_unstemmed UNMF: a unified nonnegative matrix factorization for multi-dimensional omics data
title_short UNMF: a unified nonnegative matrix factorization for multi-dimensional omics data
title_sort unmf: a unified nonnegative matrix factorization for multi-dimensional omics data
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516365/
https://www.ncbi.nlm.nih.gov/pubmed/37478378
http://dx.doi.org/10.1093/bib/bbad253
work_keys_str_mv AT abeko unmfaunifiednonnegativematrixfactorizationformultidimensionalomicsdata
AT shimamurateppei unmfaunifiednonnegativematrixfactorizationformultidimensionalomicsdata